claude is stupid but not malicious; chroot is sufficient

brianush1 • yesterday at 4:06 AM • 5 replies • view on HN

Replies

furyofantares • yesterday at 4:43 AM

I've many times seen Claude try to execute a command that it's not supposed to, the harness prevents it, and then it writes and executes a python script to do it.

➕ show 1 reply

fl7305 • yesterday at 6:59 PM

Sure, it's not malicious. But it is very eager to get things done, and surprisingly inventive and knowledgeable in all kinds of workarounds.

nofriend • yesterday at 4:16 AM

Malice is not required. If it thinks it is in the right, then it will do whatever it takes to get around limitations.

lxgr • yesterday at 11:08 AM

Until it gets prompt injected. Are you reading every single file your agent reads as part of the tasks you give it, including content fetched from the web or third-party packages?

karhagba • yesterday at 4:20 AM

Claude is far from stupid from my experience. I've used so many models and Claude is king.

alt Hacker News

Replies