logoalt Hacker News

brianush1yesterday at 4:06 AM5 repliesview on HN

claude is stupid but not malicious; chroot is sufficient


Replies

furyofantaresyesterday at 4:43 AM

I've many times seen Claude try to execute a command that it's not supposed to, the harness prevents it, and then it writes and executes a python script to do it.

show 1 reply
fl7305yesterday at 6:59 PM

Sure, it's not malicious. But it is very eager to get things done, and surprisingly inventive and knowledgeable in all kinds of workarounds.

nofriendyesterday at 4:16 AM

Malice is not required. If it thinks it is in the right, then it will do whatever it takes to get around limitations.

lxgryesterday at 11:08 AM

Until it gets prompt injected. Are you reading every single file your agent reads as part of the tasks you give it, including content fetched from the web or third-party packages?

karhagbayesterday at 4:20 AM

Claude is far from stupid from my experience. I've used so many models and Claude is king.