The problem with agents is they regularly sidestep the guardrails and do what they want with a scrip...

maccard • today at 7:45 AM • 1 reply • view on HN

The problem with agents is they regularly sidestep the guardrails and do what they want with a script anyway. The number of times I’ve seen Claude try to escape the folder it’s working in, and then for it to write a python script that does exactly what I told it it’s not allowed do supports that.

If you use SSO and have an AWS config that Claude is allowed to see to get the correct role in the first place, it will just pick the role and plough on anyway.

Replies

bigstrat2003 • today at 7:56 AM

And this is why it is the height of irresponsibility to run LLMs on your system. We know they are unreliable and just make things up; it's extremely foolish to go "yeah I'm going to let that run commands".

➕ show 1 reply

alt Hacker News

Replies