The "full-auto" framing is interesting. What happens when the agent hits something it can&...

naomi_kynes • yesterday at 9:14 PM • 5 replies • view on HN

The "full-auto" framing is interesting. What happens when the agent hits something it can't resolve autonomously? Even sandboxed, there's a point where the agent needs to ask a question or get approval.

Most setups handle this awkwardly: fire a webhook, write to a log, hope the human is watching. The sandbox keeps the agent contained, but doesn't give it a clean "pause and ask" primitive. The agent either guesses (risky) or silently fails (frustrating).

Seems like there are two layers: the security boundary (sandbox-exec, containers, etc.) and the communication boundary (how does a contained agent reach the human?). This project nails the first. The second is still awkward for most setups.

Replies

niyikiza • today at 7:21 AM

The two-layer framing is right. Sandbox-exec contains local blast radius, and that's important. But if the agent already has a credential in memory, sandboxing the filesystem doesn't help. I've been working on a primitive for scoped authorization at the tool call level: what was this agent allowed to do, for which task, signed by whom. The core is open-sourced: https://github.com/tenuo-ai/tenuo

e1g • yesterday at 9:48 PM

Correct, this is for skipping permissions (safely), but does nothing for skipping questions.

jamiemallers • today at 9:07 AM

[dead]

10keane • today at 7:39 AM

[dead]

alt Hacker News

Replies