logoalt Hacker News

vidarhtoday at 6:02 PM1 replyview on HN

I've come to the opposite conclusions: The big limitation of systems like this is starting and ending with human involvement at the same level, instead of directing at a higher level. You end up quibbling over detail the agents can handle themselves with sufficient guardrails and process, instead of setting higher level requirements and reviewing higher level decisions and outcomes, and dealing with exceptions.

You can afford a lot of extra guardrails and process to ensure sufficient quality when the result is a system that gets improved autonomously 24/7.

I'm on my way home from a client, and meanwhile another project has spent the last 10 hours improving with no involvement from me. I spent a few minutes reviewing things this morning, after it's spent the whole night improving unattended.


Replies

stingraycharlestoday at 6:35 PM

I find that that doesn’t work in the long run. Software agents are not yet capable of maintaining a decently active repository for extended periods of time.

I am all for delegating everything to AI agents, but it just becomes a mess over time if you don’t steer things often enough.

show 1 reply