Hacker News

chillfox, yesterday at 7:59 AM, 2 replies

Isn’t this pretty much how everyone uses agents?

Feels like it’s a lot of words to say what amounts to "make the agent follow the steps we know work well for building software."


Replies

hrmtst93837, yesterday at 8:58 PM

I think most of these writeups are packaging familiar engineering moves into LLM-shaped language. In my experience the real value is operational: explicit tool interfaces, idempotent steps, checkpoints, and durable workflows run in Temporal or Airflow, with Playwright for browser tasks and a vector DB for state so you can replay and debug failures. The tradeoff is extra latency, token cost, and engineering overhead, so expect to spend most of your time on retries, schema validation, and monitoring rather than on clever prompt hacks. Use function calling or JSON schemas to keep tool outputs predictable.
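
To make the idempotent-step, checkpoint, and schema-validation points concrete, here is a minimal sketch in plain Python. It is not how Temporal or Airflow implement any of this; it only illustrates the idea with a JSON file per step. The names `run_step`, `validate`, and `flaky_tool`, the `fetch_page` step, and the dict-of-types "schema" are all hypothetical stand-ins.

```python
import json
import tempfile
from pathlib import Path


def validate(output, required):
    """Check that a tool output is a dict whose fields have the expected types."""
    if not isinstance(output, dict):
        raise ValueError("tool output must be a JSON object")
    for key, expected_type in required.items():
        if not isinstance(output.get(key), expected_type):
            raise ValueError(f"field {key!r} missing or not {expected_type.__name__}")
    return output


def run_step(name, tool, required, checkpoint_dir, max_attempts=3):
    """Run a step, retrying on invalid output, and record the result on disk."""
    checkpoint = Path(checkpoint_dir) / f"{name}.json"
    if checkpoint.exists():
        # Idempotent replay: reuse the recorded result instead of re-running.
        return json.loads(checkpoint.read_text())
    last_error = None
    for attempt in range(1, max_attempts + 1):
        try:
            result = validate(tool(attempt), required)
        except ValueError as err:
            last_error = err  # malformed output: try again
            continue
        checkpoint.write_text(json.dumps(result))
        return result
    raise RuntimeError(f"step {name!r} failed after {max_attempts} attempts") from last_error


def flaky_tool(attempt):
    """Hypothetical tool that returns malformed output on its first attempt."""
    if attempt == 1:
        return {"title": None}
    return {"title": "Example", "links": 3}


calls = []


def counting_tool(attempt):
    """Wrap flaky_tool so we can see how often it is actually invoked."""
    calls.append(attempt)
    return flaky_tool(attempt)


workdir = tempfile.mkdtemp()
schema = {"title": str, "links": int}  # assumed shape of the tool output
first = run_step("fetch_page", counting_tool, schema, workdir)
second = run_step("fetch_page", counting_tool, schema, workdir)  # served from checkpoint
```

The second call never touches the tool, which is what makes replaying a failed run cheap to debug. A real setup would likely swap the file checkpoint for the workflow engine's own durable state and the type check for a proper JSON Schema validator.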

sosoeleken, yesterday at 10:27 AM

G is posting this slop so Anthropic sends him his dinner invitation this month, give him a break.