logoalt Hacker News

Sammiyesterday at 10:51 AM1 replyview on HN

1. Don't implement too much at at time

2. Have the agent review if it followed the plan and relevant skills accurately.


Replies

irthomasthomasyesterday at 11:03 AM

the first link was from a simple request with fewer than 1000 tokens total in the context window, just a short shell script.

here is another one which had about 200 tokens and opus decided to change the model name i requested.

https://x.com/xundecidability/status/2005647216741105962?s=2...

opus is bad at instruction following now.