logoalt Hacker News

cactusplant7374yesterday at 3:30 AM0 repliesview on HN

Probably not. Everyone will still need a lot of reasoning tokens and tool calls. Running the tests for every round is tiring but must be done.