logoalt Hacker News

spwa4today at 6:07 AM0 repliesview on HN

Turning up the thinking (max time spent thinking) lever really changes model performance, even for tiny models. But it's really irritating because it adds a lot of time.