logoalt Hacker News

guilamuyesterday at 8:22 PM1 replyview on HN

Yes Opus 4.7 fast (no reasoning) did a worst job than Sonnet 4.6 high (with reasoning) according to Gemini 3.1 Pro evaluation.


Replies

ac29yesterday at 8:32 PM

Your table doesn't indicate reasoning vs non-reasoning, or reasoning level

show 1 reply