That's just one benchmark, though. Tab to the next one and Sonnet 5 performs better as effort goes up just as you'd expect. I imagine the suggestion is that performance vs effort tradeoff is task dependent.
No it doesn't? It's worse than Opus across the whole shared frontier on both plots.
No it doesn't? It's worse than Opus across the whole shared frontier on both plots.