logoalt Hacker News

blitz_skulllast Friday at 1:10 AM2 repliesview on HN

Again I just tap the sign.

All of your benchmarks mean nothing to me until you include Claude Sonnet on them.

In my experience, GPT hasn’t been able to compete with Claude in years for the daily “economically valuable” tasks I work on.


Replies

jstummbilliglast Friday at 3:46 PM

Since as per Anthropics own benchmarks Sonnet 4.5 is beaten by Opus 4.5 would it not suffice to infer the rest?

https://x.com/OpenAI/status/1999182104362668275

nextworddevlast Friday at 4:44 AM

Claude is pretty trash for anything besides coding

show 3 replies