A relevant recent tweet from antirez:

minimaxir • yesterday at 11:48 PM • 4 replies

A relevant recent tweet from antirez: https://x.com/antirez/status/2054854124848415211

> Gentle reminder on how, in the recent DS4 fiesta, not just me but every other contributor found GPT 5.5 able to help immensely and Opus completely useless.

I've noticed the same for lower-level code work where the goal is squeezing out as much performance as possible.

Replies

throwaway041207 • today at 12:38 AM

Assuming we are talking about Code/Codex, are you on API billing or a subscription? I have essentially unlimited API billing at my disposal, and I haven't noticed any degradation in quality across Opus versions.

rjh29 • today at 8:41 AM

There's so much subjectivity with models. As soon as a new model comes out, people act like the model they had been using for the last 6 months was completely useless.

sanxiyn • today at 1:28 AM

There is a benchmark for performance work, and I think model vendors are not optimizing for it. The latest result from GSO is that both Opus 4.6 and 4.7 slightly outperform GPT 5.5. This also matches my experience.

https://gso-bench.github.io/
