The best benchmarks are the ones you create yourself. Its not my experience Opus is leagues ahead ...

root-parent • today at 10:39 AM • 2 replies • view on HN

The best benchmarks are the ones you create yourself.

Its not my experience Opus is leagues ahead or even superior, but in any case, since GPT 5.5 has Instant, Medium, High, Extra High and Pro...Should the comparison be with GPT on Pro, instead of Extra High as it seems to be the case in the table?

Replies

d4rkp4ttern • today at 10:45 AM

I didn’t know you could get the “Chat-GPT-5.5 Pro” (the one that’s been solving Erdos problems) inside codex-cli, or maybe I misunderstood?

Terretta • today at 10:40 AM

And, in turn, Opus with ultracode?

alt Hacker News

Replies