logoalt Hacker News

root-parenttoday at 10:39 AM2 repliesview on HN

The best benchmarks are the ones you create yourself.

Its not my experience Opus is leagues ahead or even superior, but in any case, since GPT 5.5 has Instant, Medium, High, Extra High and Pro...Should the comparison be with GPT on Pro, instead of Extra High as it seems to be the case in the table?


Replies

d4rkp4tterntoday at 10:45 AM

I didn’t know you could get the “Chat-GPT-5.5 Pro” (the one that’s been solving Erdos problems) inside codex-cli, or maybe I misunderstood?

Terrettatoday at 10:40 AM

And, in turn, Opus with ultracode?