The lack of broad benchmark reports in this makes me curious: Has OpenAI reverted to benchmaxxing? L...

purplerabbit • yesterday at 6:29 PM • 1 reply • view on HN

The lack of broad benchmark reports in this makes me curious: Has OpenAI reverted to benchmaxxing? Looking forward to hearing opinions once we all try both of these out

Replies

MallocVoidstar • yesterday at 7:24 PM

The -codex models are only for 'agentic coding', nothing else.

➕ show 1 reply

alt Hacker News

Replies