The lack of broad benchmark reports here makes me curious: has OpenAI reverted to benchmaxxing? Looking forward to hearing opinions once we've all tried both of these out.
The -codex models are only for 'agentic coding', nothing else.