GPT 5.2 loses at everything but they included that
Who are they supposed to compare it to? I'm not sure what makes you think that Grok is even remotely comparable to the frontier models right now.
Who are they supposed to compare it to? I'm not sure what makes you think that Grok is even remotely comparable to the frontier models right now.