Gemini models also consistently hallucinate way more than OpenAI or Anthropic models, in my experience.
Just an insane amount of YOLOing. Gemini models have gotten much better but they’re still not frontier in reliability in my experience.
In my experience, when I asked Gemini very niche knowledge questions, it did better than GPT-5.1 (I assume 5.2 is similar).
True, but it gets you higher accuracy. Gemini had the best AA-Omniscience score:
https://artificialanalysis.ai/evaluations/omniscience