Hacker News

whynotminot, yesterday at 4:56 AM (2 replies)

Gemini models also consistently hallucinate way more than OpenAI or Anthropic models in my experience.

Just an insane amount of YOLOing. Gemini models have gotten much better but they’re still not frontier in reliability in my experience.


Replies

usaar333, yesterday at 7:11 AM

True, but it gets you higher accuracy. Gemini had the best AA-Omniscience score:

https://artificialanalysis.ai/evaluations/omniscience

cubefox, yesterday at 5:52 AM

In my experience, when I asked Gemini very niche knowledge questions, it did better than GPT-5.1 (I assume 5.2 is similar).
