logoalt Hacker News

camgunztoday at 6:55 AM0 repliesview on HN

Hallucination benchmarks accept "I don't know", which Haiku did at least a little. Here are other benchmarks corroborating: https://suprmind.ai/hub/ai-hallucination-rates-and-benchmark...