But most benchmarks are not about that...
Are there even any public "hallucination" benchmarks?
"Benchmarks" for LLMs are a total hoax, since you can train the models on the benchmarks themselves.