Hacker News

XCSme, last Friday at 10:37 AM

But most benchmarks are not about that...

Are there even any public "hallucination" benchmarks?


Replies

andrepd, last Friday at 11:13 AM

"Benchmarks" for LLMs are a total hoax, since you can train the models on the benchmarks themselves.
