logoalt Hacker News

andrepdlast Friday at 11:13 AM1 replyview on HN

"Benchmarks" for LLMs are a total hoax, since you can train them on the benchmarks themselves.


Replies

XCSmelast Friday at 7:40 PM

I would assume a good benchmark has hidden tests, or something randomly generated that is harder to game