Hacker News

miki123211, last Sunday at 1:56 PM

> AI companies want adcopy, not legitimate benchmarks.

Labs need accurate benchmark measurements, at least internally, to figure out what model improvements actually matter.

Having models exploit benchmarks serves no purpose. If they wanted to make their models look better than they are, they could just make the data up.