logoalt Hacker News

verdverm12/11/20251 replyview on HN

Internal evals, Big AI certainly has good, proprietary training and eval data, it's one reason why their models are better


Replies

aydyn12/11/2025

Then publish the results of those internal evals. Public benchmark saturation isn't an excuse to be un-quantitative.

show 1 reply