Hacker News

MagicMoonlight, last Thursday at 6:59 PM

They’re definitely just training the models on the benchmarks at this point


Replies

roxolotl, last Thursday at 7:05 PM

Yeah, either this is an incredible jump or we’ve finally gotten confirmation that benchmarks are BS.