Tiny model overfit on benchmark published 3 years prior to its training. News at 10

100ms • today at 6:01 PM • 3 replies

selimthegrim • today at 6:46 PM

It wasn't important enough to make the 11 o'clock program.

bigyabai • today at 6:02 PM

But GPT-3.5 was benchmaxxing too.

srslyTrying2hlp • today at 6:28 PM

[dead]
