logoalt Hacker News

selcukatoday at 6:07 AM1 replyview on HN

It's ok if they never release a BF16 model, but it's less ok if they release it, win the benchmarks, then quantise it after a few weeks.


Replies

retinarostoday at 10:15 AM

that is for sure what everyone does. also they train on evals with the datasets that they would be bench against.