Hacker News
WhitneyLand
•
today at 4:07 PM
It's a bit misleading to say they take 14x less memory; no one is doing inference with 16-bit models.
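To make the baseline point concrete, here is a rough sketch of how the same claim shrinks against a lower-precision baseline. It assumes the 14x figure is quoted against 16-bit weights and that weight memory scales linearly with bits per weight (ignoring activations, KV cache, and quantization metadata). The 8-bit and 4-bit baselines and the function name `ratio_vs_baseline` are illustrative assumptions, not figures from the original claim.

```python
def ratio_vs_baseline(claimed_ratio: float,
                      claimed_baseline_bits: float,
                      new_baseline_bits: float) -> float:
    """Rescale a memory-reduction ratio quoted against one baseline to another.

    Hypothetical helper: assumes memory is proportional to bits per weight.
    """
    # Bits per weight implied by the original claim (16 / 14 is about 1.14).
    effective_bits = claimed_baseline_bits / claimed_ratio
    # Reduction relative to the baseline people actually deploy.
    return new_baseline_bits / effective_bits


if __name__ == "__main__":
    # 8-bit and 4-bit are assumed comparison points for illustration only.
    for bits in (16, 8, 4):
        ratio = ratio_vs_baseline(14, 16, bits)
        print(f"vs {bits}-bit baseline: {ratio:.2f}x")
```

Under those assumptions the saving is 7x against an 8-bit baseline and 3.5x against a 4-bit one, which is still a real saving but a much smaller headline number.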