WhitneyLand, today at 4:07 PM

It's a bit misleading to say they take 14x less memory; no one is doing inference with 16-bit models.
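
To put rough numbers on it, here is a back-of-the-envelope sketch. It assumes the 14x figure is measured against 16-bit weights and that weight memory scales linearly with bits per weight. The 8-bit and 4-bit baselines are illustrative choices of mine, not something from the original claim.

```python
# Assumption: the claimed 14x reduction is relative to 16-bit weights.
CLAIMED_RATIO = 14.0
CLAIMED_BASELINE_BITS = 16


def ratio_vs(baseline_bits: float) -> float:
    """Memory reduction relative to a baseline with the given bits per weight.

    Assumes weight memory is proportional to bits per weight.
    """
    return CLAIMED_RATIO * baseline_bits / CLAIMED_BASELINE_BITS


# Illustrative baselines: 16-bit (as claimed), plus 8-bit and 4-bit quantization.
for bits in (16, 8, 4):
    print(f"vs {bits}-bit baseline: {ratio_vs(bits):.1f}x less memory")
```

Under those assumptions, the same model is 7x smaller than an 8-bit baseline and 3.5x smaller than a 4-bit one, so the headline number depends heavily on which baseline you pick.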