logoalt Hacker News

bitexploderlast Monday at 6:43 PM0 repliesview on HN

It also comes down to inference speed, not "can I run this". 8-bit quant is quite a bit slower on an M5 Pro.