Hacker News

stavros yesterday at 2:01 PM

Better to go for a less-quantized model, even if it's slower, than for a faster, more heavily quantized one.


Replies

hmokiguess yesterday at 3:36 PM

Thank you!