Hacker News

Archit3ch, yesterday at 12:13 PM

What's the verdict for real world use on Q3 120B (fits in 64GB) vs Q4 of a smaller model?


Replies

FuckButtons, yesterday at 7:02 PM

Bigger model wins as long as the quantization was done properly.
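
On the "fits in 64GB" part of the question, a rough back-of-the-envelope check may help. This is only a sketch: the effective bits per weight (3.5 for a Q3-style quant, 4.5 for a Q4-style quant) and the 70B size for the "smaller model" are assumptions chosen for illustration, not figures from the thread, and the estimate covers weights only (no KV cache, activations, or runtime overhead).

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumed effective bits per weight; real quantization formats vary.
Q3_BITS = 3.5  # assumption
Q4_BITS = 4.5  # assumption

q3_120b = weight_memory_gb(120e9, Q3_BITS)  # the 120B model from the question
q4_70b = weight_memory_gb(70e9, Q4_BITS)    # hypothetical smaller model

print(f"Q3 120B: ~{q3_120b:.1f} GB of weights")
print(f"Q4 70B:  ~{q4_70b:.1f} GB of weights")
print("120B fits in 64 GB (weights only):", q3_120b < 64)
```

Under these assumptions the 120B weights come to about 52.5 GB, which leaves some headroom in 64GB for context and overhead. How much headroom is actually needed depends on the runtime and context length.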