logoalt Hacker News

zihotkitoday at 8:09 AM1 replyview on HN

For coding it makes no sense to use any quantization worse than Q6_K, from my experience. More quantized models make more mistakes and if for text processing it still can be fine, for coding it's not.


Replies

segmondytoday at 1:28 PM

I don't think most people realize that. Quality of tokens beats quantity of token. I always tell folks to go as high a quant as you can only go lower if you just don't have the memory capacity.

show 1 reply