zihotki • today at 8:09 AM

In my experience, it makes no sense to use any quantization lower than Q6_K for coding. More heavily quantized models make more mistakes; that can still be fine for text processing, but not for coding.

Replies

segmondy • today at 1:28 PM

I don't think most people realize that. Quality of tokens beats quantity of tokens. I always tell folks to go as high a quant as you can, and only go lower if you just don't have the memory capacity.
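
To make the "highest quant that fits" rule concrete, here's a rough sketch in Python. The bits-per-weight figures are approximate values for llama.cpp quant types, and the helper names (`weights_gib`, `pick_quant`) and the 2 GiB allowance for KV cache and runtime buffers are my own assumptions, so treat the result as a ballpark, not a guarantee that a given file will load.

```python
# Approximate bits per weight for a few llama.cpp quant types.
# Rounded, assumed figures; real GGUF files vary by model architecture.
APPROX_BPW = {
    "Q8_0": 8.5,
    "Q6_K": 6.56,
    "Q5_K_M": 5.7,
    "Q4_K_M": 4.85,
}


def weights_gib(n_params_billion, bpw):
    """Estimated size of the weights alone, in GiB."""
    return n_params_billion * 1e9 * bpw / 8 / 2**30


def pick_quant(n_params_billion, mem_gib, overhead_gib=2.0):
    """Return the highest-precision quant whose weights fit in memory.

    overhead_gib is a hypothetical allowance for KV cache and buffers.
    Returns None when even the smallest listed quant does not fit.
    """
    budget = mem_gib - overhead_gib
    by_precision = sorted(APPROX_BPW.items(), key=lambda kv: kv[1], reverse=True)
    for name, bpw in by_precision:
        if weights_gib(n_params_billion, bpw) <= budget:
            return name
    return None


if __name__ == "__main__":
    for size in (7, 32, 70):
        print(f"{size}B model on 24 GiB:", pick_quant(size, 24))
```

The loop walks from the highest precision down and stops at the first quant that fits, which is exactly the "only go lower if you don't have the memory" rule. Longer contexts need a bigger `overhead_gib`.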
