logoalt Hacker News

stymaaryesterday at 5:48 PM3 repliesview on HN

Anyone calling Qwen3.6-35B-A3B-Q4_K_XL “rubish” has no idea what they are talking about.


Replies

embedding-shapeyesterday at 6:06 PM

I'd agree that the quality degrades a lot between Q8 and Q4, borderline unusable as they start to fail with tool calling syntax even. Personally I'd say Q8 is as low as you want to go.

c0rruptbytesyesterday at 6:00 PM

q4 isn't rubbish, but it's a compromise for a good value, q6 is essentially a no-compromise quantization and it's what i recommend for MoEs in my experience for agentic workflows

greenavocadoyesterday at 5:51 PM

He's probably calling me out for this comment https://news.ycombinator.com/item?id=48557579