logoalt Hacker News

nixon_why69today at 11:57 AM1 replyview on HN

Qwen3.6 supports 266k context out of the box. Try using q8 kv cache to enable more of it.


Replies

gchamonlivetoday at 12:09 PM

I limited it to 64k expecting 24GB vram to not be enough to make use of the entire context window, but I'll try with other's suggestions.