logoalt Hacker News

cpburns2009yesterday at 1:37 PM2 repliesview on HN

Does llama.cpp support Qwen3.5 yet? When I tried it before, it failed saying "qwen35moe" is an unsupported architecture.


Replies

hnfongyesterday at 2:42 PM

Yes, but make sure you grab the latest llama.cpp release

New model archs usually involve code changes.

show 1 reply
reactordevyesterday at 1:42 PM

You would need the Dynamic 2.0 GGUF as discussed in the article.

But mmmmmm, Q8_K_XL looks mighty nice.