Hacker News

armanj yesterday at 2:12 PM, 4 replies

I recall a Qwen exec posting a public poll on Twitter asking which Qwen3.6 model people wanted to see open-sourced, and the 27B variant was by far the most popular choice. Not sure why they ignored it lol.


Replies

zozbot234 yesterday at 2:38 PM

The 27B model is dense. Releasing a dense model first would be terrible marketing, whereas 35A3B is a lot smarter and more quick-witted by comparison!

throwdbaaway yesterday at 10:58 PM

Going by the earlier 3.5 release schedule, my optimistic take is that they distill the small models from the 397B, and a sparse A3B model is much faster to distill. Hopefully the other variants will be released in the coming days.

arunkant yesterday at 2:46 PM

Probably coming next

zkmon yesterday at 2:51 PM

I'm guessing 3.5-27B would beat 3.6-35B. MoE is a bad idea: for the same VRAM, the 27B would leave a lot more room for context, and the quality of work depends directly on context size, not just the "B" number.
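
To put rough numbers on the VRAM argument, here is a back-of-the-envelope sketch. It assumes 4-bit quantized weights and a 24 GB card, both of which are illustrative figures that do not come from the thread, and it ignores activations, runtime overhead, and how much memory each token of context actually costs. The function names are hypothetical. It also assumes all MoE experts stay resident in VRAM, so the full 35B counts toward memory even though only about 3B parameters are active per token.

```python
# Rough weight-memory estimate; all constants below are assumptions.

def weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


def headroom_gb(params_billion: float, vram_gb: float = 24.0,
                bits_per_weight: float = 4.0) -> float:
    """VRAM left over for KV cache and everything else after loading weights."""
    return vram_gb - weight_gb(params_billion, bits_per_weight)


if __name__ == "__main__":
    # Total parameter counts in billions, taken from the model names.
    for name, total_b in [("27B dense", 27.0), ("35B-A3B MoE", 35.0)]:
        print(f"{name}: weights ~{weight_gb(total_b, 4.0):.1f} GB, "
              f"headroom ~{headroom_gb(total_b):.1f} GB")
```

Under these assumptions the dense 27B leaves about 4 GB more for context than the 35B MoE. Whether that extra room outweighs the MoE's speed advantage depends on the workload and on per-token KV-cache cost, which this sketch does not model.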
