logoalt Hacker News

tempaccount420yesterday at 9:21 PM1 replyview on HN

Benchmarks are not interesting in deciding the "size class". Bigger size means more knowledge. Also, the Qwen 3.5 27B is a dense 27B active parameter model. StepFun 3.5 Flash has 11B active parameters.


Replies

lostmsuyesterday at 9:55 PM

> Bigger size means more knowledge.

Qwen 3.5 27B beats StepFun 3.5 Flash on GPQA Diamond too, so probably no.