Number of params isn’t really the relevant metric imo. Top models don’t support local inference. Mor...

janalsncm • today at 9:37 AM • 3 replies • view on HN

Number of params isn’t really the relevant metric imo. Top models don’t support local inference. More relevant is tokens per dollar or per second.

dakolli • today at 10:03 AM

Its an open source model, why wouldn't it be relevant for people who want to self host.....

qeternity • today at 1:54 PM

Number of parameters is at least a proxy for model capability.

You can achieve incredible tok/dollar or tok/sec with Qwen3 0.6b.

It just won't be very good for most use cases.

➕ show 1 reply

lm28469 • today at 11:24 AM

It does since you can run this model locally on a < $3k machine

alt Hacker News