Hacker News

janalsncm today at 9:37 AM

Number of params isn’t really the relevant metric imo. Top models don’t support local inference. More relevant is tokens per dollar or per second.


Replies

dakolli today at 10:03 AM

It's an open source model, so why wouldn't it be relevant for people who want to self-host?

qeternity today at 1:54 PM

Number of parameters is at least a proxy for model capability.

You can achieve incredible tok/dollar or tok/sec with Qwen3 0.6b.

It just won't be very good for most use cases.
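
To make the two kinds of metric concrete, here is a rough sketch of the arithmetic. The function names, the 2 bytes per parameter figure (16-bit weights), and the throughput and price inputs are assumptions for illustration, not measurements of any real setup.

```python
def weight_memory_gb(num_params: float, bytes_per_param: float = 2.0) -> float:
    """Approximate memory for the weights alone, in decimal GB.

    Ignores KV cache, activations, and runtime overhead.
    """
    return num_params * bytes_per_param / 1e9


def tokens_per_dollar(tokens_per_second: float, dollars_per_hour: float) -> float:
    """Tokens generated per dollar at a steady throughput and hourly cost."""
    return tokens_per_second * 3600 / dollars_per_hour


# A 0.6B-parameter model with 16-bit weights needs roughly 1.2 GB for weights.
small_model_gb = weight_memory_gb(0.6e9)

# Hypothetical inputs: 100 tok/sec on hardware costing $1.00 per hour.
example_tok_per_dollar = tokens_per_dollar(100, 1.0)

print(f"weights: {small_model_gb:.1f} GB")
print(f"tokens per dollar: {example_tok_per_dollar:,.0f}")
```

Neither number says anything about output quality, which is the point above: a tiny model scores very well on both while still being a poor fit for most use cases.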

lm28469 today at 11:24 AM

It is relevant, since you can run this model locally on a machine that costs under $3k.