logoalt Hacker News

Jackobrientoday at 10:02 AM3 repliesview on HN

I see a world soon where there’s an extremely wide variety of small models for speculative decoding, unique to use cases, companies, and even individuals.


Replies

niccetoday at 10:13 AM

Hopefully that is the case and hardware does not get impossible to get.

pydrytoday at 10:24 AM

yes, heavily constrained by sophisticated guardrails.

this is definitely where things are going. the enormous "eat the world" models have extreme diminishing returns by comparison.

Der_Einzigetoday at 2:54 PM

You clearly didn't read the recent speculative decoding papers because it's been possible to use any model to speculate for any other model for awhile. They solved the tokenization problems that prevented this in the past.