logoalt Hacker News

mirekrusintoday at 8:38 AM2 repliesview on HN

If “speculative” approach works so well in different contexts why not make it first class and use everywhere, possibly recursively?


Replies

saagarjhatoday at 10:55 AM

Speculation is only worth it if you can profit from it. Not every context allows this or has a similar idea of what can be speculated.

show 1 reply
doctorpanglosstoday at 5:10 PM

Multi-token prediction is a good enhancement to training. It isn't necessarily useful for inference. Other speculative decoding like EAGLE is. It is specific to the technology and the authors of these things write about it.