If “speculative” approach works so well in different contexts why not make it first class and use ev...

mirekrusin • today at 8:38 AM • 2 replies • view on HN

If “speculative” approach works so well in different contexts why not make it first class and use everywhere, possibly recursively?

Replies

saagarjha • today at 10:55 AM

Speculation is only worth it if you can profit from it. Not every context allows this or has a similar idea of what can be speculated.

➕ show 1 reply

doctorpangloss • today at 5:10 PM

Multi-token prediction is a good enhancement to training. It isn't necessarily useful for inference. Other speculative decoding like EAGLE is. It is specific to the technology and the authors of these things write about it.

alt Hacker News

Replies