The hugging face models are already up and seem to be the original models with the speculative decod...

kamranjon • today at 11:16 AM • 1 reply • view on HN

The hugging face models are already up and seem to be the original models with the speculative decoding module built in which is very cool:

Flash: https://huggingface.co/deepseek-ai/DeepSeek-V4-Flash-DSpark

Pro: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro-DSpark

Excited to see if this makes it into DwarfStar for local inference, have been using the flash model extensively since the 2-bit quants were made available by antirez.

Replies

ilaksh • today at 2:02 PM

Any chance they will have this for Qwen 27 b also?

➕ show 1 reply

alt Hacker News

Replies