DwarfStar4 is a small LLM inference runtime that can run DeepSeek 4. The blog post implies that it currently requires 96GB of VRAM.
For others who are lacking context :-)
That's the flash version, not the full model, and only at roughly Q2-3 quantization, so while impressive it's still quite different from the full model.
> The blog post implies that it currently requires 96GB of VRAM.
Has anyone tested what happens if you try to run this on lower-RAM Macs? It might still work, just more slowly, if it falls back to fetching model layers from storage.
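If the runtime memory-maps the weight file (as llama.cpp does by default), the OS can page layers in from disk on demand instead of requiring everything to fit in RAM. A minimal Python sketch of that mechanism, with a made-up weights file standing in for a real model:

```python
import mmap
import os
import struct
import tempfile

# Write a fake "weights" file: 1024 float32 values (4 KiB).
with tempfile.NamedTemporaryFile(delete=False, suffix=".bin") as f:
    f.write(struct.pack("<1024f", *[float(i) for i in range(1024)]))
    path = f.name

# Memory-map the file read-only. Nothing is loaded up front; the OS
# faults pages in only when a region is actually touched, which is why
# a model larger than RAM can still run (slowly) from storage.
with open(path, "rb") as f:
    mm = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)
    # Touch one pretend "layer" (floats 512..515): only those pages load.
    layer = struct.unpack_from("<4f", mm, offset=512 * 4)
    mm.close()

os.unlink(path)
print(layer)  # (512.0, 513.0, 514.0, 515.0)
```

Whether DwarfStar4 actually does this on macOS is a separate question; if it insists on wiring everything into unified memory, a lower-RAM Mac would just fail to load the model.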
Thanks. How is DwarfStar4 different from llama.cpp?
I knew Death Stranding 3 wasn't out yet!
> The blog post implies that it currently requires 96GB of VRAM.
From the GitHub page, it seems to only support Apple and DGX Spark hardware. I have 128 GB of RAM and a 3090, but it probably won't work for me.
Thanks. Outside of LLM circles, DS4 is usually a video game controller.