logoalt Hacker News

gcrtoday at 1:50 AM6 repliesview on HN

DwarfStar4 is a small LLM inference runtime that can run DeepSeek 4. The blog post implies that it currently requires 96GB of VRAM.

For others who are lacking context :-)


Replies

forestotoday at 1:55 AM

Thanks. Outside of LLM circles, DS4 is usually a video game controller.

show 3 replies
smcleodtoday at 9:41 AM

That's the flash version not the full model and only at Q2-3~ so while impressive it's still quite different from the full model.

show 1 reply
zozbot234today at 6:49 AM

> The blog post implies that it currently requires 96GB of VRAM.

Has anyone tested what happens if you try and run this on lower-RAM Macs? It might work and just be a bit slower as it falls back on fetching model layers from storage.

show 1 reply
Wowfunhappytoday at 10:40 AM

Thanks. How is DwarfStar4 different from llama.cpp?

rpigabtoday at 8:05 AM

I knew Death Stranding 3 wasn't out yet!

DeathArrowtoday at 4:21 AM

>The blog post implies that it currently requires 96GB of VRAM.

From the Github page it seems it only supports Apple and DGX Spark. I have 128 GB of RAM and a 3090 but it probably won't work.

show 2 replies