
DiabloD3 • yesterday at 2:17 PM • 0 replies • view on HN

Same calculation, basically. Any given ~30B model is going to use the same VRAM (assuming you load it all into VRAM, which MoEs do not need to do) and is going to be the same size.
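For a rough sense of that calculation, here is a back-of-the-envelope sketch. It assumes the weights dominate and ignores KV cache, activations, and runtime overhead. The function name and the bit widths are illustrative picks of mine, not anything from a specific runtime.

```python
def weight_memory_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GiB.

    Assumption: every parameter is stored at the same bit width, and
    nothing else (KV cache, activations, overhead) is counted.
    """
    bytes_total = n_params * bits_per_weight / 8
    return bytes_total / 2**30


if __name__ == "__main__":
    # Only the total parameter count goes in, so a dense ~30B model and
    # a ~30B-total MoE come out the same if fully loaded.
    for bits in (16, 8, 4):
        print(f"~30B at {bits}-bit: {weight_memory_gib(30e9, bits):.1f} GiB")
```

Architecture never enters the formula, only total parameter count and bits per weight, which is the point being made above.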
