DiabloD3, yesterday at 2:17 PM

Same calculation, basically. Any given ~30B model is going to use the same VRAM (assuming you load it all into VRAM, which MoEs do not need to do), and is going to be the same size
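For a rough idea of what that calculation looks like, here is a minimal sketch. It assumes weight memory is simply parameter count times bits per weight, and it ignores KV cache, activations, and runtime overhead. The function name `weight_bytes` and the bit widths shown are illustrative choices, not anything from a specific runtime.

```python
GIB = 1024 ** 3  # bytes per GiB


def weight_bytes(n_params: float, bits_per_weight: float) -> float:
    """Approximate bytes needed to hold the weights alone.

    Assumption: every parameter is stored at the same bit width.
    KV cache, activations, and framework overhead are not included.
    """
    return n_params * bits_per_weight / 8


if __name__ == "__main__":
    n_params = 30e9  # a "~30B" model, dense or MoE, total parameter count
    for bits in (16, 8, 4):
        gib = weight_bytes(n_params, bits) / GIB
        print(f"30B params at {bits}-bit: ~{gib:.1f} GiB for weights")
```

The estimate depends only on total parameter count and bit width, which is why two ~30B models at the same quantization come out the same size whether they are dense or MoE.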