Hacker News

tempoponet today at 3:24 AM

It's fine for dense models, where you need the whole model in VRAM, less so for MoE, where you're offloading layers to RAM. But 32/32 is pretty good for both in the popular ~30B range right now.
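
A rough back-of-the-envelope sketch of the sizing, assuming the weights dominate memory and ignoring KV cache and runtime overhead. The function names and the 4-, 8-, and 16-bit widths are illustrative assumptions, not figures from this thread:

```python
def weight_footprint_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GB (10^9 bytes)."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


def fits(params_billions: float, bits_per_weight: float, budget_gb: float) -> bool:
    """True if the weights alone fit within the given memory budget."""
    return weight_footprint_gb(params_billions, bits_per_weight) <= budget_gb


if __name__ == "__main__":
    # Hypothetical 30B-parameter model against a 32 GB budget.
    for bits in (4, 8, 16):
        size = weight_footprint_gb(30, bits)
        print(f"30B @ {bits}-bit: {size:.0f} GB, fits in 32 GB: {fits(30, bits, 32)}")
```

Under these assumptions a 30B model's weights take about 15 GB at 4-bit and 30 GB at 8-bit, so either fits in 32 GB on paper, while 16-bit weights (about 60 GB) do not. Real headroom is smaller once context and overhead are counted.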


Replies

xxs today at 11:33 AM

Running a 5090 with 32GB of RAM is still just weird.