logoalt Hacker News

vladguryesterday at 10:00 PM1 replyview on HN

Curious which models are you able to run and how many 3090s do they require at scale?


Replies

mips_avataryesterday at 10:20 PM

4 3090s with nvlinks on each pair. Super fast inference on Moe models around 20-36b