logoalt Hacker News

DoctorOetkeryesterday at 7:46 PM2 repliesview on HN

Is there a reason GPU's don't use insane "blocks" of sdcard slots (for massively parallel io) so the model weights don't need to pass through a limited PCI bus?


Replies

Neywinyyesterday at 7:59 PM

Yes. Let's do the math. The fastest sd cards can read at around 300 MB/s (https://havecamerawilltravel.com/fastest-sd-cards/). Modern GPUs use 16 lanes of PCIe gen 5, which is 16x32Gb/s = 512Gb/s = 64 GB/s. Meaning you'd need over 200 of the fastest SD cards. So what you're asking is: is there a reason GPUs don't use 200 SD cards? And I can't think of any way that would work

show 1 reply
numpad0yesterday at 9:57 PM

Maybe latency. IIRC flash is a lot laggier than DRAMs and SRAMs.