My point is that it is WAY more efficient to put the world's DRAM supply into a shared inference pool instead of stranding it in local machines, where it won't see as high a batch size or utilization.
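To make the batching point concrete, here's a rough back-of-the-envelope sketch. The numbers (a 16 GB model footprint, ~0.5 GB of KV cache per concurrent request) are illustrative assumptions, not measurements; the point is just that a shared pool amortizes one resident copy of the weights across many requests, while a local machine pays the full weight footprint for a batch of one.

```python
# Illustrative assumptions only -- not real measurements.
MODEL_GB = 16.0  # assumed DRAM footprint of the model weights
KV_GB = 0.5      # assumed per-request KV-cache footprint

def dram_per_request(batch_size: int) -> float:
    """DRAM attributable to each request when batch_size
    concurrent requests share one resident copy of the weights."""
    return MODEL_GB / batch_size + KV_GB

print(dram_per_request(1))   # local machine, batch of 1  -> 16.5 GB/request
print(dram_per_request(64))  # pooled server, batch of 64 -> 0.75 GB/request
```

Under these assumptions the pooled server needs roughly 20x less DRAM per request, which is the efficiency gap the argument rests on.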
The cost of that inefficiency is even higher DRAM prices than we already have, given supply and demand.
On the other hand, much of the world's DRAM stock is sitting idle in consumers' local machines and on-prem servers. If that DRAM gets some use, even "inefficiently", that's a meaningful decrease in demand.