logoalt Hacker News

fleventyninetoday at 2:29 PM1 replyview on HN

If local models are good enough, doesn't that increase demand for DRAM as everyone buys DRAM for their poorly utilized local machines?

Surely it is a more efficient use of DRAM to run inference on shared hardware with large batch sizes and more utilization.


Replies

szatkustoday at 3:21 PM

Luckily very few people can configure and are interested in local models. But your nearby datacenter running Chinese open-weight models is also good enough.

show 1 reply