> it also would use less electricity
How would it use less electricity? I’d like to learn more.
That's not true: an LLM running on device would use MORE electricity. Service providers running batch>1 inference are far more efficient per watt. Local inference is limited to batch=1, which is very inefficient.
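A rough back-of-envelope sketch of why batching matters (all numbers below are hypothetical, just to show the shape of the argument): decode is largely memory-bandwidth-bound, so each step pays roughly the same cost to stream the model weights whether it serves one request or many, and a batched server amortizes that cost across every token in the batch.

```python
# Illustrative sketch, not measurements: assume each decode step pays a fixed
# energy cost to stream the weights, plus a small marginal cost per token in the batch.
WEIGHT_READ_JOULES_PER_STEP = 5.0   # hypothetical energy to stream weights once per step
PER_TOKEN_COMPUTE_JOULES = 0.05     # hypothetical marginal energy per token in the batch

def joules_per_token(batch_size: int) -> float:
    """Energy per generated token when one decode step serves `batch_size` requests."""
    step_energy = WEIGHT_READ_JOULES_PER_STEP + PER_TOKEN_COMPUTE_JOULES * batch_size
    return step_energy / batch_size

for b in (1, 8, 32):
    print(f"batch={b:>2}: ~{joules_per_token(b):.2f} J/token")

# batch= 1: ~5.05 J/token
# batch= 8: ~0.68 J/token
# batch=32: ~0.21 J/token
```

Under these toy assumptions, the batch=1 case (local inference) burns roughly 25x the energy per token of a batch=32 server; the real ratio depends on the hardware and model, but the direction is the same.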