Why didn't you take into account batching, input tokens, different costs of electricity, and th...

joefourier • today at 6:45 PM • 1 reply • view on HN

Why didn't you take into account batching, input tokens, different costs of electricity, and the fact that a laptop can still hold a decent % of its resale value, and is useful for many other tasks than running an LLM?

Replies

bigyabai • today at 6:47 PM

> Why didn't you take into account [...] the fact that a laptop can still hold a decent % of its resale value, and is useful for many other tasks than running an LLM?

Because that wasn't what they claimed to research?

  >> for inference it's definitely not worth it.

It's entirely fine if you enjoy local LLMs on your computer, there are people doing horribly inefficient inference on smartphones now. But for pure inference tasks, it's pretty obvious why M5s and Mac Studios aren't replacing TPUs and GPUs.

➕ show 1 reply

alt Hacker News

Replies