
vadimf · yesterday at 5:07 PM

I’m 100% sure the future consists of many models running on device. LLMs will be the mobile apps of the future (or a different architecture, but still intelligence).


Replies

ajnin · yesterday at 5:15 PM

The future right now looks more like everything in remote datacenters, no autonomous capabilities and no control by the user. But I like yours better.

pizzafeelsright · yesterday at 7:07 PM

This is the path forward, with some overhead.

1. Generic model that calls other highly specific, smaller, faster models.
2. Models loaded on demand, some black box and some open.
3. There will be a Rust model specifically for Rust (or whatever language) tasks.

In about 5-8 years we will have personalized models, built on all of our previous social/medical/financial data, that respond as we would: a clone, capable of making decisions aligned with our desired outcomes.

The big remaining blocker is that generic model that can be imprinted with specifics and rebuilt nightly: not the training material itself, but the decision-making, recall, and evaluation parts. I am curious whether anyone is working on that extracted portion, something that could serve as just a 'thinking' interface.

coldtea · yesterday at 7:44 PM

If anything, memory ain't getting cheaper, disks aren't either, and as for graphics cards, forget it.

People won't be competing with even a current 2026 SOTA from their home LLM anytime soon. Even the actual SOTA LLM providers aren't competing on cost: they're losing money on energy and compute, hoping to make it up on market capture and win the IPO races.
