logoalt Hacker News

utopiahtoday at 12:05 PM1 replyview on HN

Honestly I don't get the point but if you want to explore that, both on desktop, mobile or headless server Linux allows you to try it.

You can run ollama with whatever you want on a Debian in literally minutes. You can even do that within a virtual machine using e.g. QEMU, so that you can do all the tests you need risk free.

Again I don't understand what that would enable that can't be done today but it's perfectly fine, you can try today anyway, no need to ask permission to anyone.


Replies

josalhortoday at 12:30 PM

No, what I am saying simply does not exist yet.

I am saying I want my OS to expose APIs like it does for the disk or the network for AI. And I want my apps to be able to use those APIs.

I want my backend LLMs to be able to change on a whim. Imagine an Android app consuming from these LLMs. Maybe I am outside and it is making queries to Gemini. And maybe I get home and now it makes queries to my local llm, almost like connecting to local Wifi.

What I am saying does not exist on many levels:

- Agreed upon APIs for this don't think exist (in text maybe, but not in image/sound/video).

- OSs do not expose this (I am not talking manually configured user space stuff here).

- I see a world where your Network provider bundles "calls + data plan + AI tokens". But not only are the offerings for these not standardized, in order to even reach that point we would need to standardize the offerings. How do you compare intelligence among models? How do you compare cost?

- The apps need to start adopting this model

The tech is here, the ecosystem is not.

show 3 replies