It's good that there is a movement for open LLMs, but it's not where the battleground is

dTal • yesterday at 10:53 PM • 13 replies • view on HN

It's good that there is a movement for open LLMs, but it's not where the battleground is right now. The battleground is local vs service LLMs, and we are losing that battle badly despite all the software being here now and viable, entirely because UX sucks.

How many normal people do you know who use "ChatGPT"? A lot, probably.

How many even know what "Gemma" is, let alone have downloaded llama.cpp, a GGUF file from Hugginface, and run "llama-server" from a text console with all the correct command arguments? How many are thinking about this use case when speccing out their next computer? Where is the breathless marketing copy boasting x tok/s?

We are sleepwalking into slavery.

Replies

627467 • yesterday at 11:34 PM

"Normal people" have never bothered to host their own: photos, music, videos, documents, comunications, etc. To the point that for many their computer is essentially a thin client into someone else's server. Why would we think this same people would care about "personal" inference?

trollbridge • today at 1:13 AM

Normal people can go open an account at DeepSeek or Xiaomi and chat away for free. Or, for that matter, a couple other models like z.ai's (GLM-5.2 isn't in the free tier, though, but neither is GPT-5.5-Pro), or Qwen, which does have 3.7-Max for free with no account on their chatbot interface.

Yes, I realise this isn't "running a local model", but it's using models that can be grabbed and run locally. For my pipelines, I feel far more confidence when I use an open model (even one like GLM-5.2 that would be expensive for me to run) since I have a backup plan if the hosted/cloud option becomes unworkable for me. If that happens to me with Opus, I have zero options.

cdata • today at 12:34 AM

If our strategy to avoid "slavery" involves "normal people" taking the local-vs-managed choice seriously, we have already lost.

This choice is made for us. The deciding factors will be convenience and economics.

My sense is that just like Web 2.0 SaaS we are destined for servitude.

A better strategy is to play an assymetrical game IMO. Don't let your would-be master write the rules by which you play.

➕ show 1 reply

8note • yesterday at 11:17 PM

normal people dont really have the hardware to run local models

➕ show 5 replies

conception • today at 12:30 AM

Google Edge Gallery is turn key for people and on the device most people chatgpt on. Just like with most Google Stuff “edge gallery” is maybe the worst name possible for “run AI on your phone”!

theptip • yesterday at 11:47 PM

Why do you feel the important part _now_ is where the weights get run?

I can see this as a future battleground but access to frontier models (which you cannot run locally) seems a lot more relevant today.

➕ show 1 reply

itkovian_ • today at 12:39 AM

You can’t run a closed llm locally. Strange to frame the dichotomy as between local and open. One begets the other.

idiotsecant • yesterday at 10:57 PM

Better UX does not buy you a datacenter farm to train state of the art cutting edge models. Right now the only people who can do that are the technobility class.

➕ show 2 replies

azinman2 • yesterday at 11:32 PM

> We are sleepwalking into slavery.

That’s a bit hyperbolic…

➕ show 1 reply

0gs • yesterday at 11:40 PM

it's funny because i made this thing (called enough) that aims to make it easy for non-technical people to get up and running with local models quickly, but it is impossible to figure out how to break through the noise. every thread and comment like this breaks my heart a lil bit

➕ show 1 reply

double0jimb0 • yesterday at 10:57 PM

Yea, anyone who understands what makes products actually usable is opting to get paid for said skill.

bsder • today at 12:49 AM

> we are losing that battle badly despite all the software being here now and viable, entirely because UX sucks.

Yep. I'm an old time Linux sysadmin, but I am COMPLETELY baffled as to what I can or cannot run on my 32GB R9700 with 128GB main CPU memory.

If I want something Claude or Codex like what do I use that would be useful? If I want a chat system, what do I use? Images--apparently ComfyUI for setup but after that what do I do?

I don't even mind spinning up something in the cloud for a bit, but I need to know how I'm going to get data up and down without racking up massive bandwidth charges.

I'd love to do some tinkering, but the field is moving so fast and so full of charlatans that cleaning the dross out is almost impossible.

➕ show 2 replies

wmf • yesterday at 11:05 PM

LM Studio

alt Hacker News

Replies