logoalt Hacker News

timcobbyesterday at 10:44 PM10 repliesview on HN

Can people share their GLM and open model setups in general please? What provider do you use. Why do you trust it with serving full quality? What harness do you use? Why do you trust it not to have malware (most harnessed are TS apps). I am just trying GLM 5.1 from Nvidia build in open code would love to hear how you all do it, thanks.


Replies

59nadirtoday at 8:27 AM

> What provider do you use?

1. My own harness + Local (which usually means Qwen3.6-35B-A3B), I use this fairly often for research gathering on topics, info gathering on code bases, etc.

2. My own harness + DeepSeek v4 Flash served by DeepSeek, I added $20 quite some time ago and somehow still have $18.77 in there after I don't know how many prompts. I use this pretty often, slightly less than my local setup, it's great and what I'm planning on running locally (eventually).

3. My own harness + OpenRouter with whichever model I want to try out. I use this very rarely.

4. Pi + OpenAI Codex $20 subscription. I don't use this almost at all anymore, but I keep the Codex subscription for testing things out to see how GPT-5.5 will handle a problem the other setups have issues with.

> Why do you trust it with serving full quality?

The only thing I've noticed seems unbearably useless sometimes versus what I noticed before was GPT-5.5 which has had some of the weirdest degradations I've seen. It's not to Anthropic levels but it definitely had some service issues a few times where I was wondering if they had accidentally (or purposefully) lobotomized it.

Everything else has mostly just been the same, except DeepSeek I noticed had some speed issues a few days ago.

> What harness do you use? Why do you trust it not to have malware (most harnessed are TS apps)?

I pretty much only use my own, agents are trivial to make and it's definitely not hard to make one that's better than Claude Code or Codex for whatever you're doing.

show 2 replies
michimagdesignyesterday at 11:13 PM

Next to my Claude Pro plan, I have subbed to OpenCode Go. I find the OpenCode UX much better than in Claude Code CLI. As for models, I started a few months ago with GLM 5.1 and it was solid and could archive near sonnet-level tasks. It weirdly sputtered out Chinese characters sometimes. Then I switched to Kimi K2.6, which is the Chinese model I used the most until now. It used way too many reasoning tokens (improved in k2.7). But executed Claude created plans reliably. Now I’m back with GLM 5.2 and it’s really solid (among other things it’s good at design) and I get good usage with the $10 plan. Still the Claude models have less hiccups but the Chinese models are getting really close.

rescbrtoday at 4:29 PM

Z.ai legacy Pro coding plan which will last me until the end of the year + maki.sh as the agent.

OpenCode works fine, i just find it very resource intensive for no good reason.

gandreaniyesterday at 11:26 PM

I use both the openai subscription and the opencode go subscription. I use the go subscription for my personal work and the openai subscription for my consulting work.

The differences between the models are minimal, but I usually stick with gpt-5.4-mini, gpt-5.4, mimo-pro-2.5, deepseek-v4-pro. These latter ones have way more usage than even using 5.4-mini so I tend to use them in personal projects for that reason.

My harness is https://github.com/can1357/oh-my-pi. I trust it...enough. It updates very frequently so as a safe guard I run it sandboxed with https://github.com/containers/bubblewrap so it can only access the project folder and some whitelisted config files

show 1 reply
johndoughtoday at 7:23 AM

    > What provider do you use.
OpenRouter with pinned DeepSeek provider or OpenCode Go

    > Why do you trust it with serving full quality?
Quality seems good so far.

    > What harness do you use? Why do you trust it not to have malware (most harnessed are TS apps).
I wrote my own. A minimal harness without dependencies is only 65 lines of Python.
chess10kptoday at 3:29 PM

Pi is great, set it up with a system prompt to give the model more direction and think less, and it crushes anything I give it

smoeyesterday at 11:03 PM

For work, I mostly use Codex and some Claude. For personal use, I’ve started using Chinese models directly through their respective providers, mostly for automation tasks and experiments so far, either via the API directly or through the Pi harness.

I do not trust any of them. Everything runs inside virtual machines, not just the sandboxes provided by the harnesses. I also do not run Claude or Codex directly on the host machine. Not just because of supply chain fears, but also because of how incredibly user hostile the VC funded companies are when it comes to installing random stuff on your machine.

ukuinatoday at 3:16 AM

Synthetic.new and Claude Code using GLM-5.2. Great model, but the harness will error out if using subagents. The base plan only allows one concurrent request at a time. Also, GLM will burn through your weekly quota in a day if you're not precise with your scope.

Fr0styMatt88today at 3:27 AM

Local using Qwen3.6-27B; 2xRTX 5070Ti graphics cards; VS Code with Cline at the moment and Ollama back-end (will get to trying the others soon).

rainmakingyesterday at 10:54 PM

GLM 5.2 coding plan- I'll post the agent as soon as I can! But opencode works and their own zcode is really good as well.