guybedo, today at 12:12 AM

GLM-5.2 has been a step change in how fast I can burn through tokens.

I subscribed to their max plan to try it out. It counted 700M tokens against me and drained my weekly quota in under 2 days.

The quota reset less than 24h ago and I'm already at over 60% of my weekly quota.

For reference, the kind of work I did would have used somewhere between 3% and 5% of a Codex max or Claude max quota.

The model is good, the plan is a scam.


Replies

try-working, today at 2:59 AM

Kimi and GLM models have inspired a new term: thinkslop. They run a chain of thought that is up to 10x longer than other models', and it seems that, through a lookback mechanism, they can use the CoT to reason their way to solutions for tasks they couldn't otherwise solve.

The downside, of course, is that they consume many more tokens from your plan and are significantly slower. Kimi K2.7 takes about 7x longer to finish the same benchmark tasks as DeepSeek V4 Pro on my router benchmarks (https://role-model.dev/).

So for now I'm happy with just two models: GPT and DeepSeek.

jubilanti, today at 12:16 AM

> The model is good, the plan is a scam

If it needs to generate that many tokens to do the same tasks, then it probably has higher inference costs. So (for you) the model is bad; the plan is the same plan.

thefourthchime, today at 3:55 PM

I gave it my standard prompt:

"Make a pac-man game in a single html page"

It went off and argued with itself for 20 minutes about how to lay out the map and then timed out.

anatoliikmt, today at 2:05 AM

What kind of tasks have you been using it for?