Measuring Claude 4.7's tokenizer costs

597 points • by aray07 • yesterday at 3:29 PM • 420 comments • view on HN

Comments

I can manage session cost effectively myself if forking and rewinds were first class features

Contrary to people here who feel the price increases, reduction of subscription limits etc are the result of the Anthropic models being more expensive to run than the API & subscription revenue they generate I have a theory that Anthropic has been in the enshittification & rent seeking phase for a while in which they will attempt to extract as much money out of existing users as possible.

Commercial inference providers serve Chinese models of comparable quality at 0.1x-0.25x. I think Anthropic realised that the game is up and they will not be able to hold the lead in quality forever so it's best to switch to value extraction whilst that lead is still somewhat there.

➕ show 1 reply

dallen33 • yesterday at 4:05 PM

I'm still using Sonnet 4.6 with no issues.

➕ show 1 reply

chakintosh • yesterday at 8:16 PM

Yeah one PRD request of a small scope app cost me 70%

ricardobeat • yesterday at 5:07 PM

I can’t stand reading this. One article. Many words. Not written by a human.

Feels like LLMs are devolving into having a single, instantly recognizable and predictable writing style.

varispeed • yesterday at 4:54 PM

Don't forget that the model doesn't have an incentive to give right solution the first time. At least with Opus 4.6 after it got nerfed, it would go round in circles until you tell it to stop defrauding you and get to correct solution. That not always worked though. I found starting session again and again until less nerfed model was put on the request. Still all points to artificially make customer pay more.

thibran • yesterday at 5:26 PM

For me there is no point in using Claude Opus 4.7, it's too expensive since it does not do 100% of the job. Since AI can anyway only do 90% of most tasks, I can use another model and do the remaining 15-30% myself.

wartywhoa23 • yesterday at 7:41 PM

Seeing this big crowd of people trying to persuade themselves or others that the ever growing hole in their pockets is totally justified and beneficial is pretty hilarious!

rambojohnson • yesterday at 6:09 PM

So intelligence has turned into a utility per Sam Altman et al., and now the same companies get to hike the price of accessing it by 20–30%, right as it’s becoming the backbone of how teams actually ship work. People are pushing out so much, so fast that last week’s output is already a blur. I’ve got colleagues who refuse to go back to writing any of this stuff by hand.

And now maintaining that pace means absorbing arbitrary price increases, shrugged off with “we were operating at a loss anyway.”

It stops being “pay to play” and starts looking more like pay just to stay in the ring, while enterprise players barely feel the hit and everyone else gets squeezed out.

Market maturing my butthole... it’s obviously a dependency being priced in real time. Tech is an utter shit show right now, compounded by the disaster of the unemployment market still reeling from the overhiring of 2020.

save up now and career pivot. pick up gardening.

➕ show 3 replies

rbren • yesterday at 5:41 PM

Good reminder to choose model-agnostic tooling!

JohnMakin • yesterday at 6:30 PM

30% more token use, but even by their benchmarks, don't appear to have any real big successes there, and some regressions. What's the point? It doesn't do any better on the suite of obedience/compliance tests I've written for 4.6, and in some tests, got worse, despite their claim there it is better. Anecdotally, it was gobbling so many tokens on even the simplest queries I immediately shut it off and went back to 4.5.

Why release this?

AIrtemis • yesterday at 7:49 PM

here comes the rug-pull to justify the enterprise pricing...

therobots927 • yesterday at 5:27 PM

As a regular listener of Ed Zitron this comes as absolutely no surprise. Once you understand the levels of obfuscation available to anthro / OAI you will realize that they have almost certainly hit a model plateau ~1 year ago. All benchmark improvements since have come at a high compute cost. And the model used when evaluating said benchmarks is not the same model you get with your subscription.

This is already becoming apparent as users are seeing quality degrade which implies that anthropic is dropping performance across the board to minimize financial losses.

encoderer • yesterday at 4:52 PM

In my “repo os” we have an adversarial agent harness running gpt5.4 for plan and implementation and opus4.6 for review. This was the clear winner in the bake-off when 5.4 came out a couple months ago.

Re-ran the bake-off with 4.7 authoring and… gpt5.4 still clearly winning. Same skills, same prompts, same agents.md.

tornikeo • yesterday at 8:56 PM

Good lord. Reading all these comments makes me feel so much better for dumping anthropic the first time their opus started becoming dumber (circa Month ago). It feels like most people in this thread are somehow bound to Claude, even though it is alread fully enshittfied.

➕ show 1 reply

Bingolotto • yesterday at 5:20 PM

Talked to Claude earlier today and Opus 4.7 cost up to 35% more.

synergy20 • yesterday at 6:32 PM

that's what i feel, going to use codex more

dionian • yesterday at 8:57 PM

I noticed it was compacting more aggressively which i actually like, because i was letting sessions get really long and using them uncached (parallel sessions)

bcjdjsndon • yesterday at 4:27 PM

Because those braniacs added 20-30% more system prompt

JimmaDaRustla • yesterday at 6:31 PM

Am I dumb, or are they not explaining what level thinking they're using? We all read the Anthropic blog post yesterday - 4.7 max consumes/produces an incredible number of tokens and it's not equivalent to 4.6 max; xhigh is the new "max".

bugsense • yesterday at 6:29 PM

I would use a service like Straion.com to avoid the forths and back. It increases token consumption but I can get things right the first time.

saltyoldman • yesterday at 6:26 PM

I was sort of hoping that the peak is something like $15 per hour of vibe help (yes I know some of you burn $15 in 12milliseconds), and that you can have last year's best or the current "nano/small" model at $1 per hour.

But it looks like it's just creeping up. Probably because we're paying for construction, not just inference right now.

AIrtemis • yesterday at 7:48 PM

here comes the rug-pull

stefan_ • yesterday at 4:32 PM

I don't know anything about tokens. Anthropic says Pro has "more usage*", Max has 5x or 20x "more usage*" than Pro. The link to "usage limits" says "determines how many messages you can send". Clearly no one is getting billed for tokens.

➕ show 1 reply

socratic_weeb • yesterday at 8:09 PM

This is good news. It means the bubble is popping. Bye bye VC subsidies...

CodingJeebus • yesterday at 4:29 PM

The fundamental problem with these frontier model companies is that they're incentivized to create models that burn through more tokens, full stop. It's a tale as old as capitalism: you wake up every day and choose to deliver more value to your customers or your shareholders, you cannot do both simultaneously forever.

People love to throw around "this is the dumbest AI will ever be", but the corollary to that is "this is the most aligned the incentives between model providers and customers will ever be" because we're all just burning VC money for now.

➕ show 3 replies

foreman_ • today at 6:55 AM

[dead]

joewongg • today at 4:16 AM

[dead]

sageframe • yesterday at 8:52 PM

[dead]

SamuelBraude • yesterday at 8:26 PM

[dead]

texttopdfnet • yesterday at 5:20 PM

[dead]

climike • yesterday at 7:47 PM

[dead]

mianzubair • yesterday at 11:13 PM

[flagged]

texttopdfnet • yesterday at 5:19 PM

[dead]

kevinten10 • today at 2:16 AM

[dead]

throwaway613746 • yesterday at 4:51 PM

[dead]

storytellera • yesterday at 7:26 PM

[dead]

mikert89 • yesterday at 4:41 PM

The compute is expensive, what is with this outrage? People just want free tools forever?

alt Hacker News

Measuring Claude 4.7's tokenizer costs

Comments