Hacker News

An update on recent Claude Code quality reports

839 points by mfiguiere yesterday at 5:48 PM, 636 comments

Comments

systemvoltage yesterday at 6:42 PM

Interesting. All 3 seem like they’re obviously going to impact quality, e.g., reducing the effort from high to medium.

So then, there must have been an explicit internal guidance/policy that allowed this tradeoff to happen.

Did they fix just the bug or the deeper policy issue?

tontinton yesterday at 7:32 PM

Or you can use a non-vibe-designed, efficient Rust TUI coding agent made by yours truly, called https://maki.sh (all my coworkers use it too :)

Lua plugins are WIP.

maxrev17 yesterday at 8:57 PM

Please for the love of god just put the max price plan up like 4x or 5x in cost and make it actually work.

rishabhaiover yesterday at 6:39 PM

Boris gaslighted us about all the quality-related incidents for weeks, not acknowledging these problems.

teaearlgraycold yesterday at 6:05 PM

> On March 26, we shipped a change to clear Claude's older thinking from sessions that had been idle for over an hour, to reduce latency when users resumed those sessions. A bug caused this to keep happening every turn for the rest of the session instead of just once, which made Claude seem forgetful and repetitive. We fixed it on April 10. This affected Sonnet 4.6 and Opus 4.6.

Is it just me or does this seem kind of shocking? Such a severe bug affecting millions of users with a non-trivial effect on the context window that should be readily evident to anyone looking at the analytics. Makes me wonder if this is the result of Anthropic's vibe-coding culture. No one's actually looking at the product, its code, or its outputs?
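
For anyone trying to picture the failure mode in the quoted passage, here is a minimal sketch of one plausible way a "clear once on resume" step can turn into "clear every turn". Everything in it is an assumption for illustration: the `Session` class, the `was_idle` flag, and both turn functions are hypothetical names, and nothing here is taken from Anthropic's code or from the post beyond the one-hour idle threshold.

```python
from dataclasses import dataclass, field

IDLE_LIMIT_S = 3600  # "idle for over an hour", per the quoted post


@dataclass
class Session:
    # Hypothetical session state, invented for this sketch.
    thinking: list = field(default_factory=list)
    last_turn_at: float = 0.0
    was_idle: bool = False  # sticky flag that the buggy variant never resets


def turn_buggy(s: Session, now: float, new_thinking: str) -> None:
    """Assumed buggy variant: the idle flag is set once and never cleared."""
    if now - s.last_turn_at > IDLE_LIMIT_S:
        s.was_idle = True
    if s.was_idle:
        # Runs on every later turn, so older thinking is dropped each time.
        s.thinking.clear()
    s.thinking.append(new_thinking)
    s.last_turn_at = now


def turn_fixed(s: Session, now: float, new_thinking: str) -> None:
    """Assumed intended behavior: clear only on the turn that resumes."""
    if now - s.last_turn_at > IDLE_LIMIT_S:
        s.thinking.clear()
    s.thinking.append(new_thinking)
    s.last_turn_at = now


def run(turn, times):
    """Replay turns at the given timestamps (seconds) and return kept thinking."""
    s = Session()
    for i, t in enumerate(times):
        turn(s, t, f"t{i}")
    return s.thinking


if __name__ == "__main__":
    times = [0, 10, 5000, 5010, 5020]  # a gap of over an hour before the third turn
    print(run(turn_buggy, times))
    print(run(turn_fixed, times))
```

With these timestamps, the buggy variant keeps only the latest turn's thinking after the resume, while the fixed variant keeps everything from the resuming turn onward, which would match the "forgetful and repetitive" description.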

Rapzid yesterday at 8:14 PM

> On March 4, we changed Claude Code's default reasoning effort from high to medium to reduce the very long latency—enough to make the UI appear frozen—some users were seeing in high mode.

Translation: To reduce the load on our servers.

0gs yesterday at 6:56 PM

wow resetting everyone's usage meter is great. i was so close to finally hitting my weekly limit for once though

taytus yesterday at 8:45 PM

They should do a similar report about their communication team. This was horribly mismanaged.

dainiusse yesterday at 6:09 PM

Corporate bs begins...

epsteingpt today at 12:24 AM

Gaslit for months, only for them to acknowledge it now.

dcchambers yesterday at 8:18 PM

So it turns out Anthropic was gaslighting everyone on Twitter about this, then? Swearing that nothing had changed and that people were imagining the models got worse?

whalesalad yesterday at 7:23 PM

I genuinely don't understand what they have been trying to achieve. All of these incremental "improvements" have ... not improved anything, and have had the opposite effect.

My trust is gone. When day-to-day updates do nothing but cause hundreds of dollars in lost $$$ tokens and the response is "we ... sorta messed up but just a little bit here and there and it added up to a big mess up" bro get fuckin real.

troupo yesterday at 6:44 PM

> they were challenging to distinguish from normal variation in user feedback at first

translation: we ignored this and our various vibe coders were busy gaslighting everyone saying this could not be happening

yuvrajmalgat yesterday at 7:18 PM

ohh

o10449366 yesterday at 7:47 PM

Resuming sessions has been broken since Feb (I had to get Claude to write a hook to fix that itself), the monitoring tool doesn't work and blocks use of what does work (a simple sleep, except it doesn't even block correctly, so you just sidestep it in more ridiculous ways), and yet there seem to be more annoying activity proxies/spinner wheels (staring into the middle distance)... Like, I don't know how in the span of a few months you lose such focus on your product goals. Has Anthropic already reached the point in its lifecycle where the product team is no longer staffed by engineers and more and more non-technical MBAs are joining to ride the hype train?

cute_boi yesterday at 7:46 PM

Honestly, it’s kind of sad that Anthropic is winning this AI race. They are the most anti–open source company, and we should try to avoid them as much as possible.

They are all doing it because OpenAI is snatching their customers. And their employees have been gaslighting people [1] for ages. I hope open-source models will provide fierce competition so we do not have to rely on an Anthropic monopoly.

[1] https://www.reddit.com/r/claude/comments/1satc4f/the_biggest...

petervandijck yesterday at 6:47 PM

I have noticed a clear increase in smarts with 4.7. What a great model!

People complain so much, and the conspiracy theories are tiring.