logoalt Hacker News

bensyversonyesterday at 1:12 PM4 repliesview on HN

Everyone who has not hit this bug thinks it’s user error… It’s not. It happened to me a few days ago, and the speed at which I tore through my 5 hour usage cap was easily 10x faster than normal.

Also: sub agents do not get you free usage. They just protect your main context window.


Replies

dmdyesterday at 3:30 PM

I'm on Max. This morning, just to test, before doing anything else whatsoever, I was at 0%, and I typed 'test one two three' into CC.

That put me at 12%.

I have no MCPs except the built in claude-in-chrome.

This is clearly a bug.

cyanydeezyesterday at 11:32 PM

Readimg through this thread, it seems likely is a KV cache "bug". Theyre likely doing too many evictions of the LLM cache so the context is being reloaded to often.

Its a "bug" because its probably an intended effect of capturing the costs of compute but surfacing a fact that they oversold compute to a situations where they cant keep the KV cache hot and now its thrashing.

piva00yesterday at 1:18 PM

Don't they consume less of the token quota in case the subagents are running cheaper models like Sonnet and Haiku compared to Opus?

show 1 reply
master_crabyesterday at 1:44 PM

Yes, sorry. I meant it more as a descriptor of how many tokens it consumes. You are still stuck burning money.