logoalt Hacker News

yummytummylast Sunday at 3:17 PM1 replyview on HN

Ah, so cache usage impacts rate limits. There goes the ”other harnesses aren’t utilizing the cache as efficiently” argument.


Replies

bchernylast Sunday at 3:19 PM

Claude Code is the most prompt cache-efficient harness, I think. The issue is more that the larger the context window, the higher the cost of a cache miss.

show 4 replies