logoalt Hacker News

charcircuityesterday at 6:55 AM1 replyview on HN

>and doing that will cause a huge one-time hit against your token limit if the session has grown large.

Anthropic already profited from generating those tokens. They can afford subsidize reloading context.


Replies

pixl97yesterday at 7:04 PM

No they can't, that's what you don't seem to get.

Reloading those tokens takes around the same effort as processing them in the first place.

It's ok to be ignorant of how the infrastructure for LLMs work, just don't be proud of it.

show 1 reply