logoalt Hacker News

bigyabaiyesterday at 9:26 PM0 repliesview on HN

For larger contexts (eg. 20,000+ token agent workflows), being 10x faster still isn't enough. You have to be close to ~100x faster at crunching contexts for it to feel like realtime.