Hacker News

kokakiwi, last Thursday at 4:11 PM

Headroom looks great for client-side trimming. If you want to tackle this at the infrastructure level, we built Edgee (https://www.edgee.ai) as an AI Gateway that handles context compression, caching, and token budgeting across requests, so you're not relying on each client to do the right thing.

(I work at Edgee, so biased, but happy to answer questions.)


Replies

anandvshah, last Friday at 5:32 AM

I have used Edgee.AI and it is amazing.

gilles_oponono, last Thursday at 6:39 PM

100% agree