alt
Hacker News
lostmsu
•
yesterday at 2:53 PM
•
0 replies
•
view on HN
In large providers KV caches are the main bottleneck, no?