Hacker News • visarga • yesterday at 12:20 PM
There are ways to quantize or otherwise compress the KV cache to shrink its memory footprint.
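To make the quantization idea concrete, here is a minimal NumPy sketch of one simple variant: symmetric int8 quantization with one scale per cached vector. The comment does not name a specific method, so this is an illustration rather than any particular library's scheme; the tensor shape `(heads, seq_len, head_dim)`, the float32 baseline, and the helper names `quantize_int8` / `dequantize_int8` are all assumptions made up for the example.

```python
import numpy as np

def quantize_int8(x, axis=-1):
    """Symmetric int8 quantization with one scale per slice along `axis`."""
    max_abs = np.max(np.abs(x), axis=axis, keepdims=True)
    # Avoid dividing by zero for all-zero slices.
    scale = np.where(max_abs > 0, max_abs / 127.0, 1.0).astype(np.float32)
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize_int8(q, scale):
    """Approximate reconstruction of the original float values."""
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
# Hypothetical cached keys: (heads, seq_len, head_dim), stored as float32.
keys = rng.standard_normal((8, 128, 64)).astype(np.float32)

q_keys, scales = quantize_int8(keys)      # what would be kept in the cache
restored = dequantize_int8(q_keys, scales)  # what attention would read back

print("float32 bytes:", keys.nbytes)
print("int8 bytes (plus scales):", q_keys.nbytes + scales.nbytes)
print("max abs error:", float(np.max(np.abs(restored - keys))))
```

In this sketch the int8 payload is a quarter the size of the float32 original, plus a small overhead for the scales, and each value is off by at most half a quantization step. Whether that error is acceptable for a given model is something you would have to measure.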