alt
Hacker News
zozbot234
•
yesterday at 2:56 PM
•
0 replies
•
view on HN
KV quantization has long been available in llama.cpp