Hacker News

zozbot234, yesterday at 2:56 PM

KV cache quantization has long been available in llama.cpp.