logoalt Hacker News

ggerganovtoday at 6:53 AM1 replyview on HN

Better keep the KV cache in full precision


Replies

freakynittoday at 7:06 AM

Wow.. the GOAT himself.. thank you sooo much for creating llama.cpp ... will re-deploy with full kv cache once requests stop coming.