Is this something that will show up in Ollama any time soon to increase context size of local models?
KV-cache quantization has long been available in llama.cpp.
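For context, llama.cpp exposes this through the `--cache-type-k` / `--cache-type-v` options on `llama-server` and `llama-cli` (exact flag spellings can vary between versions, and 4-bit V-cache quantization generally also requires flash attention). A rough sketch, with a placeholder model path:

```shell
# Quantize both the K and V caches to q8_0, roughly halving KV-cache
# memory versus the default f16 and allowing a larger context window.
# Model path and context size below are placeholders; adjust to taste.
./llama-server -m ./model.gguf -c 32768 \
  --cache-type-k q8_0 --cache-type-v q8_0 \
  --flash-attn
```

Newer Ollama releases expose the same underlying feature via the `OLLAMA_KV_CACHE_TYPE` environment variable (together with `OLLAMA_FLASH_ATTENTION=1`), so check the docs for the version you're running.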