fortyseven, yesterday at 1:33 PM

Strangely, I haven't had a lot of luck with vLLM; I finally ended up ditching Ollama and going straight to the tap with llama-server in llama.cpp. No regrets.