Hacker News

brcmthrowaway today at 5:30 AM

What is the difference between Ollama, llama.cpp, ggml and gguf?


Replies

benob today at 5:48 AM

Ollama is a user-friendly UI for LLM inference. It is powered by llama.cpp (or a fork of it), which is more power-user oriented and requires command-line wrangling. GGML is the math library behind llama.cpp, and GGUF is the associated file format used for storing LLM weights.
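
To make the "GGUF is a file format" part concrete, here is a rough sketch of reading the fixed header at the start of a GGUF file. It assumes the version 2 or later layout (4-byte magic `GGUF`, a little-endian uint32 version, then uint64 tensor and metadata counts). The function name `read_gguf_header` and the returned dict keys are made up for illustration and are not part of llama.cpp or Ollama.

```python
import struct

GGUF_MAGIC = b"GGUF"
HEADER_SIZE = 24  # 4 (magic) + 4 (version) + 8 (tensor count) + 8 (metadata count)


def read_gguf_header(data: bytes) -> dict:
    """Parse the first 24 bytes of a GGUF file.

    Assumption: the file uses the v2+ header layout with 64-bit counts.
    This only reads the header; it does not parse metadata or tensors.
    """
    if len(data) < HEADER_SIZE:
        raise ValueError("need at least 24 bytes for a GGUF header")

    magic = data[:4]
    if magic != GGUF_MAGIC:
        raise ValueError(f"not a GGUF file: magic={magic!r}")

    # "<" means little-endian with no padding: uint32, uint64, uint64
    version, tensor_count, kv_count = struct.unpack_from("<IQQ", data, 4)
    return {
        "version": version,
        "tensor_count": tensor_count,
        "metadata_kv_count": kv_count,
    }
```

With a real model file you would call it on the first 24 bytes, e.g. `read_gguf_header(open(path, "rb").read(24))`, where `path` points at whatever `.gguf` file you have locally.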

xiconfjs today at 5:34 AM

Ollama on macOS is a one-click solution with stable one-click updates. Happy so far. But MLX support was the only missing piece for me.
