What is the difference between Ollama, llama.cpp, ggml and gguf?

brcmthrowaway • today at 5:30 AM • 2 replies • view on HN

Replies

Ollama is a user-friendly UI for LLM inference. It is powered by llama.cpp (or a fork of it) which is more power-user oriented and requires command-line wrangling. GGML is the math library behind llama.cpp and GGUF is the associated file format used for storing LLM weights.

➕ show 1 reply

xiconfjs • today at 5:34 AM

Ollama on MacOS is a one-click solution with stable obe-click updates. Happy so far. But the mlx support was the only missing piece for me.

➕ show 1 reply

alt Hacker News

Replies