logoalt Hacker News

mfa1999today at 5:41 AM1 replyview on HN

How does this compare to llama.cpp in terms of performance?


Replies

solarkrafttoday at 6:23 AM

MLX is a bit faster (low double digit percentage), but uses a bit more RAM. Worthwhile tradeoff for many.

show 1 reply