logoalt Hacker News

mbgerringyesterday at 5:55 PM0 repliesview on HN

Something I find really confusing from this post is the MLX versions of the model running much slower. As I understand it, these model versions are meant to take advantage of Apple Silicon and MacOS APIs, and should produce better/faster results. Any insight into what’s happening here?