mbgerring • yesterday at 5:55 PM

Something I find really confusing in this post is that the MLX versions of the model run much slower. As I understand it, these versions are meant to take advantage of Apple Silicon and macOS APIs, and should produce better or faster results. Any insight into what’s happening here?
