logoalt Hacker News

kamranjonyesterday at 4:07 PM2 repliesview on HN

I am curious if anyone knows if the neural cores in apple silicon based machines are at all useful in training? I’ve been using the MLX framework but haven’t seen them mentioned anywhere so I’m just wondering if they are only useful for inference? I know whisper.cpp takes advantage of them in the inference context.

Edit: I changed llama.cpp to whisper.cpp - I didn’t realize that llama.cpp doesn’t have a coreml option like whisper.cpp does.


Replies

lucasoshiroyesterday at 4:28 PM

Well, the TensorFlow port to metal was written by Apple, and it doens't use ANE. If even they have chosen to use only GPU probably the ANE wouldn't help in training. I also heard that the ANE is way less powerful than Apple Silicon's GPU, but I don't have numbers

486sx33yesterday at 4:39 PM

Maybe a quick side shift - What the heck are apples neural cores good for ? Used for ? Use cases ?

show 3 replies