logoalt Hacker News

mattnewtonyesterday at 8:52 PM1 replyview on HN

people are trying, especially for inference. For training, it’s just too high risk to tank your training I think.

TPUs are at least dogfooded by Google deepmind, no team AFAIK has gotten the AMD stack to train well.


Replies

coder-3yesterday at 9:41 PM

Interesting. Why? My current mental model is that AMD chips are just a bit behind, so, less efficient, but no biggie. Do labs even use CUDA?

show 6 replies