logoalt Hacker News

shardullavekartoday at 1:31 PM1 replyview on HN

has anyone come across an r2d3-style explainer for something as high-dimensional as a Transformer's attention mechanism?


Replies