logoalt Hacker News

ainchyesterday at 3:13 AM1 replyview on HN

This understates the possible headroom as technical challenges are addressed - text diffusion is significantly less developed than autoregression with transformers, and Inception are breaking new ground.


Replies

nylonstrungyesterday at 3:55 AM

Very good point- if as much energy/money that's gone into ChatGPT style transformer LLMs were put into diffusion there's a good chance it would outperform in every dimension