ai_slop_hater • today at 6:27 AM • 1 reply

No, they are clearly not just scaled-up versions of GPT-2; there are different LLM architectures, such as mixture of experts (MoE), that appeared relatively recently. I am not an expert though, far from it.

Replies

otabdeveloper4 • today at 6:35 AM

MoE and similar techniques are basically performance enhancements; they don't make the model smarter.

