logoalt Hacker News

ameliusyesterday at 12:17 PM1 replyview on HN

If you can make an LLM solve a problem but from 100 different angles at the same time, that's worth something.


Replies

mmmllmyesterday at 12:30 PM

Isn't that essentially how the MoE models already work? Besides, if that were infinitely scalable, wouldn't we have a subset of super-smart models already at very high cost?

Besides, this would only apply for very few use cases. For a lot of basic customer care work, programming, quick research, I would say LLMs are already quite good without running it 100X.

show 3 replies