Just get the bigger models to figure out the architecture required for hot-swappable sub-experts wit...

FridgeSeal • yesterday at 1:07 AM • 0 replies • view on HN

Just get the bigger models to figure out the architecture required for hot-swappable sub-experts without loss of performance!

Got all those tokens, isn’t that the point of auto research and friends??

(Only sort of joking).

alt Hacker News