Hacker News

krackers · today at 6:32 AM · 0 replies · view on HN

I don't think the MoE part has anything to do with it, but the current generation of multimodal models can do thinking interleaved with autoregressive(?) image generation, so it's probably not long before they bake this into the RL process, the same way native chain-of-thought obviated the need for "think carefully step by step" prompts.
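The interleaving pattern described above can be sketched as a toy decode loop. This is not any real model's API, just an illustration under the assumption that "thinking" text tokens and image tokens share one autoregressive stream; all names (`interleaved_decode`, the placeholder token strings) are made up for the example.

```python
def interleaved_decode(steps=3, think_len=2, image_len=4):
    """Toy sketch: emit alternating thought spans and image-token spans
    as a single autoregressive token stream, the interleaved-thinking
    pattern the comment describes for current multimodal models.
    Token values here are placeholder strings, not real model tokens."""
    stream = []
    for step in range(steps):
        # A span of "thinking" tokens, delimited like chain-of-thought.
        stream.append("<think>")
        stream += [f"thought_{step}_{i}" for i in range(think_len)]
        stream.append("</think>")
        # Then a span of image tokens generated conditioned on that thought.
        stream += [f"img_tok_{step}_{i}" for i in range(image_len)]
    return stream
```

Baking this into RL would, presumably, mean rewarding the final image while letting credit flow back through the interleaved thought spans, rather than prompting for them explicitly.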