Outputting video of that quality/consistency at 1 minute, for a 2.6B model seems insane?
It's because it is insane/misleading. It's a two stage process, scroll to the key features:
> A dedicated 17B long-video refiner sharpens texture, motion, and late-window quality on top of the long-rollout backbone.
It's because it is insane/misleading. It's a two stage process, scroll to the key features:
> A dedicated 17B long-video refiner sharpens texture, motion, and late-window quality on top of the long-rollout backbone.