Agreed! A lot of people were also using ZiT as a refiner downstream to help with some of the more problematic visual aspects of the original Qwen-Image.
I'm really looking forward to running the unified model through its paces.
Something I am skeptical about Z-Image is that it uses Gemma which is imo a weak LLM.
If I were to guess, I would say that Z-Image’s life is shorter than it initially appeared. Even as a refiner which are just workarounds for model issues.
Something I am skeptical about Z-Image is that it uses Gemma which is imo a weak LLM.
If I were to guess, I would say that Z-Image’s life is shorter than it initially appeared. Even as a refiner which are just workarounds for model issues.