Hacker News

wongarsu last Thursday at 7:52 PM

Qwen's flamingo is artistically far more interesting. It's a one-eyed flamingo with sunglasses and a bow tie who smokes pot. Meanwhile Opus just made a boring, somewhat dorky flamingo. Even the ground and sky are more interesting in Qwen's version

But in terms of making something physically plausible, Opus certainly got a lot closer



kmacdough last Thursday at 8:07 PM

Given adherence is a more significant practical barrier, it's probably the better signal. That is, if we decide to look for signal here.

itake last Friday at 3:38 AM

"artistically interesting" is IMHO both a subjective and 'solved' problem. These models are trained with an "artistically interesting" reward model that tries to guide the model towards higher quality photos.

I think getting the models to generate realistic and proportional objects is a much harder and more important challenge (remember when the models would generate 6 fingers?).

tpm last Friday at 5:47 AM

The Opus bike isn't very physically plausible though.