logoalt Hacker News

vunderbayesterday at 11:45 PM0 repliesview on HN

Results are in for `gemini-3.1-flash-image-preview` (NB 2) for the GenAI Showdown site in the editing comparisons. Remember to click the "Pass/Fail" button to toggle between pass/fail and a weighted score to account for additional factors like steerability, image quality, etc.

Unfortunately, unlike the leap from NB to NB Pro, we did not see significant gains from NB Pro to NB Pro 2.

In several cases (such as the Jaws Poster), we observed that it was substantially more difficult to prevent NB Pro 2 from making significant changes to the rest of the image. Localization of edits, in general, seems to have changed and not necessarily for the better.

http://genai-showdown.specr.net/image-editing

Comparison solely between the Gemini models (NB, NB Pro, and NB Pro 2):

http://genai-showdown.specr.net/image-editing?models=nb,nbp,...