Hacker News

teruakohatu, yesterday at 11:41 PM

The pelican is really getting old as a standalone evaluation metric. By now pelicans are certainly going to be in the training set, if models are not explicitly tuned to produce them for the press on HN alone.

Keep the pelican, but isn’t it time to add something else, more novel, that all current and past models struggle with?


Replies

caseyf7, today at 5:32 AM

It also seems like all of the models have converged on very similar images.