Hacker News

Scene_Cast2, yesterday at 6:12 PM

I just ran this with Gemini 3 Pro, Opus 4.6, and Grok 4 (the models I personally find the smartest for my work). All three answered correctly.


Replies

miroljub, yesterday at 7:47 PM

They had plenty of time to update their system prompts so they wouldn't be embarrassed.

I've noticed that whenever such a meme comes out, you can reproduce it yourself if you check immediately, but after a few hours it's already been updated.
