logoalt Hacker News

DangitBobbyyesterday at 6:47 PM1 replyview on HN

I copied/pasted a comment with faulty logic (self-defeating) directly from a HN comment and asked a bunch of models available to me (Gemini and Claude) if it could spot the issue. I figured it would be a nice test of reasoning since an actual human missed it. The only one that found the logic error without help was Claude 4.6 Opus Extending Thinking. The others at best raised relevant counterpoints in the supporting argument but couldn't identify the central issue. Claude's answer seemed miles ahead. I wonder if SotA advancements will continue to distinguish themselves.


Replies

ninjagooyesterday at 10:49 PM

Care to share the comment in question with the rest of us so we can check for ourselves? :-)