logoalt Hacker News

falcor84today at 1:35 AM1 replyview on HN

What of it?

For me too, it was around that time last year, with GPT-5, Claude Sonnet 4.5 and then Gemini 3 that I started feeling that these models are clearly becoming great at reasoning. I'm not at all opposed to saying that they are around PhD-level on at least some domains.


Replies

kmaitreystoday at 4:28 AM

I think there's a lot of difference between sounding like someone and being someone. The models are excellent at pretending indeed.

show 1 reply