logoalt Hacker News

wat10000yesterday at 6:01 PM3 repliesview on HN

In one study, GPT-4.5 was judged to be human 73% of the time, which means that the actual human was judged to be human only 27% of the time. More human than human, as Tyrell would say.

Edit: folks, the standard Turing test involves a computer and a human, and then a judge communicating with both and giving a verdict about which one is the human. The percentages for the two entities being judged will add up to exactly 100%. That's how this test was conducted. Please don't assume I'm a moron.


Replies

dwpdwpdwpdwpdwpyesterday at 6:29 PM

The implication would be that GPT-4.5 was not judged to be human 27% of the time. You can't determine how often humans were judged correctly as humans from that data point.

show 1 reply
jmalickiyesterday at 7:30 PM

That was also before the crazy AI hysteria we have today with the em-dash police everywhere.

show 1 reply
Melatonicyesterday at 6:36 PM

Those stats dont necessarily line up that way. Do you have a link?

show 1 reply