wat10000 • yesterday at 6:01 PM • 3 replies

In one study, GPT-4.5 was judged to be human 73% of the time, which means that the actual human was judged to be human only 27% of the time. More human than human, as Tyrell would say.

Edit: folks, the standard Turing test involves a computer and a human, and then a judge communicating with both and giving a verdict about which one is the human. The percentages for the two entities being judged will add up to exactly 100%. That's how this test was conducted. Please don't assume I'm a moron.

Replies

dwpdwpdwpdwpdwp • yesterday at 6:29 PM

The implication would be that GPT-4.5 was not judged to be human 27% of the time. You can't determine how often humans were judged correctly as humans from that data point.

jmalicki • yesterday at 7:30 PM

That was also before the crazy AI hysteria we have today with the em-dash police everywhere.

Melatonic • yesterday at 6:36 PM

Those stats don't necessarily line up that way. Do you have a link?
