> In a blind evaluation of nearly 3,000 anonymized comparisons, professors rated AI responses sig...

wilg • today at 12:45 AM • 3 replies • view on HN

> In a blind evaluation of nearly 3,000 anonymized comparisons, professors rated AI responses significantly higher than answers written by other professors, with AI winning 75% of head-to-head matchups.

75% win rate seems pretty good!

Paper link: https://law.stanford.edu/wp-content/uploads/2026/06/salinas_...

Replies

causal • today at 12:47 AM

I wonder to what degree the AI was just better at communicating. My experience with attorneys is that they are often some of the worst writers.

➕ show 1 reply

falcor84 • today at 12:51 AM

Yeah, 75% win rate is a ~200 points Elo difference, which is quite massive.

jshier • today at 12:48 AM

I do wish they'd used some more objective criteria. Simply being preferable one of the things LLMs have trained for since the beginning, hence its sycophantic nature.

➕ show 2 replies

alt Hacker News

Replies