crisnoble • yesterday at 2:37 PM • 1 reply

I am left wondering: if it is such a critical task, how would even a 1% error rate reduce the need for human review of all outputs?

Replies

Humans, of course, will screw up at least 1% of the time, at least when judged retroactively.

The fun part is, if you have non-trivial inputs, even if you don’t change anything, you’ll likely get a different 1% set of errors each time, no matter how perfect your judges are.

10% seems pretty high, but it really all depends on what you’re evaluating. If it’s all weird edge cases….
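
A rough way to see the "different 1% each time" point is a toy simulation. This is only a sketch under a strong assumption: each item is misjudged independently with the same probability. That is not how real judges behave, since errors tend to cluster on hard inputs. The function names, the 10,000-item size, and the seed are all made up for illustration.

```python
import random


def error_set(n_items, error_rate, rng):
    """Indices misjudged in one run, assuming independent per-item errors."""
    return {i for i in range(n_items) if rng.random() < error_rate}


def overlap_between_runs(n_items=10_000, error_rate=0.01, seed=0):
    """Run the same (hypothetical) noisy judge twice and compare the error sets."""
    rng = random.Random(seed)
    first = error_set(n_items, error_rate, rng)
    second = error_set(n_items, error_rate, rng)
    # Return the size of each error set and how many items both runs got wrong.
    return len(first), len(second), len(first & second)


if __name__ == "__main__":
    a, b, shared = overlap_between_runs()
    print(f"run 1 errors: {a}, run 2 errors: {b}, shared: {shared}")
```

Under this independence assumption, each run should flag roughly 1% of items, but the two sets should barely overlap. If the errors were concentrated on a fixed set of weird edge cases instead, the overlap would be much larger.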
