referencing this: | alt Hacker News

throawayonthe • yesterday at 2:21 PM • 1 reply • view on HN

referencing this:

(had to add it to the chart, wasn't displayed by default. is it the lowest rate in the datasetor no?)

This counts only incorrect answers though. A model can get 0% hallucination rate just by refusing to answer all questions.

➕ show 4 replies