logoalt Hacker News

throawayontheyesterday at 2:21 PM1 replyview on HN

referencing this:

https://artificialanalysis.ai/evaluations/omniscience?models...

(had to add it to the chart, wasn't displayed by default. is it the lowest rate in the datasetor no?)


Replies

jampekkayesterday at 7:27 PM

This counts only incorrect answers though. A model can get 0% hallucination rate just by refusing to answer all questions.

show 4 replies