logoalt Hacker News

gslepakyesterday at 3:34 PM4 repliesview on HN

> The non-hallucination rate in AA-omniscience is SOTA

Note that a perfect "non-hallucination rate" is rather meaningless as such tests can contain human hallucinations.

It means the model aligns with the possibly-true, possibly-false beliefs of the group that made the test.


Replies

rltyesterday at 4:09 PM

Well, yes, garbage in garbage out. That's a given and not what's meant by "hallucination" in this context.

jchengyesterday at 5:45 PM

Here are some examples of the questions in the benchmark. If these are representative, they seem pretty cut and dry. https://artificialanalysis.ai/evaluations/omniscience#exampl...

areweaiyesterday at 8:48 PM

Was there something about this specific model and submission that made you feel compelled to write this self-evident observation?

Or would you describe your methodology as more like picking a random sentence fragment as an input value then generating completions from your existing corpus without any post-input "learning" process related to the rest of the source material?

anti-zionistyesterday at 9:00 PM

[dead]