Hacker News

jorisw today at 7:40 AM (2 replies)

I like the Possible Hallucinations feature. Seems like a feature that could stand on its own. Interested in how you separate those out.


Replies

turtlesoup today at 8:06 AM

Thanks! It is part of the clustering step: I tell the model to judge whether something is inanimate or hallucinated (defined as having low support, from non-frontier models only, or by the model's own judgement). I iterated on this a lot and made an eval set out of my LinkedIn contacts, where I run GPT5.5 with web search and xhigh reasoning to determine pseudo ground truth. I tuned this for higher recall (more things classified as non-hallucination), but it is definitely not perfect.
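For readers who want something concrete, here is a rough sketch of the "low support from only non-frontier models" rule and of the recall being tuned. It is not the actual implementation: the comment above says the judgement is made by the model during clustering, whereas this swaps in a plain threshold rule for illustration. The model names, the `Claim` type, the `max_support` threshold, and the sample data are all made-up assumptions.

```python
from __future__ import annotations

from dataclasses import dataclass

# Hypothetical placeholder names; which models count as "frontier" is an assumption.
FRONTIER_MODELS = frozenset({"frontier-a", "frontier-b"})


@dataclass(frozen=True)
class Claim:
    text: str
    supporters: frozenset[str]  # models whose output contained this claim


def is_possible_hallucination(claim: Claim, total_models: int,
                              max_support: float = 0.34) -> bool:
    """Flag a claim when few models support it and none of them is a frontier model."""
    support = len(claim.supporters) / total_models
    only_non_frontier = claim.supporters.isdisjoint(FRONTIER_MODELS)
    return support <= max_support and only_non_frontier


def non_hallucination_recall(flags: list[bool], truth_is_real: list[bool]) -> float:
    """Share of truly real claims that were kept, i.e. not flagged."""
    kept_or_not = [flag for flag, real in zip(flags, truth_is_real) if real]
    if not kept_or_not:
        return 0.0
    return sum(1 for flag in kept_or_not if not flag) / len(kept_or_not)


# Illustrative data only: three claims gathered from four models.
claims = [
    Claim("claim A", frozenset({"frontier-a", "small-1", "small-2"})),
    Claim("claim B", frozenset({"small-1"})),
    Claim("claim C", frozenset({"small-2"})),
]
flags = [is_possible_hallucination(c, total_models=4) for c in claims]

# Pseudo ground truth (assumed): A and C are real, B is not.
recall = non_hallucination_recall(flags, [True, False, True])
```

Lowering `max_support` flags fewer claims, which raises recall on real claims at the cost of letting more hallucinations through. In this toy data, claim C is real but still gets flagged, the kind of miss that "definitely not perfect" allows for.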

lobofta today at 7:46 AM

In my case the possible hallucination was the only one that was 100% factual.