Hacker News

suprfnk, yesterday at 6:19 PM

But then, if an agent picks the best response, how would you know that its choice is reliable?


Replies

onion2k, yesterday at 7:43 PM

You could get the agents to output something structured and then use a deterministic test if you're worried about that.
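
A rough sketch of what that could look like, assuming each agent is asked to reply with a JSON object and the task has an answer that plain code can check. The task (sorting a list), the `sorted` field name, and the helpers `is_valid` and `pick_verified` are all made up for illustration; nothing here comes from a particular agent framework.

```python
import json
from typing import List, Optional


def is_valid(raw: str, numbers: List[int]) -> bool:
    """Deterministic test: the reply must be a JSON object whose
    "sorted" field (hypothetical name) equals the sorted input."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return False  # free-form prose or malformed JSON is rejected
    if not isinstance(data, dict):
        return False
    return data.get("sorted") == sorted(numbers)


def pick_verified(candidates: List[str], numbers: List[int]) -> Optional[str]:
    """Return the first candidate that passes the check, or None.
    No model is involved in the selection step."""
    for raw in candidates:
        if is_valid(raw, numbers):
            return raw
    return None


# Stand-ins for raw agent outputs (hypothetical data).
candidates = [
    "Sure! Here you go: [1, 2, 3]",  # not structured
    '{"sorted": [3, 1, 2]}',         # structured but wrong
    '{"sorted": [1, 2, 3]}',         # structured and correct
]

print(pick_verified(candidates, [3, 1, 2]))  # {"sorted": [1, 2, 3]}
```

The point is that the selection no longer depends on an agent's judgment, only on a check you can read and rerun. It only helps when the property you care about can actually be tested in code.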

xienze, yesterday at 6:34 PM

Obviously you have multiple agents justify why they picked a certain response and then create another agent that picks the solution with the best justification.
