Hacker News

zahlman yesterday at 4:20 AM

It seems to me like they're saying the agent made the tool call they expected, but the harness didn't reject it like they expected it to.


Replies

eterm yesterday at 10:19 AM

But it sounds like it's not even a harness issue if they have a process where they send a reset email to an address that isn't associated with the account.

This isn't (just) a validation issue, and it shouldn't be handled at the harness level.
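
To make the point concrete, here is a rough sketch of what a service-side check might look like, so that it applies regardless of which agent or harness makes the call. Everything in it is hypothetical and for illustration only: the `Account` type, the in-memory `ACCOUNTS` store, and the `request_password_reset` function are assumptions, not anything from the system under discussion.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Account:
    account_id: str
    email: str  # the only address a reset may be sent to


# Hypothetical in-memory store standing in for a real user database.
ACCOUNTS = {"acct-1": Account("acct-1", "owner@example.com")}


def request_password_reset(account_id: str, requested_email: str) -> Optional[str]:
    """Return the address a reset email would go to, or None if refused.

    The check lives in the service itself, so it holds no matter which
    caller (agent, harness, or human) issues the request.
    """
    account = ACCOUNTS.get(account_id)
    if account is None:
        return None
    # Compare case-insensitively after trimming whitespace; refuse on mismatch.
    if requested_email.strip().lower() != account.email.lower():
        return None
    # Always use the stored address, never the caller-supplied one.
    return account.email
```

With a check like this in the reset process, a harness that fails to reject a bad tool call is no longer the last line of defense: the request for an unassociated address is simply refused.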