It seems to me like they're saying the agent made the tool call they expected, but the harness didn't reject it like they expected it to.
But it sounds like it's not even a harness issue if they have a process where they send a reset email to an address that isn't associated with the account.
This isn't (just) a validation issue, and shouldn't be at the harness level.
But it sounds like it's not even a harness issue if they have a process where they send a reset email to an address that isn't associated with the account.
This isn't (just) a validation issue, and shouldn't be at the harness level.