The most "productive" flow I found was when I had both memberships and let Claude do the "I go yeet your feature" side and Codex do the "WTF bro, that's full of race conditions!" review phase.
But now I just use Codex. Claude is unreliable: it leaves data races all over and, as you say, fairly consistently leaves negative conditions unhandled.