1. It's 1 in 10 failures that can take half of your time or bugs that can take a long time to surface. Plus the way they change things largely depends on the current codebase (and how it was created)
2. In my case codex seem to be writing a more solid code, but I still use claude most of the time because it's my witty rubber ducky and I can actually sometimes force some legit insights out of it. Codex is much worse at this. And whether that matters or not depends on the project.