As a Claude Code user, you almost certainly will. Even using the same OpenAI model with an agent like Codex will likely perform better. An agent can test and iterate on its own solution until it meets the defined success metrics. If you’re working within an existing code base, having preexisting code to demonstrate similar implementations also significantly improves the quality of results, and that’s something an agent can dig into as part of its solution process. I haven’t used ChatGPT in a while, so maybe it’s more sophisticated than it used to be, but I think you’ll see much better results with the latest agent tools.
As a Claude Code user, you almost certainly will. Even using the same OpenAI model with an agent like Codex will likely perform better. An agent can test and iterate on its own solution until it meets the defined success metrics. If you’re working within an existing code base, having preexisting code to demonstrate similar implementations also significantly improves the quality of results, and that’s something an agent can dig into as part of its solution process. I haven’t used ChatGPT in a while, so maybe it’s more sophisticated than it used to be, but I think you’ll see much better results with the latest agent tools.