logoalt Hacker News

cagefacetoday at 6:52 AM2 repliesview on HN

Verified this locally myself. Thanks for the concrete test. I guess it's time to give Claude another try.


Replies

nsingh2today at 4:26 PM

This is preliminary, but it seems like it might somehow be related to the `## Intermediary updates` system prompt that's provided to the model. Seems like it forces the model to stop thinking and return early to provide updates. Removing that entirely makes all runs succeed [1].

I wonder if it's somehow getting confused between what's supposed to be an intermediate update vs the final result.

[1] https://github.com/openai/codex/issues/30364#issuecomment-48...

show 1 reply
zuzululutoday at 7:29 AM

I would switch to Claude if they kept Fable 5 in the sub

I'm also afraid to lose my "spot" if I leave codex and 5.6 is coming out so...