Hacker News

vthallam yesterday at 7:03 PM (5 replies)

This model is great at long-horizon tasks, and Codex now has heartbeats, so it can keep checking on things. Give it your hardest problem, one that would take hours and has verifiable constraints, and you will see how good this is :)

*I work at OAI.


Replies

spaceman_2020 today at 9:08 AM

Is there any task that actually doesn't require human intervention in between, even if it's just to set up stuff?

Like, I will get Opus to make me an app, but it will stop partway through because I need to set up the db and plug in the API keys, and Opus really can't do that on its own yet.

thereeldeel today at 8:26 AM

Will the Codex app support a new context window, rather than compaction, for "unrelated" sub-tasks during long-horizon tasks?

dandaka yesterday at 7:23 PM

Could be a great feature, can't wait to test! Tired of other models (looking at you, Opus) constantly getting stuck mid-task lately.

dannyw yesterday at 7:47 PM

It's genuinely so great at long-horizon tasks! In our internal evals at Canva, GPT-5.5 solved many long-horizon frontier challenges, a first for any AI model we've tested :) Congrats on the launch!

bkyan yesterday at 10:43 PM

Sorry, what is "heartbeats", exactly?
