A couple drawbacks so far via our scenario-based tests: 1. You can't ask the model to "t...

atlex2 • today at 1:23 AM • 0 replies • view on HN

A couple drawbacks so far via our scenario-based tests:

1. You can't ask the model to "think hard" about something anymore - model decides 2. Reasoning traces are no longer true to the thinking – vs opus 4.6, they really are summaries now 3. Reasoning is no longer consciously visible to the agent

They claim the personality is less warm, but I haven't experienced that yet with the prompts we have – seems just as warm, just disconnected from its own thought processes. Would be great for our application if they could improve on the above!

alt Hacker News