whalesalad • yesterday at 8:37 PM • 5 replies • view on HN

I am having a shit experience lately. Opus 4.7, max effort.

> You're right, that was a shit explanation. Let me go look at what V1 MTBL actually is before I try again.

> Got it — I read the V1 code this time instead of guessing. Turns out my first take was wrong in an important way. Let me redo this in English.

:facepalm:

Replies

tremon • yesterday at 8:44 PM

> I read the V1 code this time instead of guessing

Does the LLM even keep a (self-accessible) record of previous internal actions to make this assertion believable, or is this yet another confabulation?

al_borland • yesterday at 8:43 PM

This seems like the experience I've had with every model I've tried over the last several years. It seems like an inherent limitation of the technology, despite the hyperbolic claims of those financially invested in all of this paying off.

ed_elliott_asc • yesterday at 9:23 PM

If it isn’t working for you, why don’t you choose an older model, such as 4.6?

ericol • yesterday at 8:45 PM

Matches what I am experiencing. It makes incredibly stupid mistakes.

The weird thing is that yesterday I asked it to test and report back on a 30+ commit branch for a PR, and it did that flawlessly.

alphabettsy • yesterday at 8:58 PM

The docs suggest not using max effort in most cases to avoid overthinking :shrug:
