Hacker News

raylad, today at 1:22 AM

I am using 4.7 with the default extra high thinking, and it is clearly very stupid. It's worse than old Sonnet 4.5.

I had it suggest some parameters for BCFtools and it suggested parameters that would do the opposite of what I wanted to do. I pointed out the error and it apologized.

It also takes no initiative to check things itself (e.g., file contents) and instead wants me to check them.

And it claims that things are "too complex" or "too difficult" when they are super easy. For instance, with refreshing an AWS token, it somehow couldn't figure out that you could do that in a cron task.

A really, really bad downgrade. I will be using Codex more now, sadly.


Replies

sothatsit, today at 1:33 AM

You can’t make up your mind about a model by using it on one task. Calling it such a bad downgrade on that basis is ludicrous. I’ve had great experiences with it this morning.

solenoid0937, today at 2:45 AM

It's been dramatically better than any model I have ever used before on my tasks.