logoalt Hacker News

comboylast Tuesday at 7:33 PM5 repliesview on HN

Unrelated, but Claude was performing so tragically last few days, maybe week(s), but days mostly, that I had to reluctantly switch. Reluctantly because I enjoy it. Even the most basic stuff, like most python scripts it has to rerun because of some syntax error.

The new reality of coding took away one of the best things for me - that the computer always just does what it is told to do. If the results are wrong it means I'm wrong, I made a bug and I can debug it. Here.. I'm not a hater, it's a powerful tool, but.. it's different.


Replies

scandinavianlast Wednesday at 6:19 AM

I'm not a big user, but I have been doing some vibe-ish coding for a PoC the past few days, and I'm astonished at how bad it is at python in particular (Opus 4.6 High).

* It likes to put inline imports everywhere, even though I specify in my CLAUDE.md that it should not.

* We use ruff and pyright and require that all problems are addressed or at least ignored for a good reason, but it straight up #noqa ignores all issues instead.

* For typing it used the builtin 'any' instead of typing.Any which is nonsense.

* I asked it to add a simple sum of a column from a related database table, but instead of using a calculated sum in SQL it did a classic n+1 where it gets every single row from the related table and calculates the sum in python.

Just absolute beginner errors.

show 3 replies
bluegattylast Tuesday at 9:12 PM

Codex with 5.4 xhigh. It's a bad communicator but does the job.

show 1 reply
pacha3000last Tuesday at 7:52 PM

I'm the first to be tired of everyone, for every model, that says "uuuh became dumber" because I didn't believe them

... until this week! Opus is struggling worse than Sonnet those last two weeks.

show 6 replies
bicepjailast Wednesday at 3:07 AM

Yes totally agree it’s regurgitating crazy expansive text like book author who needs to publish 10 books a day