SotA models have cracked a handful of research-level math problems though. The default Claude Code...

helloplanets • today at 5:19 AM • 1 reply • view on HN

SotA models have cracked a handful of research-level math problems though.

The default Claude Code style harness is bad for complicated problems as well. Just taking the specific class or function you're working on, and putting it into a deep research style loop yields way better results. Limiting the initial context by hand is still the way to go in a lot of cases.

Replies

habinero • today at 7:54 AM

No, it's the same for math from what I've seen, aka it can do some of the easy things, usually with a lot of help. People usually mean the Erdos problems (aka "a list of things Paul Erdos thought were neat") and, well, here:

> While Erdős generated a huge number of problems, they are not all equally significant and important. I have, unfortunately, seen some mathematicians grow dismissive of Erdős problems recently, perhaps because they have seen reports of AI solving problems on this site that turned out to be quite simple, and wrongly generalised this to assume that all problems posed by Erdős are amusing novelties, of the level of olympiad problems.

From: https://www.erdosproblems.com/forum/thread/blog:5

The rest of the article isn't about AI at all, but I did think it was funny that it describes mathematicians as having more or the same opinion as SWEs.

➕ show 1 reply

alt Hacker News

Replies