logoalt Hacker News

otabdeveloper4today at 2:54 PM5 repliesview on HN

> the spicy autocomplete can solve difficult open math problems

No it can't. It can't even solve my son's 4th grade math homework. (This is a real use case for me, not a dumb benchmark.)

You just know nothing about math and are happy to parrot bullshit AI salesmen are selling you.


Replies

ConceptJunkietoday at 4:10 PM

Terrence Tao disagrees with what you're saying. I think he's in a slightly better position to speak on the subject.

skinner_today at 3:09 PM

> You just know nothing about math and are happy to parrot bullshit AI salesmen are selling you.

Not the parent poster here. I do know things about math. I wrote a few papers related to the unit distance problem (https://arxiv.org/abs/2311.10069, https://arxiv.org/abs/2406.15317) and spent quite some time trying to solve it. I had no chance of coming up with the proof that the spicy autocomplete came up with. Dumb benchmark, sure.

show 1 reply
sanderjdtoday at 3:15 PM

I would genuinely be interested in knowing what you're doing that led you to this conclusion.

I would be shocked if I was unable to solve 4th grade math homework with any of the contemporary frontier models. I spend most days using them to do significantly more complex things than that.

show 1 reply
threatofraintoday at 3:11 PM

We've already long past that threshold.

simonwtoday at 3:03 PM

Reasoning models with access to Python have been able to solve 4th grade math homework for over a year now. Prove me wrong: show me a 4th grade math problem they can't handle.

show 1 reply