logoalt Hacker News

adamgordonbelltoday at 2:30 AM10 repliesview on HN

Here is the chat:

    don't search the internet. This is a test to see how well you can craft non-trivial, novel and creative proofs given a "number theory and primitive sets" math problem. Provide a full unconditional proof or disproof of the problem.

    {{problem}}

    REMEMBER - this unconditional argument may require non-trivial, creative and novel elements.
Then "Thought for 80m 17s"

https://chatgpt.com/share/69dd1c83-b164-8385-bf2e-8533e9baba...


Replies

urutomtoday at 9:53 AM

What I find fascinating about the shared prompt isn’t just the result, but the visible thinking process. Math papers usually skip all the messy parts and just present the polished proof. But here you get something closer to their notepad. I also find it oddly endearing when the AI says things like “Interesting!” It almost feels like a researcher encouraging themselves after a small progress. It gives me rare feeling of watching the search itself, not just the final result.

petratoday at 12:41 PM

I don't haven ChatGPT but Gemini and Claude. But how do you make a language model think for 80 minutes ???

show 4 replies
chvidtoday at 10:13 AM

I am curious if there is a “harness” for maths out there (like the system prompt and tool collection in Claude code but for maths instead of coding)?

Asking the llm to structure its response in plan and implementation, allowing it to call tools like python, sage, lean etc.

nycdatascitoday at 4:11 AM

Tried w/ 5.5 Pro, Extended Thinking. 17 minutes:

-----------------------------

Yes. In fact the proposed bound is true, and the constant 1 is sharp.

Let w(a)= 1/alog(a)

I will prove that, uniformly for every primitive A⊂[x,∞), ∑w(a)≤1+O(1/log(x)) , which is stronger than the requested 1+o(1).

https://chatgpt.com/share/69ed8e24-15e8-83ea-96ac-784801e4a6...

cryptoegorophytoday at 3:17 AM

Mine took 20min. Pro. https://chatgpt.com/share/69ed83b1-3704-8322-bcf2-322aa85d7a... But I wish I was math smart to know if it worked or not.

show 2 replies
sfdlkj3jk342atoday at 11:31 AM

When using the web interface for ChatGPT like this, is there any way to tell which model is actually being used?

ipaddrtoday at 2:45 AM

Tried the same prompt and ended up no where close on the free plan.

show 3 replies
DeathArrowtoday at 9:54 AM

>don't search the internet.

I think this was key. Otherwise the LLM could think it can't be done.

show 1 reply
ArtIntoNihonjintoday at 5:00 AM

[dead]