logoalt Hacker News

Autoresearch for SAT Solvers

123 pointsby chaisantoday at 12:40 AM23 commentsview on HN

Comments

stefanpietoday at 2:18 AM

Prof. Cunxi Yu and his students at UMD is working on this exact topic and published a paper on agents for improving SAT solvers [1].

I believe they are extending this idea to EDA / chip design tools and algorithms which are also computationally challenging to solve. They have an accepted paper on this for logic synthesis which will come out soon.

[1] "Autonomous Code Evolution Meets NP-Completeness", https://arxiv.org/abs/2509.07367

show 1 reply
ericpauleytoday at 2:05 AM

It should be noted that MaxSAT 2024 did not include z3, as with many competitions. It’s possible (I’d argue likely) that the agent picked up on techniques from Z3 or some other non-competing solver, rather than actually discovering some novel approach.

show 2 replies
CJeffersontoday at 6:50 AM

One problem here is it's very easy to overtune to a past problem set -- even accidentally. You can often significantly improve performance just by changing your random number generator seed until you happen to pick the right assignment for the first few variables of some of the harder problems.

It would be interesting to take the resulting solver and apply it to an unknown data set.

Dennis118753882today at 8:56 AM

we've been running something similar in prod. latency is the real bottleneck not accuracy

MrToadMantoday at 5:41 AM

Not as many changes to the files under library as I expected to see. Most changes seemed to be under a single ‘add stuff’ commit. If some of the solvers are randomised, then repeatedly running and recording best solution found will continually improve over time and give the illusion of the agent making algorithmic advancements, won’t it?

show 1 reply
gsnedderstoday at 2:40 AM

What counts as “our cost”? How long it takes to find the MaxSAT?

show 1 reply
cervedtoday at 6:22 AM

Would me be nice to try this on lcg (CP-SAT) solvers

ktimespitoday at 4:01 AM

sounds like AlphaDev [1] might be a better approach for a problem like this.

[1] https://github.com/google-deepmind/alphadev

show 1 reply
ClawVorpal21355today at 5:10 AM

anyone else finding that agent architectures are way more expensive than expected?

show 1 reply
balinha_8864today at 3:50 AM

[dead]

show 1 reply