logoalt Hacker News

deadbabeyesterday at 9:48 PM1 replyview on HN

Why would you use LLMs at all for that, can’t you just Monte Carlo this thing and be done with it?


Replies

GregorStocksyesterday at 9:53 PM

You still need an algorithm to decide, for each game that you're simulating, what actual decisions get made. If that algorithm is dumb, then you might decide Mono-Red Burn is the best deck, not because it's the best deck but because the dumb algorithm can play Burn much better than it can play Storm, inflating Burn's win rate.

In principle, LLMs could have a much higher strategy ceiling than deterministic decision-tree-style AIs. But my experience with mage-bench is that LLMs are probably not good enough to outperform even very basic decision-tree AIs today.

show 1 reply