Yeah, it's a search problem. When verification is cheap, reducing success rate in exchange for massively reducing cost and runtime is the right approach.
You underestimating the algorithmic complexity of such brute forcing, and the indirect cost of brittle code that's produced by inferior models
You underestimating the algorithmic complexity of such brute forcing, and the indirect cost of brittle code that's produced by inferior models