I asked it to prove the theoretical result in a (published, prize-winning, though not really for the theory) academic paper of mine. The proofs hadn’t been that hard, objectively, but they’d taken me at least a week. I fed it the model. It got the correct basic results in about 5 minutes.
Are you sure the models did not already have your exact proof in their training data?