Simple bunch of rules and goals backed by extremely sophisticated visual intuition.
Pretty sure someone already tried throwing VLMs and diffusion models at this, wonder how that fared.