They are training the model to 1. Produce code (as opposed to answer a question, write a poem, etc.) 2. Produce long enough output to be a valid solution. So they are doing exactly what I said. Cheers.
In layman, they are putting wet tyres on when it is raining and saying the car performs better over the next lap?
In layman, they are putting wet tyres on when it is raining and saying the car performs better over the next lap?