That's just because true statements are more likely to occur in their training corpus.
The overwhelming majority of true statements aren't in the training corpus, because of combinatorial explosion. What does it mean to say that they are more likely to occur there?
The training set is far too small for that to explain it.
Try to explain why one-shotting works.