this is not really backed by any empirical evidence. there are simply more efficient means of verifying outputs than TDD.