I would assume a good benchmark has hidden tests, or something randomly generated that is harder to game