Once the bench is public it’s out and probably in the training data. Better to have your own and test it on a new model.