Statistically. Do many trials and measure how often it succeeds/fails.

Aka a benchmark.