The parent comment is describing a test they ran so they could assess their trust in the model for scenarios they don't have time to fully understand.

Do you not believe in running tests, evaluations, or experiments at all to better understand your environment?

The ROI in the case of a positive outcome is the reduced time needed to inspect the results in the future (the entire point of AI is to know what you can trust it on, so you can delegate everything at that level with less oversight). The ROI in the negative case is the tokens not wasted on tasks to ambitious for the model.