In my experience, before LLMs, it was a valid proxy metric for effort.