The assertion in the issue report is that Claude saw a sharp decline in quality over the last few months. However, the report itself was allegedly generated by Claude.

Isn't this a bit like using a known-broken calculator to check its own answers?

If a known-broken calculator claims it's broken, I more or less concur. (Chain of reasoning omitted here.)