The hard part is defining what a mistake is. If you ask Claude to write code and it works, but you don't like the approach is that a mistake? If it generates a UI with the wrong colors, but everything else is correct, does that count? The amount of subjectivity alone makes it too difficult and nearly impossible for a refund system to be implemented properly.