What makes you think the AI can instead generate the correct answers to double-check the developer's answers?