How do we learn what a good output actually is?