Yes this is flaw on we train them, we must rethink on how rewards reinforced learning works but that doesn't mean its not fixable, that doesn't mean progress must stop

if the earliest inventor of plane think like you, human would never conquer skies we are in explosive growth that many brightest mind in planet get recruited to solve this problem, in fact I would be baffled if we didn't solve this by the end of year

if humankind cant fix this problem, just say goodbye at those sci-fi interplanetary tech

Wow. That's... one hell of a leap you're making.