Unless you train them with RL in the right task specifically