I agree with you that transformers are probably not the architecture of choice. I'm not sure what that has to do with the viability of RL, though.