it could be helpful in gettig their learnings to generalize from RL