I think it's an extremely important distinction because self supervised learning has real inherent reward signals. Something like clustering does not.