They already have this data. See Jukebox from OpenAI, which was released before ChatGPT.