I actually think the vocals from ~2:00-~2:35 are pretty impressive there. It's wild to me that the models can play with tempo like that.
I've been listening to this across a variety of genres though, maybe these lyrics and vocals are more to your taste:
(similar to Opeth) https://suno.com/song/9ab8da05-c3f2-412d-80b4-c7d0b3ae840f?s...
(indie rock) https://suno.com/song/756dd139-4cba-4e40-b29c-03ace1c69673
I don't know but it doesn't impress me one bit? like I'm not trying to hate, but it just seems kind of like the model is given the track and then it tries to just follow it by matching words and then spitting them out, like as if it could talk about making a sandwich over some epic track and it'd sound the same?
like, LLMs are fantastic at generating patterns, so words that match and same with images etc.
But there's not much uniqueness? it's "impressive" like a savantic kind of ability to come up with rap, but it doesn't really product something I'd want to listen to..?
I listened to the metal thing and kind of the same thing?
It's very high fidelity, like the quality of the drums and etc it's quite impressive, but the vocals seem off? it's like a poem being read by TTS then transformed into "metal voice"
and kind of just an averaging of "metal music" kind of like stock photos and into a track, very formulaic
not to mention many metal bands etc they do formulaic stuff especially if they have an identifying kind of hit
But to me this is cool tech, but I wouldn't listen to it
I've listened music for a long time but I don't listen to a wide variety today, however for example with pop it can be very complex or very simple, but average or "almost" will really not make a good song, it can seem simple in hindsight but probably blood sweat and tears went into such songs, or creative energy that might never come back as strong.
just my raw thoughts though. it could be me being biased knowing it's AI, but I don't think so. I think my brain has kind of adapted to a point where I can feel if something is AI because it always seems super "average"/mid?
I'd love to see a blind study comparing a wide spectrum of these AI tracks to lesser known real artists (so the participants don't just recognize the songs) to see whether people can tell or if knowledge of the source biases them. I'm genuinely curious as to the results.
I don't think people would think anything strange of a lot of these tracks if they just randomly heard them on the radio.