Stable-audio and MusicGen sounds better than Jukebox.
But the best so far is Suno.ai ( https://app.suno.ai ) especially with their V3 model they have very impressive results, the fidelity is not studio quality but they're getting very close.
It's very likely based on their TTS model they have released before (Bark), but trained on more data and with higher resolution.
I tried a dozen of different prompts in suno ai to generate some music - it completely ignored them. It just generated some simple pop sounding tunes every time. I’m not impressed.
The lyrics on those songs are basically a pastiche of everything popular. But just based on sound it's pretty convincing. I bet you could train a generative AI to push out trance bangers as long as they don't have any lyrics.
I mean the user wrote the lyrics (or maybe ChatGPT lol). People of course write more interesting ones (Just found a Ukrainian about Kyiv: https://sonauto.ai/songs/xRDqe57ZgT6QrFXiIzF0). There was another one about Frank Sinatra being attacked with a baseball bat but I can't find it rn.
I just tried Suno and the results to me are terrible.
It seems designed for making pop music no one will listen to.
I have spent many hours with MusicLM making wild experimental music no one will listen to.
MusicLM has no problem making really weird sound combinations.
I just gave SunoAI some of my MusicLM prompts I have saved and the results are garbage. The problem with the AI test kitchen model though is the results sound like they are in mono.
The ultimate for me will be when we can make rap/hiphop no one will listen to.