gwern can maybe comment here.
An actually scary thing is that AIs are getting okay at reproducing people’s voices.
Music, I'm afraid, appears stuck in the doldrums of small one-offs doing stuff like MIDI. Nothing like the breadth & quality of Jukebox has come out since it, even though it's super-obvious that there is a big overhang there and applying diffusion & other new methods would give you something like much like DALL-E 2 / Imagen for general music.
https://nonint.com/2022/05/04/friends-dont-let-friends-train...