zlacker

[return to "Imagen, a text-to-image diffusion model"]
1. jonahb+Ve[view] [source] 2022-05-23 22:14:04
>>kevema+(OP)
I know that some monstrous majority of cognitive processing is visual, hence the attention these visually creative models are rightfully getting, but personally I am much more interested in auditory information and would love to see a promptable model for music. Was just listening to "Land Down Under" from Men At Work. Would love to be able to prompt for another artist I have liked: "Tricky playing Land Down Under." I know of various generative music projects, going back decades, and would appreciate pointers, but as far as I am aware we are still some ways from Imagen/Dalle for music?
◧◩
2. addand+Rf[view] [source] 2022-05-23 22:19:19
>>jonahb+Ve
I agree. How cool would it be to get an 8 min version of your favorite song? Or an instant DnB remix? Or 10 more songs in the style of your favorite album?
◧◩◪
3. exac+EP[view] [source] 2022-05-24 03:48:24
>>addand+Rf
You can sort of do that with https://fairuseify.ml
◧◩◪◨
4. aemble+Ap1[view] [source] 2022-05-24 09:50:10
>>exac+EP
I tried that site and the music sounds the same. I wonder if you can use this to bypass YouTube content ID check.
[go to top]