zlacker

[return to "Imagen, a text-to-image diffusion model"]
1. jonahb+Ve[view] [source] 2022-05-23 22:14:04
>>kevema+(OP)
I know that some monstrous majority of cognitive processing is visual, hence the attention these visually creative models are rightfully getting, but personally I am much more interested in auditory information and would love to see a promptable model for music. Was just listening to "Land Down Under" from Men At Work. Would love to be able to prompt for another artist I have liked: "Tricky playing Land Down Under." I know of various generative music projects, going back decades, and would appreciate pointers, but as far as I am aware we are still some ways from Imagen/Dalle for music?
◧◩
2. addand+Rf[view] [source] 2022-05-23 22:19:19
>>jonahb+Ve
I agree. How cool would it be to get an 8 min version of your favorite song? Or an instant DnB remix? Or 10 more songs in the style of your favorite album?
◧◩◪
3. exac+EP[view] [source] 2022-05-24 03:48:24
>>addand+Rf
You can sort of do that with https://fairuseify.ml
◧◩◪◨
4. jrh206+Wx1[view] [source] 2022-05-24 11:09:54
>>exac+EP
I believe that this tech is possible, but this site doesn't provide it. Look at the source of the page: it's just a bunch of sleeps and then you 'download' the same file you provided.
[go to top]