zlacker

[return to "Imagen, a text-to-image diffusion model"]
1. y04nn+D7[view] [source] 2022-05-23 21:33:33
>>kevema+(OP)
Really impressive. If we are able to generate such detailed images, is there anything similar for text to music? I would I though that it would be simpler to achieve than text to image.
◧◩
2. nomel+79[view] [source] 2022-05-23 21:41:36
>>y04nn+D7
Compare the size of a raw image file to a raw music file, to get an idea of the complexity difference.
◧◩◪
3. penney+ua[view] [source] 2022-05-23 21:49:47
>>nomel+79
Think sheet music, not an mp3
◧◩◪◨
4. nomel+AT5[view] [source] 2022-05-25 16:58:42
>>penney+ua
Fair enough, but that's a little dissimilar to what's being done with these images. These images are a per-pixel construction.
[go to top]