zlacker

[return to "Imagen, a text-to-image diffusion model"]

>>kevema+(OP)
Really impressive. If we are able to generate such detailed images, is there anything similar for text to music? I would I though that it would be simpler to achieve than text to image.

>>y04nn+D7
Compare the size of a raw image file to a raw music file, to get an idea of the complexity difference.

>>nomel+79
Think sheet music, not an mp3

>>penney+ua
Fair enough, but that's a little dissimilar to what's being done with these images. These images are a per-pixel construction.

[go to top]