zlacker

[return to "Imagen, a text-to-image diffusion model"]
1. hn_thr+3j[view] [source] 2022-05-23 22:40:22
>>kevema+(OP)
As someone who has a layman's understanding of neural networks, and who did some neural network programming ~20 years ago before the real explosion of the field, can someone point to some resources where I can get a better understanding about how this magic works?

I mean, from my perspective, the skill in these (and DALL-E's) image reproductions is truly astonishing. Just looking for more information about how the software actually works, even if there are big chunks of it that are "this is beyond your understanding without taking some in-depth courses".

◧◩
2. rvnx+1k[view] [source] 2022-05-23 22:46:50
>>hn_thr+3j
Check https://github.com/multimodalart/majesty-diffusion or https://github.com/lucidrains/DALLE2-pytorch

There is a Google Colab workbook that you can try and run for free :)

This is the image-text pairs behind: https://laion.ai/laion-400-open-dataset/

[go to top]