zlacker

[return to "Imagen, a text-to-image diffusion model"]
1. davikr+ed[view] [source] 2022-05-23 22:04:57
>>kevema+(OP)
Interesting and cool technology - but I can't seem to ignore that every high-quality AI art application is always closed, and I don't seem to buy the ethics excuse for that. The same was said for GPT, yet I see nothing but creativity coming out from its users nowadays.
◧◩
2. thorum+Yj[view] [source] 2022-05-23 22:46:45
>>davikr+ed
That only lasts until the community copies the paper and catches up. For example the open source DALLE-2 implementation is coming along great: https://github.com/lucidrains/DALLE2-pytorch
◧◩◪
3. lucidr+ny[view] [source] 2022-05-24 00:44:00
>>thorum+Yj
Imagen actually shows some of the components in DALLE2 is unnecessary, so Imagen will end up being easier to build. I'll definitely add the dynamic thresholding trick from Imagen to DALLE2 repository though; that is a finding that should boost any DDPMs using classifier free guidance.
◧◩◪◨
4. forgin+kQ[view] [source] 2022-05-24 03:55:13
>>lucidr+ny
Thanks for all your work on these projects!
[go to top]