Google Imagen 2

>>geox+(OP)
I think the competition for text-to-image services is over and open source, Stable Diffusion, won. It doesn't matter how detailed (or whatever counts as "better") corporate text-to-image products get; Stable Diffusion is good enough, which really is good enough. Unlike the corporate offerings, open source txt2img doesn't have random restrictions (no, it's not just porn at this point) and actually allows for additional scripts/tooling/models. If you're attempting to do anything on a professional level or produce an image with specific details via txt2img, you likely have a workflow with txt2img being only step one.

Why bother using a product from a company that is notorious for failing to commit to most of their services, when you can run something which produces output that is pretty close (and maybe better) and is free to run and change and train?

>>boh+VY
SD still can't do interactions (between people, objects) as well as DALL-E 3 can. I hope that improves. And unfortunately this isn't like software where we can just slowly build a better open source version. This costs millions to train. I hope that as the hardware and algorithms improve (and perhaps the datasets as well) it won't be that way in the future. Random Kickstarters can get hundreds of thousands easily and I think we could see something like that with something like SD as well in the future.
