Imagen, a text-to-image diffusion model

>>kevema+(OP)
Probably just a frontend coding mistake, and not an error in the model, but in the interactive example if you select:

"A photo of a Shiba Inu dog Wearing a (sic) sunglasses And black leather jacket Playing guitar In a garden"

The Shiba Inu is not playing a guitar.

>>codemo+My
There are visible “alignment” issues in some of their examples still. The marble koala DJ in the paper doesn’t use several of the keywords.

They have an example “horse riding an astronaut” that no model produces a correct image for. It’d be interesting if models could explain themselves or print the caption they understand you as saying.

zlacker