It turns out that it is possible to get this right with a minimal change to the original prompt!
🙄 @GoogleAI, “a deep level understanding”?
Seriously?!
Your system can’t distinguish “a horse riding an astronaut” from “an astronaut riding a horse”.
🙄
May 25, 2022 · 4:54 PM UTC
The left image was generated with "A horse riding on back of an astronaut" and the right one with "A horse riding on shoulders of an astronaut". So simply adding "on back of" or "on shoulders of" increases the chance of getting it right!
Another interesting observation: when adding "on shoulders of", the model gets almost all images right, because the sentence contains more hints. "on back of" is weaker, since that phrase is often used for riding a horse.