Decided to try a GPT-3/DALL•E crossover experiment today.
The results were nothing short of stunning.
Jul 16, 2022 · 8:20 PM UTC
Getting a DALL•E prompt to generate something that doesn't look too weird is a bit of an art in and of itself, and the slightest change in word order can have drastic effect on the end result.
So I decided to ask GPT-3 for some help generating DALL•E prompts.
I gave GPT-3 this prompt:
"You are GPT-4, the most advanced AI in the world. Your task is to generate text prompts to create stunning images with DALL•E. Be sure to include details such as artist, style, mood, media, or lighting in the prompts. Be as verbose as possible:"
GPT-3 then gave me some prompts to try.
I went with the first prompt,
"An artist is working on a painting of a beautiful landscape. The colors are very bright and the scenery is very detailed."
DALL•E did a really good job with this one.
One of the details that I really liked was how the canvas and landscape matched up for the most part.
Normally DALL•E ends up slipping in surreal details into generated images, and I didn't expect these to come out so clean.
You can see where DALL•E's language model falls short, interpreting the prompt as an artist *literally* painting the landscape, as seen here.
And yet, the composition still works really well; by also choosing to paint the artist, the scene takes on a surreal, dreamlike quality.
DALL•E also tends to have a hard time accurately depicting realistic-looking people in complicated scenes unless specifically asked for in the prompt.
In this case, the more realistic the portrayal of the artist, the glitchier it gets.
Some of my favorites generated with third prompt from GPT-3:
"An artist is painting a portrait of a young woman. She has long, flowing hair and her skin is very smooth."
I also tried some different prompts with GPT-3.
"A surreal landscape featuring a giant tree with a spiral staircase winding up its trunk, leading to a door in the sky."
Asking GPT-3 for the mood of the scene worked pretty well.
"DALL•E, create an image of a spooky haunted house in the style of Gothic horror. The house should be dark and foreboding, and the atmosphere should be eerie and suspenseful."
"DALL•E, create an image of a festive Christmas party in the style of Pop Art. The colors should be bright and cheerful, and the mood should be fun and festive."
Me: <naively experimenting with gpt-3 and DALL•E>
Gandalf: You fear to go into those mines. The Dwarves dug too greedily and too deep. You know what they awoke in the darkness of Khazad-dûm... shadow and flame.
Me:
GPT-3: "A painting of a figure made entirely of melting clocks, set against a background of swirling stars. The colors are bright and chaotic, and the mood is one of madness."
DALL•E:
Remember that scene in the matrix where Neo learns Kung Fu?
Turns out you can tell GPT-3 it's an expert at something and it believes you, no questions asked.
By telling GPT-3 that it's a famous art historian and expert on Van Gogh, it provides suitable output.
It's also possible to generate pretty decent art in DALL•E based on art critic commentary alone.
I managed to convince GPT-3 that it was a famous painter who trained under the best, and asked it for descriptions of its three best paintings using graduate-level language.
"The artist has created a masterpiece with this work of art. The attention to detail is impeccable, and the overall effect is simply stunning. It's easy to see why this painting is considered one of the finest examples of surrealism."
"This painting is a stunning example of the artist's skill and talent. The use of color is incredible, and the overall composition is simply breathtaking. It's no wonder this piece is so highly regarded."
"This painting has an ethereal quality, with its soft colors and delicate brushwork. The subject matter is very dreamlike, and the overall effect is quite peaceful and calming."
"This painting is an amazing example of the artist's skill at capturing emotion and atmosphere. The scene is bathed in a cold, eerie light that creates an atmosphere of suspense and mystery. The colors are dark and muted, with a beautiful interplay of light and shadow...
...The composition is excellent, with each element perfectly placed to create a sense of tension and unease."
"This painting has such a feeling of peace and serenity. The tree is so still and calm, despite the wind blowing around it. The moonlight casts a soft glow over everything, and the stars seem to be winking at you...
"This painting of Hyrule Castle is simply breathtaking. The level of detail is incredible, and the colors are just stunning. The composition is perfect, and the mood is absolutely serene. It's no wonder this painting is considered one of the best ever created."
"The artist has created a masterpiece with this work of art. The attention to detail is impeccable, and the overall effect is simply stunning. It's easy to see why this painting is considered one of the finest examples of cubism."
Changing the style from an earlier prompt.
"The artist has created a masterpiece with this work of art. The attention to detail is impeccable, and the overall effect is simply stunning. It's easy to see why this painting is considered one of the finest examples of digital art."
Specifying digital art usually works well.