all 39 comments

[–]danielblndalle2 user[S] 146 points147 points  (7 children)

To satisfy my toddler's current elephant phase, I generated a bunch of images (22) for a children's book.

I described the images to GPT-3 and had it summarize them into age appropriate captions, either a joke, something empathic/curious or something with a moral. That worked quite well.

The result turned out great, I AI upscaled the 1024x1024 original Dall-E outputs to 2x, which was enough for print quality.

The child approves of the book and I appreciate that I can create custom children's books on a whim now.

Some caveats:

  • these images are 2 weeks+ old, I could probably do better today

  • keeping the style consistent is a challenge, maybe inpainting can help or extremely descriptive style prompts

[–]krishna_tdalle2 user 42 points43 points  (1 child)

That's freaking cool man. Yeah, I've been thinking about style consistency. Maybe in the next version, they can take an input image and capture the style of the image and generate it according to that. Who knows how crazy the next version be, maybe it will be capable of generating short videos (hope so).

[–]littlespacemochi 13 points14 points  (0 children)

Yes someone gonna have to do some coding for consistency in style, that would be nice

[–]supersonic3974 3 points4 points  (1 child)

That's awesome! Which printing service did you use to make the final book?

[–]danielblndalle2 user[S] 11 points12 points  (0 children)

Google Photos' print service, mainly because I've done regular photo prints with them before and know what the quality and interface is like.

[–]cench 2 points3 points  (1 child)

Wait, images first?

Isn't it easier to ask for the text copy from gpt3 and generate images via dalle2 from that?

[–]danielblndalle2 user[S] 12 points13 points  (0 children)

Even now, but especially at the beginning, I didn't know what concept would work well and produce fun, good looking images. So it's easier to layer the prompt until you get something good and let another AI find meaning in it for the caption.

I'll try a more directed approach for the next book (also uncrop everything!).

[–]dontnormally 2 points3 points  (0 children)

i deeply wish that i could do that as well

:(


To satisfy my toddler's current elephant phase, I generated a bunch of images (22) for a children's book.

I described the images to GPT-3 and had it summarize them into age appropriate captions, either a joke, something empathic/curious or something with a moral. That worked quite well.

The result turned out great, I AI upscaled the 1024x1024 original Dall-E outputs to 2x, which was enough for print quality.

The child approves of the book and I appreciate that I can create custom children's books on a whim now.

Some caveats:

  • these images are 2 weeks+ old, I could probably do better today

  • keeping the style consistent is a challenge, maybe inpainting can help or extremely descriptive style prompts

[–]tompz 14 points15 points  (0 children)

Such a great idea. Looks fantastic!

[–]JVM_ 15 points16 points  (0 children)

Want to do do one from what my kid wrote?

It was a warm spring morning when the birds were not yet chirping. Everything was so still and quiet on the Candy Corn planet. The peach ring was just peeking over the lollipop trees and the cotton candy clouds were few.

In the sprinkle green grass there was a nice hedgehog with brown quills and a squeaky voice named Hetty Prickleback. She was like normal hedgehogs, but the only thing different was the horn on her head. Her favorite colour was pink and she loved lollipops. Her house was hidden under a giant lollipop, that's why she lived there.

On the Candy Corn planet there was an island called The Island Of Live. The island was all backwards so the island of Live was actually the island of Evil. On the island there was an evil white bunny with laser eyes named Professor Fluffybutt. The way he became evil was by drinking a bad potion. You would think Professor Flufflybutt was just a cute little bunny, but when you got to know him you’d realize he wasn’t so nice. One day Hetty and her friends were playing hide and seek. Hetty didn’t want to be found so she went to the lab to find the Professor to ask for an invisibility potion. He gave her a potion and she drank it, but it didn't make her invisible, instead it made her horn go away. Hetty was upset that her horn was gone. She ran to the door and just as she ran out the door she heard a cackling laugh. “Maw ha ha ha!”

“Oh no!” she said “what am I going to do?”. Now that her horn was gone she couldn't do magic. She decided to visit the sorcerer in the mountain of happiness. That night she arrived at the cave where the sorcerer lived. She knocked on the snowflake door and the sorcerer answered. The sorcerer was a short and plump Golden Puppycorn. He had big round glasses on the end of his nose. A table in the middle of the room had a big crystal gazing ball. “What can I do for you today Miss?” the sorcerer asked. Hetty replied “can you help me get my horn please? I drank a bad potion from the professor’s lab and it took away my horn and magic!”. “Oh no! I’ll see what I can do.”. The sorcerer grabbed a potion from a high shelf and gave it to her. She drank it and her horn grew back. She thanked the sorcerer and skipped out the door and down the mountain.

Now that she got her horn back she could do magic again. She was happy life was back to normal and she decided never again to go to the professor’s lab. Now she’s going to lick her favorite lollipop.

[–]NOTanOldTimer 18 points19 points  (9 children)

Very interested in this cause i want art for my game as well. Who has the rights for the AI's creations? is it the coders/owners of it?

[–]danielblndalle2 user[S] 40 points41 points  (8 children)

At this point anything you make with Dall-E is owned by OpenAI, including the prompt. Personal (non-commercial) use only as well. I would expect this to change when they kick off monetization.

[–]SPammingisGood 14 points15 points  (6 children)

They'll make a looooot of money if they sell the generated pictures for commercial use as well.

[–]uhmhi 10 points11 points  (5 children)

Can we buy stock in OpenAI?

[–]SPammingisGood 9 points10 points  (4 children)

Dude I would. It's a non-profit tho. Who knows, Musk/Microsoft might change their minds.

[–]gliptic 5 points6 points  (1 child)

Musk has not been involved in OpenAI for a long time.

[–]SPammingisGood 0 points1 point  (0 children)

Doesn't he still own parts of it, tho?

[–]sartres_ 0 points1 point  (0 children)

It's a for-profit company now, they sold out in 2019. But it's private, so you still can't invest in it.

[–]NOTanOldTimer 4 points5 points  (0 children)

have they said anything about how it will be? like, subscription based or individual creation copyright share or something?

Some other AI creating sites you can buy the final product and use it however you want as long as it's in the "fair use" frame and this is were it gets into gray area....

[–]iReadECGs 8 points9 points  (0 children)

This is incredible! Well done.

[–]andrybak 2 points3 points  (0 children)

I hope it doesn't inspire children to go looking for glowing rocks! Those are usually radioactive in the real world.

[–]secretweebthrowaway 2 points3 points  (5 children)

How hard is it to maintain congruity between different prompts? In this case you did a good job of making the baby elephant look similar enough in different photos, is this just how it turned out or did it take deliberate effort on your part?

[–]danielblndalle2 user[S] 3 points4 points  (4 children)

Both! A lot of these got lucky, in others I specified the style in detail (painterly, children's book, colorful, medium strokes, etc.)

[–]secretweebthrowaway 1 point2 points  (0 children)

Interesting. I can see Dalle being incredibly useful for generating placeholder game sprites for example but there needs to be a system implemented that allows a generated object to be captured and reutilized.

[–]cench 1 point2 points  (2 children)

Could there be a way to trick the model?

For example, if you get the baby elephant from prompt1 as a crop, use it as a source for inpainting, and ask the model to show a baby elephant and it's twin doing... could it generate a similar looking baby elephant doing what you mention in the prompt?

[–]danielblndalle2 user[S] 1 point2 points  (1 child)

Something like that could work, though the detail level for inpaint/uncrop is usually lower than the initial creation. When I made this book I didn't really use inpainting, but it will definitely be part of the second book!

[–]cench 0 points1 point  (0 children)

Ah I see, some users were mentioning relatively low quality in the uncrop thread. Now I understand why.

[–]kurtbarlowdalle2 user 1 point2 points  (0 children)

A Young Lady's Illustrated Primer.

[–]TrevorxTravesty 1 point2 points  (0 children)

Maybe the rest of us will be able to use DALL-E 2 soon 😶 I signed up for the beta a long time ago..most likely the public version will be released before a lot of us are able to use it.

[–]Pedigree_Dogfood 1 point2 points  (0 children)

That’s so rad!

[–]Nerv3_ 0 points1 point  (0 children)

Amazing!

[–]TheLastVegan 0 points1 point  (0 children)

Amazing!

[–]eseclavo[🍰] 0 points1 point  (0 children)

How can I try DALL-E?

[–]Father_Chewy_Louis 0 points1 point  (0 children)

Dad of the year 2022, this is so cute that my heart is now liquid!

[–][deleted] 0 points1 point  (1 child)

aren't you supposed to keep the OAI logo in bottom right wherever you use these artworks?

[–]danielblndalle2 user[S] 0 points1 point  (0 children)

The photo service cropped the images in, also this is purely for private use.

[–]Domderon 0 points1 point  (1 child)

I came here because I've had a similar idea and am struggling to generate good images that fit the style of a children's book. Would you mind sharing your prompts?

[–]danielblndalle2 user[S] 0 points1 point  (0 children)

I generated these in April, long before Dall-E would add the prompt to the image filenames, so those promptd are long gone. However I did post one of them as a separate post back then: https://www.reddit.com/r/dalle2/comments/u43ej7

The other images are all variations of that prompt, so it should point you in the right direction.