#navbar { margin-top: 7em; } @media all and (max-width: 649px) { #navbar { margin-top: 10em; } }

Warning: JavaScript Disabled!

For support of key website features (link annotation popups/popovers & transclusions, collapsible sections, backlinks, tablesorting, image zooming, sidenotes etc.), you must enable JavaScript.

‘DALL·E 2’ directory

See Also
Links
Miscellaneous
Bibliography

See Also

Parent (‘DALL·E’ tag)

Links

“Coarse Is Better [DALL·E 2 vs MJv2 vs Nano Banana Pro]”, Borretti 2025

Coarse is Better [DALL·E 2 vs MJv2 vs Nano Banana Pro]

“The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation”, Carlsson et al 2024

The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation

“Why AI Isn’t Going to Make Art”, Chiang 2024

Why AI Isn’t Going to Make Art

“Epistemic Calibration and Searching the Space of Truth”, Lee 2024

Epistemic calibration and searching the space of truth

“AstroPT: Scaling Large Observation Models for Astronomy”, Smith et al 2024

AstroPT: Scaling Large Observation Models for Astronomy

“The Carbon Emissions of Writing and Illustrating Are Lower for AI Than for Humans”, Tomlinson et al 2024

The carbon emissions of writing and illustrating are lower for AI than for humans

“Where Memory Ends and Generative AI Begins: New Photo Manipulation Tools from Google and Adobe Are Blurring the Lines between Real Memories and Those Dreamed up by AI”, Goode 2023

Where Memory Ends and Generative AI Begins: New photo manipulation tools from Google and Adobe are blurring the lines between real memories and those dreamed up by AI

“Generalizable Synthetic Image Detection via Language-Guided Contrastive Learning”, Wu et al 2023

Generalizable Synthetic Image Detection via Language-guided Contrastive Learning

“TorToise: Better Speech Synthesis through Scaling”, Betker 2023

TorToise: Better speech synthesis through scaling

“3DALL·E: Integrating Text-To-Image AI in 3D Design Workflows”, Liu et al 2022

3DALL·E: Integrating Text-to-Image AI in 3D Design Workflows

“DALL·E 2 Is Seeing Double: Flaws in Word-To-Concept Mapping in Text2Image Models”, Rassin et al 2022

DALL·E 2 is Seeing Double: Flaws in Word-to-Concept Mapping in Text2Image Models

“DALL·E-Bot: Introducing Web-Scale Diffusion Models to Robotics”, Kapelyukh et al 2022

DALL·E-Bot: Introducing Web-Scale Diffusion Models to Robotics

“DALL·E Now Available Without Waitlist”, OpenAI 2022

DALL·E Now Available Without Waitlist

“Discovering Bugs in Vision Models Using Off-The-Shelf Image Generation and Captioning”, Wiles et al 2022

Discovering Bugs in Vision Models using Off-the-shelf Image Generation and Captioning

“Adversarial Attacks on Image Generation With Made-Up Words”, Millière 2022

Adversarial Attacks on Image Generation With Made-Up Words

“NUWA-∞: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis”, Wu et al 2022

NUWA-∞: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis

“Training Transformers Together”, Borzunov et al 2022

Training Transformers Together

“Compositional Visual Generation With Composable Diffusion Models”, Liu et al 2022

Compositional Visual Generation with Composable Diffusion Models

“DALL·E 2 Prompt Engineering Guide”, rendo1 & luc 2022

DALL·E 2 Prompt Engineering Guide

“Imagen: Photorealistic Text-To-Image Diffusion Models With Deep Language Understanding”, Saharia et al 2022

Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

“Hierarchical Text-Conditional Image Generation With CLIP Latents”, Ramesh et al 2022

Hierarchical Text-Conditional Image Generation with CLIP Latents

“DALL·E 2: Hierarchical Text-Conditional Image Generation With CLIP Latents § 7. Limitations and Risks”, Ramesh et al 2022 (page 16 org openai)

DALL·E 2: Hierarchical Text-Conditional Image Generation with CLIP Latents § 7. Limitations and Risks

“Make-A-Scene: Scene-Based Text-To-Image Generation With Human Priors”, Gafni et al 2022

Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

“DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-To-Image Generative Transformers”, Cho et al 2022

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers

“Medical Domain Knowledge in Domain-Agnostic Generative AI”, Kather et al 2022

Medical domain knowledge in domain-agnostic generative AI

“GLIDE: Towards Photorealistic Image Generation and Editing With Text-Guided Diffusion Models”, Nichol et al 2021

GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models

“Min(DALL·E) Is a Fast, Minimal Port of DALL·E-2”

min(DALL·E) is a fast, minimal port of DALL·E-2

“Prompt Design for DALL·E 2: Photorealism—Emulating Reality”, Merzmensch 2026

Prompt Design for DALL·E 2: Photorealism—Emulating Reality

The Bees

“Please Stop Using Mediocre AI Art in Your Posts”

Please stop using mediocre AI art in your posts

genekogan

Here’s what a few years of progress on text-to-image generation looks like, one prompt at a time. "Frank Sinatra as a purple alien in surrealist style": 1. AttnGAN (2018) • 2. CLIP+VQGAN (2020) • 3. CLIP+Diffusion (2021) • 4. DALL·E 2 (2022)

Sort By Magic

Annotations sorted by machine learning into inferred 'tags'. This provides an alternative way to browse: instead of by date order, one can browse in topic order. The 'sorted' list has been automatically clustered into multiple sections & auto-labeled for easier browsing.

Beginning with the newest annotation, it uses the embedding of each annotation to attempt to create a list of nearest-neighbor annotations, creating a progression of topics. For more details, see the link.

`ai-carbon-emissions`

[see previous entry]

`artificial-creativity`

[see previous entry]

[see previous entry]

`dall-e-availability`

[see previous entry]

[see previous entry]

`text-to-image`

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

[see previous entry]

Miscellaneous

Bibliography

https://www.newyorker.com/culture/the-weekend-essay/why-ai-isnt-going-to-make-art: “Why AI Isn’t Going to Make Art”, Ted Chiang

link-bibliography
https://arxiv.org/abs/2405.14930: “AstroPT: Scaling Large Observation Models for Astronomy”, Michael J. Smith, Ryan J. Roberts, Eirini Angeloudi, Marc Huertas-Company

link-bibliography
https://www.nature.com/articles/s41598-024-54271-x: “The Carbon Emissions of Writing and Illustrating Are Lower for AI Than for Humans”, Bill Tomlinson, Rebecca W. Black, Donald J. Patterson, Andrew W. Torrance

link-bibliography
https://openai.com/blog/dall-e-now-available-without-waitlist/: “DALL·E Now Available Without Waitlist”, OpenAI

link-bibliography
https://arxiv.org/abs/2208.08831#deepmind: “Discovering Bugs in Vision Models Using Off-The-Shelf Image Generation and Captioning”, Olivia Wiles, Isabela Albuquerque, Sven Gowal

link-bibliography
https://arxiv.org/abs/2207.09814#microsoft: “NUWA-∞: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis”, Chenfei Wu, Jian Liang, Xiaowei Hu, Zhe Gan, Jianfeng Wang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan

link-bibliography
https://arxiv.org/abs/2205.11487#google: “Imagen: Photorealistic Text-To-Image Diffusion Models With Deep Language Understanding”, Chitwan Saharia, William Chan, Saurabh Saxena, Lala Li, Jay Whang, Emily Denton, Seyed Kamyar Seyed Ghasemipour, Burcu Karagol Ayan, S. Sara Mahdavi, Rapha Gontijo Lopes, Tim Salimans, Jonathan Ho, David J. Fleet, Mohammad Norouzi

link-bibliography
https://arxiv.org/pdf/2204.06125#page=16&org=openai: “DALL·E 2: Hierarchical Text-Conditional Image Generation With CLIP Latents § 7. Limitations and Risks”, Aditya A. Ramesh, Prafulla Dhariwal, Alex Nichol, Casey Chu, Mark Chen

link-bibliography
https://arxiv.org/abs/2203.13131#facebook: “Make-A-Scene: Scene-Based Text-To-Image Generation With Human Priors”, Oran Gafni, Adam Polyak, Oron Ashual, Shelly Sheynin, Devi Parikh, Yaniv Taigman

link-bibliography

[Quote Of The Day]

[Site Of The Day]

[Annotation Of The Day]

[adblock public service announcement]