Art-Free Generative Models: Art Creation Without Graphic Art Knowledge
Revisiting Your Memory: Reconstruction of Affect-Contextualized Memory via EEG-guided Audiovisual Generation
Data Scaling Laws in Imitation Learning for Robotic Manipulation
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
Copying style, Extracting value: Illustrators’ Perception of AI Style Transfer and its Impact on Creative Labor
The Rise of Terminator Zero with Writer Mattson Tomlin & Director Masashi Kudo
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion
MAR: Autoregressive Image Generation without Vector Quantization
Adversarial Perturbations Cannot Reliably Protect Artists From Generative AI
Consistency-diversity-realism Pareto fronts of conditional image generative models
Interpreting the Weight Space of Customized Diffusion Models
Lateralization MLP: A Simple Brain-inspired Architecture for Diffusion
Dynamic Typography: Bringing Text to Life via Video Diffusion Prior
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data
TextCraftor: Your Text Encoder Can be Image Quality Controller
Improving Text-to-Image Consistency via Automatic Prompt Optimization
SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions
CMD: Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Atomically accurate de novo design of single-domain antibodies
Sketch2Manga: Shaded Manga Screening from Sketch with Diffusion Models
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Transparent Image Layer Diffusion using Latent Transparency
CartoonizeDiff: Diffusion-Based Photo Cartoonization Scheme
Discovering Universal Semantic Triggers for Text-to-Image Synthesis
Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift
Why a Chinese court’s landmark decision recognising the copyright for an AI-generated image benefits creators in this nascent field
Bridging the Gap: Sketch to Color Diffusion Model with Semantic Prompt Learning
Applying Conditional Information in Guiding Diffusion-Based method for Anime-Style Face Drawing
GenCast: Diffusion-based ensemble forecasting for medium-range weather
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation
DreamTuner: Single Image is Enough for Subject-Driven Generation
ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations
Self-conditioned Image Generation via Generating Representations
Retrieving Conditions from Reference Images for Diffusion Models
Analyzing and Improving the Training Dynamics of Diffusion Models
DiffiT: Diffusion Vision Transformers for Image Generation
MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation
AnyLens: A Generative Diffusion Model with Any Rendering Lens
Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models
Stability AI Explores Sale As Investor Urges CEO to Resign: Move Follows Letter from Investor Coatue Calling for Changes; Coatue Concerned about Stability AI’s Financial Position
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
MobileDiffusion: Subsecond Text-to-Image Generation on Mobile Devices
Generative Models: What do they know? Do they know things? Let’s find out!
Test-time Adaptation of Discriminative Models via Diffusion Generative Feedback
Diffusion Model Alignment Using Direct Preference Optimization
Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models
UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models
Compositional Abilities Emerge Multiplicatively: Exploring Diffusion Models on a Synthetic Task
Generalization in diffusion models arises from geometry-adaptive harmonic representation
Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack
Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation
MetaDiff: Meta-Learning with Conditional Diffusion for Few-Shot Learning
Provable Guarantees for Generative Behavior Cloning: Bridging Low-Level Stability and High-Level Behavior
FABRIC: Personalizing Diffusion Models with Iterative Feedback
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
SDXL § Micro-Conditioning: Conditioning the Model on Image Size
DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models
Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching
Evaluating the Robustness of Text-to-image Diffusion Models against Real-world Attacks
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Beyond Surface Statistics: Scene Representations in a Latent Diffusion Model
Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models
Artificial intelligence and art: Identifying the esthetic judgment factors that distinguish human & machine-generated artwork
Spontaneous symmetry breaking in generative diffusion models
Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust
Generalizable Synthetic Image Detection via Language-guided Contrastive Learning
Common Diffusion Noise Schedules and Sample Steps are Flawed
Diffusart: Enhancing Line Art Colorization with Conditional Diffusion Models
Continual Diffusion: Continual Customization of Text-to-Image Diffusion with C-LoRA
Reference-based Image Composition with Sketch via Structure-aware Diffusion Model
HyperDiffusion: Generating Implicit Neural Fields with Weight-Space Diffusion
Masked Diffusion Transformer is a Strong Image Synthesizer
Prompting AI Art: An Investigation into the Creative Skill of Prompt Engineering
TRACT: Denoising Diffusion Models with Transitive Closure Time-Distillation
Understanding the Diffusion Objective as a Weighted Integral of ELBOs
Adding Conditional Control to Text-to-Image Diffusion Models
Glaze: Protecting Artists from Style Mimicry by Text-to-Image Models
Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery
Msanii: High Fidelity Music Synthesis on a Shoestring Budget
DIRAC: Neural Image Compression with a Diffusion-Based Decoder
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Diffusion Transformers (DiTs): Scalable Diffusion Models with Transformers
Point·E: A System for Generating 3D Point Clouds from Complex Prompts
The Stable Artist: Steering Semantics in Diffusion Latent Space
Latent Video Diffusion Models for High-Fidelity Video Generation with Arbitrary Lengths
VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models
DreamArtist: Towards Controllable One-Shot Text-to-Image Generation via Contrastive Prompt-Tuning
Null-text Inversion for Editing Real Images using Guided Diffusion Models
InstructPix2Pix: Learning to Follow Image Editing Instructions
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
Rickrolling the Artist: Injecting Invisible Backdoors into Text-Guided Image Generation Models
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models
Imagic: Text-Based Real Image Editing with Diffusion Models
Hierarchical Diffusion Models for Singing Voice Neural Vocoder
Improving Sample Quality of Diffusion Models Using Self-Attention Guidance
Rectified Flow: A Marginal Preserving Approach to Optimal Transport
RealSinger: Ultra-Realistic Singing Voice Generation via Stochastic Differential Equations
g.pt: Learning to Learn with Generative Models of Neural Network Checkpoints
This Artist Is Dominating AI-Generated Art. And He’s Not Happy about It. Greg Rutkowski Is a More Popular Prompt Than Picasso
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise
Diffusion-QL: Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
Prompt-to-Prompt Image Editing with Cross Attention Control
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models
NUWA-∞: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
Compositional Visual Generation with Composable Diffusion Models
DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps
Elucidating the Design Space of Diffusion-Based Generative Models
Text2Human: Text-Driven Controllable Human Image Generation
Maximum Likelihood Training of Implicit Nonlinear Diffusion Models
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
Retrieval-Augmented Diffusion Models: Semi-Parametric Neural Image Synthesis
Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Autoencoders
Learning Fast Samplers for Diffusion Models by Differentiating Through Sample Quality
From data to functa: Your data point is a function and you should treat it like one
DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents
Itô-Taylor Sampling Scheme for Denoising Diffusion Probabilistic Models using Ideal Derivatives
High-Resolution Image Synthesis with Latent Diffusion Models
High Fidelity Visualization of What Your Self-Supervised Representation Knows About
More Control for Free! Image Synthesis with Semantic Diffusion Guidance
Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction
VQ-DDM: Global Context with Discrete Diffusion in Vector Quantized Modeling for Image Generation
Diffusion Autoencoders: Toward a Meaningful and Decodable Representation
Blended Diffusion for Text-driven Editing of Natural Images
Vector Quantized Diffusion Model for Text-to-Image Synthesis
Unleashing Transformers: Parallel Token Prediction with Discrete Absorbing Diffusion for Fast High-Resolution Image Generation from Vector-Quantized Codes
Restormer: Efficient Transformer for High-Resolution Image Restoration
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs
Progressive Distillation for Fast Sampling of Diffusion Models
DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior
CDM: Cascaded Diffusion Models for High Fidelity Image Generation
Learning to Efficiently Sample from Diffusion Probabilistic Models
Gotta Go Fast When Generating Data with Score-Based Models
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Learning Energy-Based Models by Diffusion Recovery Likelihood
Maximum Likelihood Training of Score-Based Diffusion Models
Score-Based Generative Modeling through Stochastic Differential Equations
NoGAN: Decrappification, DeOldification, and Super Resolution
Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning
Improving Sampling from Generative Autoencoders with Markov Chains
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
A Connection Between Score Matching and Denoising Autoencoders
Estimation of Non-Normalized Statistical Models by Score Matching
High-Resolution Image Synthesis With Latent Diffusion Models
Neonbjb/tortoise-Tts: A Multi-Voice TTS System Trained With an Emphasis on Quality
PaintsUndo: A Base Model of Drawing Behaviors in Digital Paintings
Keypoint Based Anime Generation With Additional CLIP Guided Tuning
A Closer Look Into The Latent-Diffusion Repo, Do Better Than Just Looking
Model Comparison Study for Disco Diffusion v. 5---PLMS Sampling Edition
Stability AI CEO Resigns Because You Can’t Beat Centralized AI With More Centralized AI
Case Study: Interpreting, Manipulating, and Controlling CLIP With Sparse Autoencoders
Generative Modeling by Estimating Gradients of the Data Distribution
2023-podell-figure2-datalossduetononsquareimageaspectratios.jpg
2023-podell-figure3-demonstrationofuseofsizeconditioningtogenerateblurredzoomedimages.png
2023-podell-figure4-olderstablediffusionmodelsshowcutoffheadsduetorandomcroppingbutnotsizeconditioningcomparedtosdxl.png
2023-podell-figure5-examplesofvaryingthecropconditioningsizetocontrolsdxloutput.png
2022-09-22-gwern-stablediffusionv14-textualinversion-yinit-dropcapsexperiments.png
2022-09-21-gwern-stablediffusionv14-circulardropcapinitialsamples.png
2022-09-20-novelai-kurumuz-animestablediffusion-asukasamples.png
2022-ashual-figure4-knndiffusionsamplescontrastedtoretrievedexemplarsandnoexemplars.png
2022-ashual-figure5-knndiffusiontexttophotographsamplescontrastedtoretrievedexemplarsandnoexemplars.png
2021-nichol-figure10-scalinglawsforddpmincomputevsnllfid.png
https://blog.metaphysic.ai/the-road-to-realistic-full-body-deepfakes/
https://blog.novelai.net/novelai-improvements-on-stable-diffusion-e10d38db82ac
https://bonfx.com/how-to-use-dreamstudio-stablediffusion-to-create-a-traditional-illustration/
https://cloud.google.com/blog/products/ai-machine-learning/imagen-2-on-vertex-ai-is-now-generally-available
https://colab.research.google.com/drive/1dlgggNa5Mz8sEAGU0wFCHhGLFooW_pf1
https://discuss.huggingface.co/t/decoding-latents-to-rgb-without-upscaling/23204/4
https://fortune.com/2023/11/29/stability-ai-sale-intel-ceo-resign/
https://generalrobots.substack.com/p/dimension-hopper-part-1
https://github.com/curiousjp/toy_sd_genetics?tab=readme-ov-file#toy_sd_genetics
https://github.com/marqo-ai/marqo/blob/mainline/examples/StableDiffusion/hot-dog-100k.md
https://github.com/vitoplantamura/OnnxStream/tree/846da873570a737b49154e8f835704264864b0fe
https://globalcomix.com/c/paintings-photographs/chapters/en/1/1
https://hforsten.com/identifying-stable-diffusion-xl-10-images-from-vae-artifacts.html
https://huggingface.co/Gustavosta/MagicPrompt-Stable-Diffusion
https://huggingface.co/Onodofthenorth/SD_PixelArt_SpriteSheet_Generator
https://huggingface.co/Ryukijano/CatCon-Controlnet-WD-1-5-b2R
https://jgeekstudies.org/2023/04/06/a-practical-implication-of-the-astolfo-effect-bias-in-ai-generated-images/
https://jxmo.notion.site/The-Weird-and-Wonderful-World-of-AI-Art-b9615a2e7278435b98380ff81ae1cf09
https://keras.io/examples/generative/random_walks_with_stable_diffusion/
https://lambdalabs.com/blog/inference-benchmark-stable-diffusion
https://medium.com/@catmus2048/not-only-is-stable-diffusion-2-0-not-bad-but-really-better-my-prompt-engineering-experiments-459fbc5cec2
https://medium.com/@enryu9000/anifusion-diffusion-models-for-anime-pictures-138cf1af2cbe
https://minimaxir.com/2022/11/stable-diffusion-negative-prompt/
https://nostalgebraist.tumblr.com/post/672300992964050944/franks-image-generation-model-explained
https://old.reddit.com/r/StableDiffusion/comments/y91pp7/stable_diffusion_v15/
https://paperswithcode.com/sota/text-to-image-generation-on-coco
https://pub.towardsai.net/stable-diffusion-based-image-compresssion-6f1f0a399202
https://replicate.com/tommoore515/material_stable_diffusion
https://research.google/blog/google-research-2022-beyond-language-vision-and-generative-models/
https://research.google/blog/mobilediffusion-rapid-text-to-image-generation-on-device/
https://saltacc.notion.site/saltacc/WD-1-5-Beta-3-Release-Notes-1e35a0ed1bb24c5b93ec79c45c217f63
https://sweet-hall-e72.notion.site/Mimicking-Diffusion-Models-by-Sequencing-Frequency-Coefficients-8e5a60e876d640c390369627d55330b1
https://talesofsyn.com/posts/creating-isometric-rpg-game-backgrounds
https://waxy.org/2022/08/exploring-12-million-of-the-images-used-to-train-stable-diffusions-image-generator/
https://www.404media.co/facebook-is-being-overrun-with-stolen-ai-generated-images-that-people-think-are-real/
https://www.bloomberg.com/news/features/2023-04-24/a-high-school-teacher-s-free-image-database-powers-ai-unicorns
https://www.crosslabs.org/blog/diffusion-with-offset-noise
https://www.facebook.com/marcello.herreshoff/posts/10160262954262798
https://www.hollywoodreporter.com/tv/tv-news/secret-invasion-ai-opening-1235521299/
https://www.justice.gov/opa/pr/man-arrested-producing-distributing-and-possessing-ai-generated-images-minors-engaged
https://www.reddit.com/r/Bard/comments/1795exq/google_sge_image_generation_is_so_good_at/
https://www.reddit.com/r/MachineLearning/comments/ykxr4v/p_made_a_text_generation_model_to_extend_stable/
https://www.reddit.com/r/StableDiffusion/comments/10ikjxg/me_and_some_friends_are_working_on_a_fanai_that/
https://www.reddit.com/r/StableDiffusion/comments/10v9z6m/v11_of_our_mfkonosuba_model_is_out_heres_some/
https://www.reddit.com/r/StableDiffusion/comments/11f4zgt/remixing_memes_with_multi_controlnet_is/
https://www.reddit.com/r/StableDiffusion/comments/12huyk4/evaluation_of_the_latent_horniness_of_the_most/
https://www.reddit.com/r/StableDiffusion/comments/14ssg1g/stable_diffusion_attracts_various_enthusiasts/
https://www.reddit.com/r/StableDiffusion/comments/152wtrh/sdxl_recognises_the_styles_of_thousands_of/
https://www.reddit.com/r/StableDiffusion/comments/15aapcb/sdxl_10_is_out/
https://www.reddit.com/r/StableDiffusion/comments/18r7mqf/top_online_nsfw_creators_updated/
https://www.reddit.com/r/StableDiffusion/comments/1bsi2xs/the_experiment/
https://www.reddit.com/r/StableDiffusion/comments/1by0dgs/nsfw_thumbnail_but_sfw_wallpaper_illusions/
https://www.reddit.com/r/StableDiffusion/comments/1c4oytl/some_examples_of_pixart_sigmas_excellent_prompt/
https://www.reddit.com/r/StableDiffusion/comments/1expa9n/fake_body_transformation_photos_from_fitness/
https://www.reddit.com/r/StableDiffusion/comments/1fm368h/cats_with_hairdos_flux_lora_thats_all_it_does/
https://www.reddit.com/r/StableDiffusion/comments/1gdkpqp/the_gory_details_of_finetuning_sdxl_for_40m/
https://www.reddit.com/r/StableDiffusion/comments/xr8cs8/brutalist_joi_dreambooth_training_combined_with/
https://www.reddit.com/r/StableDiffusion/comments/yhikn3/new_dreambooth_model_classic_animation_styles/
https://www.reddit.com/r/StableDiffusion/comments/ylroyp/made_in_abyss_dreambooth_model_i_am_working_on/
https://www.reddit.com/r/StableDiffusion/comments/ys434h/animating_generated_face_test/
https://www.reddit.com/r/StableDiffusion/comments/yv75hb/prompt_to_create_double_exposure_images_workflow/
https://www.reddit.com/r/StableDiffusion/comments/z0xyk2/dreambooth_model_for_cutting_machines/
https://www.reddit.com/r/aigamedev/comments/142j3yt/valve_is_not_willing_to_publish_games_with_ai/
https://www.reddit.com/r/sdnsfw/comments/ylo4eh/huge_list_of_sexy_tested_photorealism_keywords/
https://www.samdickie.me/writing/experiment-1-creating-a-landing-page-using-ai-tools-no-code
https://www.stavros.io/posts/compressing-images-with-stable-diffusion/
https://www.technologyreview.com/2024/04/10/1091053/generative-ai-turn-your-most-precious-memories-into-photos/
https://www.theguardian.com/music/2022/feb/18/confucius-beowulf-and-an-ai-called-kevin-everything-everythings-search-for-hope-in-strange-places
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers
https%253A%252F%252Farxiv.org%252Fabs%252F2410.10629%2523nvidia.html
Copying style, Extracting value: Illustrators’ Perception of AI Style Transfer and its Impact on Creative Labor
https%253A%252F%252Farxiv.org%252Fabs%252F2409.15997%2523novelai.html
Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget
Why a Chinese court’s landmark decision recognising the copyright for an AI-generated image benefits creators in this nascent field
https%253A%252F%252Fwww.scmp.com%252Ftech%252Ftech-trends%252Farticle%252F3248510%252Fwhy-chinese-courts-landmark-decision-recognising-copyright-ai-generated-image-benefits-creators.html
https%253A%252F%252Fwww.databricks.com%252Fblog%252Fcategory%252Fgenerative-ai%252Fmosaic-research.html
DiffiT: Diffusion Vision Transformers for Image Generation
MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation
https%253A%252F%252Farxiv.org%252Fabs%252F2311.18829%2523microsoft.html
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
https%253A%252F%252Farxiv.org%252Fabs%252F2311.17042%2523stability.html
Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models
UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
https%253A%252F%252Farxiv.org%252Fabs%252F2311.09257%2523google.html
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
https%253A%252F%252Farxiv.org%252Fabs%252F2311.04145%2523alibaba.html
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images
Jonathan Frankle—Chief Neural Network Scientist at Databricks
Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack
https%253A%252F%252Farxiv.org%252Fabs%252F2309.15807%2523facebook.html
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
https%253A%252F%252Farxiv.org%252Fabs%252F2307.01952%2523stability.html
SDXL § Micro-Conditioning: Conditioning the Model on Image Size
https%253A%252F%252Farxiv.org%252Fpdf%252F2307.01952%2523page%253D3%2526org%253Dstability.html
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Artificial intelligence and art: Identifying the esthetic judgment factors that distinguish human & machine-generated artwork
%252Fdoc%252Fai%252Fnn%252Ftransformer%252Fgpt%252Fdall-e%252F1%252F2023-samo.pdf.html
Masked Diffusion Transformer is a Strong Image Synthesizer
https%253A%252F%252Farxiv.org%252Fabs%252F2303.01469%2523openai.html
Glaze: Protecting Artists from Style Mimicry by Text-to-Image Models
https%253A%252F%252Farxiv.org%252Fabs%252F2302.01329%2523google.html
https%253A%252F%252Fraw.githubusercontent.com%252Fflavioschneider%252Fmaster-thesis%252Fmain%252Faudio_diffusion_thesis.pdf.html
https%253A%252F%252Farxiv.org%252Fabs%252F2212.10562%2523google.html
Point·E: A System for Generating 3D Point Clouds from Complex Prompts
https%253A%252F%252Farxiv.org%252Fabs%252F2212.08751%2523openai.html
InstructPix2Pix: Learning to Follow Image Editing Instructions
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
https%253A%252F%252Farxiv.org%252Fabs%252F2211.01324%2523nvidia.html
Hierarchical Diffusion Models for Singing Voice Neural Vocoder
https%253A%252F%252Farxiv.org%252Fabs%252F2210.07508%2523sony.html
https%253A%252F%252Farxiv.org%252Fabs%252F2210.03142%2523google.html
g.pt: Learning to Learn with Generative Models of Neural Network Checkpoints
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation
https%253A%252F%252Farxiv.org%252Fabs%252F2208.12242%2523google.html
Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise
An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion
NUWA-∞: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
https%253A%252F%252Farxiv.org%252Fabs%252F2207.09814%2523microsoft.html
https%253A%252F%252Farxiv.org%252Fabs%252F2205.16007%2523microsoft.html
Imagen: Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
https%253A%252F%252Farxiv.org%252Fabs%252F2205.11487%2523google.html
High-Resolution Image Synthesis with Latent Diffusion Models
More Control for Free! Image Synthesis with Semantic Diffusion Guidance
https%253A%252F%252Farxiv.org%252Fabs%252F2106.09685%2523microsoft.html
CDM: Cascaded Diffusion Models for High Fidelity Image Generation
https%253A%252F%252Fcascaded-diffusion.github.io%252F.html
https%253A%252F%252Farxiv.org%252Fabs%252F2105.05233%2523openai.html
https%253A%252F%252Farxiv.org%252Fabs%252F2104.07636%2523google.html
https%253A%252F%252Farxiv.org%252Fabs%252F2102.09672%2523openai.html
Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning
%252Fdoc%252Fai%252Fnn%252Fdiffusion%252F2018-sharma.pdf%2523google.html
A Connection Between Score Matching and Denoising Autoencoders
%252Fdoc%252Fai%252Fnn%252Fdiffusion%252F2011-vincent.pdf.html
Estimation of Non-Normalized Statistical Models by Score Matching
https%253A%252F%252Fwww.jmlr.org%252Fpapers%252Fvolume6%252Fhyvarinen05a%252Fhyvarinen05a.pdf.html
Wikipedia Bibliography: