Bibliography:

  1. ‘AI video’ tag

  2. Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion

  3. SF-V: Single Forward Video Generation Model

  4. Sakuga-42M Dataset: Scaling Up Cartoon Research

  5. VideoGigaGAN: Towards Detail-rich Video Super-Resolution

  6. Dynamic Typography: Bringing Text to Life via Video Diffusion Prior

  7. VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time

  8. CMD: Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition

  9. ZigMa: Zigzag Mamba Diffusion Model

  10. TF-T2V: A Recipe for Scaling up Text-to-Video Generation with Text-free Videos

  11. W.A.L.T: Photorealistic Video Generation with Diffusion Models

  12. StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter

  13. MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation

  14. I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models

  15. Where Memory Ends and Generative AI Begins: New photo manipulation tools from Google and Adobe are blurring the lines between real memories and those dreamed up by AI

  16. Parsing-Conditioned Anime Translation: A New Dataset and Method

  17. Dreamix: Video Diffusion Models are General Video Editors

  18. OpenAI CEO Sam Altman on GPT-4: ‘people are begging to be disappointed and they will be’

  19. Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

  20. MAGVIT: Masked Generative Video Transformer

  21. Latent Video Diffusion Models for High-Fidelity Video Generation with Arbitrary Lengths

  22. AnimeRun: 2D Animation Visual Correspondence from Open Source 3D Movies

  23. Imagen Video: High Definition Video Generation with Diffusion Models

  24. Phenaki: Variable Length Video Generation From Open Domain Textual Description

  25. Make-A-Video: Text-to-Video Generation without Text-Video Data

  26. CelebV-HQ: A Large-Scale Video Facial Attributes Dataset

  27. InfiniteNature-Zero: Learning Perpetual View Generation of Natural Scenes from Single Images

  28. NUWA-∞: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis

  29. OmniMAE: Single Model Masked Pretraining on Images and Videos

  30. Cascaded Video Generation for Videos In-the-Wild

  31. CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers

  32. Flexible Diffusion Modeling of Long Videos

  33. Ethan Caballero on Private Scaling Progress

  34. Video Diffusion Models

  35. TATS: Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer

  36. Reinforcement Learning with Action-Free Pre-Training from Videos

  37. Transframer: Arbitrary Frame Prediction with Generative Models

  38. Diffusion Probabilistic Modeling for Video Generation

  39. General-purpose, long-context autoregressive modeling with Perceiver AR

  40. Microdosing: Knowledge Distillation for GAN based Compression

  41. StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN-2

  42. U.S. vs. China Rivalry Boosts Tech—and Tensions: Militarized AI threatens a new arms race

  43. NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion

  44. Advances in Neural Rendering

  45. Learning a perceptual manifold with deep features for animation video resequencing

  46. Autoregressive Latent Video Prediction with High-Fidelity Image Generator

  47. FitVid: Overfitting in Pixel-Level Video Prediction

  48. Alias-Free Generative Adversarial Networks

  49. GANs N’ Roses: Stable, Controllable, Diverse Image to Image Translation (works for videos too!)

  50. NWT: Towards natural audio-to-video generation with representation learning

  51. Vector Quantized Models for Planning

  52. GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions

  53. VideoGPT: Video Generation using VQ-VAE and Transformers

  54. China’s GPT-3? BAAI Introduces Superscale Intelligence Model ‘Wu Dao 1.0’: The Beijing Academy of Artificial Intelligence (BAAI) releases Wu Dao 1.0, China’s first large-scale pretraining model.

  55. Greedy Hierarchical Variational Autoencoders (GHVAEs) for Large-Scale Video Prediction

  56. CW-VAE: Clockwork Variational Autoencoders

  57. Scaling Laws for Autoregressive Generative Modeling

  58. SIREN: Implicit Neural Representations with Periodic Activation Functions

  59. NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

  60. High Fidelity Video Prediction with Large Stochastic Recurrent Neural Networks

  61. Learning to Predict Without Looking Ahead: World Models Without Forward Prediction

  62. Learning to Predict Without Looking Ahead: World Models Without Forward Prediction [blog]

  63. Scaling Autoregressive Video Models

  64. NoGAN: Decrappification, DeOldification, and Super Resolution

  65. Model-Based Reinforcement Learning for Atari

  66. Parallel Multiscale Autoregressive Density Estimation

  67. VPN: Video Pixel Networks

  68. THUDM/CogVideo: Text-To-Video Generation. The Repo for ICLR2023 Paper "CogVideo: Large-Scale Pretraining for Text-To-Video Generation via Transformers"

  69. PaintsUndo: A Base Model of Drawing Behaviors in Digital Paintings

  70. Flexible Diffusion Modeling of Long Videos

  71. Text2Bricks: Fine-Tuning Open-Sora in 1,000 GPU-Hours

  72. bdfdfcaa29b374da7fcca5ac091961f3538fb23f.html

  73. EfficientZero: How It Works

  74. 2021-karras-figure17-totalelectricityuse.jpg

  75. https://blog.metaphysic.ai/the-road-to-realistic-full-body-deepfakes/

  76. https://github.com/facebookresearch/AnimatedDrawings

  77. https://lilianweng.github.io/posts/2024-04-12-diffusion-video/

  78. 6a4a04c04011e74f68bee051ead6effe897241f2.html

  79. https://openai.com/blog/sora-first-impressions

  80. https://research.google/blog/google-research-2022-beyond-language-vision-and-generative-models/

  81. https://research.google/blog/videopoet-a-large-language-model-for-zero-shot-video-generation/

  82. https://stability.ai/news/introducing-stable-video-3d

  83. 8be37b0a7f00d37de8cf38f37c00e253b606a75c.html

  84. https://wilsonyan.com/teco/

  85. 0a250e36bbf6fb0d6208084b147169a0e19702e3.html

  86. https://www.bloomberg.com/news/articles/2023-04-27/fed-s-powell-tricked-by-russian-pranksters-posing-as-zelenskiy?y

  87. https://www.chinatalk.media/p/reflections-from-neurips-the-worlds#%C2%A7chinas-ai-generated-youtube-propaganda

  88. https://www.csm.ai/commonsim-1-generating-3d-worlds

  89. https://www.fxguide.com/fxfeatured/actually-using-sora/

  90. https://www.reddit.com/r/OpenAI/comments/1bgcvut/the_world_will_never_be_the_same_after_sora/

  91. https://www.reddit.com/r/StableDiffusion/comments/119vvzg/bad_apple_but_its_rendered_and_colorized_with/

  92. https://www.reddit.com/r/StableDiffusion/comments/12pvhhm/animov01_highresolution_anime_finetune_of/

  93. e67b738c76d285dcc1b52a326320f4ac93422150.html

  94. https://www.reddit.com/r/StableDiffusion/comments/161qkeb/ai_burger_commercial_source_matancohengrumi/

  95. https://www.reddit.com/r/StableDiffusion/comments/17b4dfc/my_first_try_with_video/

  96. https://www.reddit.com/r/StableDiffusion/comments/1avou9y/the_current_state_of_img2vid_will_smith_eating/

  97. 1aa3691aaa1bf19a686ee195a7440009b2229ede.html

  98. https://www.reddit.com/r/StableDiffusion/comments/1bhs3rl/openai_keeps_dropping_more_insane_sora_videos/

  99. e5317a2fb46b46053af5651b3dd3a7c70acb6ae1.html

  100. https://www.reddit.com/r/StableDiffusion/comments/1f5x795/movement_is_almost_human_with_klingai/

  101. 6141bbd07d747f1951902589475584a63c0da04e.html

  102. https://www.reddit.com/r/StableDiffusion/comments/ys434h/animating_generated_face_test/

  103. a872726d6f0763bb43865cecdf6818c53045bfe9.html

  104. https://www.reddit.com/r/midjourney/comments/12xw3d2/definitely_wasted_3_hours_of_my_life_making_this/

  105. https://www.reddit.com/r/midjourney/comments/1g7hk22/cursed_shore/

  106. https://www.reddit.com/r/midjourney/comments/1gi1ptl/morphing_within_a_morphing/

  107. https://www.samdickie.me/writing/experiment-1-creating-a-landing-page-using-ai-tools-no-code

  108. https://www.theguardian.com/world/2023/nov/06/chinese-influencers-using-ai-digital-clones-of-themselves-to-pump-out-content

  109. https://www.tomshardware.com/news/nvidia-hints-at-dlss-10-delivering-full-neural-rendering-potentially-replacing-rasterization-and-ray-tracing

  110. https://www.wired.com/story/yahoo-boys-real-time-deepfake-scams/

  111. https://www.youtube.com/watch?v=9oryIMNVtto

  112. https://www.youtube.com/watch?v=CYeqbtYAIzY

  113. https://www.youtube.com/watch?v=OA8-6q7igwE

  114. https://www.youtube.com/watch?v=f75eoFyo9ns

  115. https://www.youtube.com/watch?v=u1R-jxDPC70

  116. https://x.com/SteveMills/status/1674219548147585024

  117. https://x.com/WillSmith2real/status/1759703359727300880

  118. https://x.com/aaronkemmer/status/1604570089059061760

  119. https://x.com/aimikummd/status/1655231878369275904

  120. https://x.com/ammaar/status/1615133036974321665

  121. https://x.com/frantzfries/status/1651316031762071553

  122. https://x.com/joshua_xu_/status/1689019874667024384

  123. https://x.com/pika_labs/status/1729510078959497562

  124. https://x.com/sundarpichai/status/1587872629137948672

  125. https://x.com/toyxyz3/status/1695134607317012749

  126. https://x.com/umesh_ai/status/1854861074463445332

  127. https://x.com/umesh_ai/status/1855079179999400197

  128. https://yosefk.com/blog/the-state-of-ai-for-hand-drawn-animation-inbetweening.html

  129. 0a695ab36130011f89dc325d56337648256a8f0d.html

  130. ZigMa: Zigzag Mamba Diffusion Model

  131. https%253A%252F%252Farxiv.org%252Fabs%252F2403.13802.html

  132. TF-T2V: A Recipe for Scaling up Text-to-Video Generation with Text-free Videos

  133. https%253A%252F%252Farxiv.org%252Fabs%252F2312.15770%2523alibaba.html

  134. MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation

  135. https%253A%252F%252Farxiv.org%252Fabs%252F2311.18829%2523microsoft.html

  136. I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models

  137. https%253A%252F%252Farxiv.org%252Fabs%252F2311.04145%2523alibaba.html

  138. Dreamix: Video Diffusion Models are General Video Editors

  139. https%253A%252F%252Farxiv.org%252Fabs%252F2302.01329%2523google.html

  140. OpenAI CEO Sam Altman on GPT-4: ‘people are begging to be disappointed and they will be’

  141. https%253A%252F%252Fwww.theverge.com%252F23560328%252Fopenai-gpt-4-rumor-release-date-sam-altman-interview.html

  142. MAGVIT: Masked Generative Video Transformer

  143. https%253A%252F%252Farxiv.org%252Fabs%252F2212.05199%2523google.html

  144. NUWA-∞: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis

  145. https%253A%252F%252Farxiv.org%252Fabs%252F2207.09814%2523microsoft.html

  146. OmniMAE: Single Model Masked Pretraining on Images and Videos

  147. https%253A%252F%252Farxiv.org%252Fabs%252F2206.08356%2523facebook.html

  148. CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers

  149. https%253A%252F%252Farxiv.org%252Fabs%252F2205.15868.html

  150. Ethan Caballero on Private Scaling Progress

  151. https%253A%252F%252Ftheinsideview.ai%252Fethan.html

  152. TATS: Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer

  153. https%253A%252F%252Farxiv.org%252Fabs%252F2204.03638%2523facebook.html

  154. General-purpose, long-context autoregressive modeling with Perceiver AR

  155. https%253A%252F%252Farxiv.org%252Fabs%252F2202.07765%2523deepmind.html

  156. StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN-2

  157. https%253A%252F%252Farxiv.org%252Fabs%252F2112.14683.html

  158. U.S. vs. China Rivalry Boosts Tech—and Tensions: Militarized AI threatens a new arms race

  159. https%253A%252F%252Fspectrum.ieee.org%252Fchina-us-militarized-ai.html

  160. Vector Quantized Models for Planning

  161. Sherjil Ozair

  162. https%253A%252F%252Farxiv.org%252Fabs%252F2106.04615%2523deepmind.html

  163. VideoGPT: Video Generation using VQ-VAE and Transformers

  164. Aravind Srinivas

  165. https%253A%252F%252Farxiv.org%252Fabs%252F2104.10157.html

  166. China’s GPT-3? BAAI Introduces Superscale Intelligence Model ‘Wu Dao 1.0’: The Beijing Academy of Artificial Intelligence (BAAI) releases Wu Dao 1.0, China’s first large-scale pretraining model.

  167. https%253A%252F%252Fsyncedreview.com%252F2021%252F03%252F23%252Fchinas-gpt-3-baai-introduces-superscale-intelligence-model-wu-dao-1-0%252F%2523baai.html

  168. Scaling Laws for Autoregressive Generative Modeling

  169. Jared Kaplan

  170. Speaker Details: EmTech MIT 2023

  171. Alec Radford

  172. Aditya A. Ramesh

  173. John Schulman’s Homepage

  174. Sam McCandlish

  175. https%253A%252F%252Farxiv.org%252Fabs%252F2010.14701%2523openai.html