Bibliography:

  1. Generative AI mode collapse

  2. ‘preference learning’ tag

  3. ‘GPT-4 poetry’ tag

  4. ‘GPT calibration’ tag

  5. ‘Claude AI’ tag

  6. ‘DALL·E 3’ tag

  7. Benchmarking LLM Diversity & Creativity

  8. Commentary on weaknesses in Midjourney’s new ranking-based personalization feature

  9. Why do writers still underestimate LLMs?

  10. Novelty Nets: Classifier Anti-Guidance

  11. Hidden Persuaders: LLMs’ Political Leaning and Their Influence on Voters

  12. Do LLMs estimate uncertainty well in instruction-following?

  13. SimpleStrat: Diversifying Language Model Generation with Stratification

  14. I Quit Teaching Because of ChatGPT

  15. Thoughts while watching myself be automated

  16. Why AI Isn’t Going to Make Art

  17. Epistemic Calibration and Searching the Space of Truth

  18. Are Large Language Models Consistent over Value-laden Questions?

  19. Pron vs Prompt: Can Large Language Models already Challenge a World-Class Fiction Author at Creative Text Writing?

  20. Sonnet or Not, Bot? Poetry Evaluation for Large Models and Datasets

  21. AI Doesn’t Kill Jobs? Tell That to Freelancers: There’s now data to back up what freelancers have been saying for months

  22. What Are the Odds? Language Models Are Capable of Probabilistic Reasoning

  23. Consistency-diversity-realism Pareto fronts of conditional image generative models

  24. Self-Consuming Generative Models with Curated Data Provably Optimize Human Preferences

  25. Creativity Has Left the Chat: The Price of Debiasing Language Models

  26. I Wish I Knew How to Force Quit You

  27. Enhancing Confidence Expression in Large Language Models Through Learning from Past Experience

  28. A Tale of Tails: Model Collapse as a Change of Scaling Laws

  29. The Non-Effect of Sampling Temperature on Problem Solving in GPT-3.5/GPT-4

  30. Weaver: Foundation Models for Creative Writing

  31. Does Using ChatGPT Result in Human Cognitive Augmentation?

  32. Originality Dies When Being Average Is Easier

  33. Experimental Narratives: A Comparison of Human Crowdsourced Storytelling and AI Storytelling

  34. Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking

  35. EQ-Bench: An Emotional Intelligence Benchmark for Large Language Models

  36. Generative artificial intelligence enhances creativity but reduces the diversity of novel content

  37. When ‘A Helpful Assistant’ Is Not Really Helpful: Personas in System Prompts Do Not Improve Performances of Large Language Models

  38. The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4

  39. A Coder Considers the Waning Days of the Craft: Coding has always felt to me like an endlessly deep and rich domain. Now I find myself wanting to write a eulogy for it

  40. When Ruthless Cultural Elitism Is Exactly the Job

  41. Does GPT-4 Pass the Turing Test?

  42. Large language models can replicate cross-cultural differences in personality

  43. Assessing the nature of large language models: A caution against anthropocentrism

  44. Simple synthetic data reduces sycophancy in large language models

  45. I’m a Screenwriter. These AI Jokes Give Me Nightmares

  46. Can a chatbot preach a good sermon? Hundreds attend church service generated by ChatGPT to find out

  47. ChatGPT is fun, but it is not funny! Humor is still challenging Large Language Models

  48. The False Promise of Imitating Proprietary LLMs

  49. Bits of Grass: Does GPT already know how to write like Whitman?

  50. Inducing anxiety in GPT-3.5 increases exploration and bias

  51. GPT-4 Technical Report § Limitations: Calibration

  52. Rewarding Chatbots for Real-World Engagement with Millions of Users

  53. Discovering Language Model Behaviors with Model-Written Evaluations

  54. Mysteries of mode collapse § Inescapable wedding parties

  55. RL with KL penalties is better viewed as Bayesian inference

  56. Janus

  57. Introducing V4

  58. Situational Awareness and Out-Of-Context Reasoning § GPT-4-Base Has Non-Zero Longform Performance

  59. I Finally Got ChatGPT to Sound like Me

  60. The Case for More Ambitious Language Model Evals

  61. 1241c140cfbe7e7f2478a11b1d7413c09055724c.html

  62. GPT-3 Catching Fish in Morse Code

  63. Mysteries of Mode Collapse

  64. Mysteries of Mode Collapse

  65. 9882caab6513d64435f8c352c23592b177a59092.html

  66. Please Stop Using Mediocre AI Art in Your Posts

  67. b6fcca82c8c4495e1f0cc9c1959a73e5ddc5dc4f.html

  68. What Kind of Writer Is ChatGPT?

  69. The New Poem-Making Machinery

  70. Here Are 120K 𝑤 Samples from @AydaoAI’s Large Anime Model (aka TADNE) Clustered into a Set of 256 Centroids. 𝘸𝘢𝘵𝘤𝘩 𝘪𝘵 𝘴𝘩𝘪𝘯𝘦

  71. design#future-tag-features

    [Transclude the forward-link's context]

  72. 2024-astolfi-figure1-paretofrontierofqualityvsdiversitytradeoffshowsnoconsistentgaininldmimagegenmodelsovertime.jpg

  73. 2024-astolfi-figure2-mscocoexamplesdemonstrationcollapseofdiversityinldmtunedimagegenmodels.png

  74. https://cs.nyu.edu/~davise/papers/GPT-Poetry.pdf

  75. 7f4670d12599f09d8302da70db6997f2190b3a1a.pdf

  76. https://docs.midjourney.com/docs/personalization

  77. https://news.ycombinator.com/item?id=38850202#38852945

  78. https://nostalgebraist.tumblr.com/post/706390430653267968/weve-been-talking-about-the-blandness-of

  79. 1798bf5ced1c9794a37bbb1bab97fb5454c5aea5.html

  80. https://nostalgebraist.tumblr.com/post/706441900479152128/novel-writing-chatgpt-vs-code-davinci-002

  81. https://nostalgebraist.tumblr.com/post/728556535745232896/claude-is-insufferable

  82. https://openai.com/blog/chatgpt/

  83. https://thezvi.wordpress.com/2024/02/27/the-gemini-incident-continues/

  84. https://www.astralcodexten.com/p/constitutional-ai-rlhf-on-steroids

  85. https://www.frontiersin.org/journals/robotics-and-ai/articles/10.3389/frobt.2017.00071/full

  86. https://www.lesswrong.com/posts/3ou8DayvDXxufkjHD/openai-api-base-models-are-not-sycophantic-at-any-size

  87. https://www.lesswrong.com/posts/DfqcyGXcFcukYbWZ5/i-measure-google-s-musiclm-over-3-months-as-it-appears-to-go

  88. https://www.lesswrong.com/posts/Fgzh2wLmvsBDmiFcN/sheikh-abdur-raheem-ali-s-shortform?commentId=ZtLC5dTTKrwLJxCBf

  89. dae2ed70bae6d468837b7ef79b0f15e7097b8d1d.html

  90. https://www.lesswrong.com/posts/MJyud5Qs6MheDemfE/artifex0-s-shortform?commentId=DzQapZEhTHxtjgbxh

  91. https://www.lesswrong.com/posts/t9svvNPNmFf5Qa3TA/mysteries-of-mode-collapse#pfHTedu4GKaWoxD5K

  92. https://www.lesswrong.com/posts/tbJdxJMAiehewGpq2/impressions-from-base-gpt-4

  93. https://www.reddit.com/r/ApplyingToCollege/comments/1h0vhlq/in_the_past_three_days_ive_reviewed_over_100/

  94. https://www.reddit.com/r/LocalLLaMA/comments/1ftn6s1/all_llms_are_converging_towards_the_same_point/

  95. 57e7318c91db50a5e6c7c6d59caafb08251dfe43.html

  96. https://www.reddit.com/r/LocalLLaMA/comments/1fuxw8d/just_for_kicks_i_looked_at_the_newly_released/

  97. https://www.reddit.com/r/mlscaling/comments/1gyb54z/the_fate_of_gpt4o/

  98. https://www.reddit.com/r/reinforcementlearning/comments/1dhkn9o/creativity_has_left_the_chat_the_price_of/l8xisr7/

  99. https://www.wired.com/story/confessions-viral-ai-writer-chatgpt/

  100. https://x.com/BlancheMinerva/status/1662521904727756801

  101. https://x.com/TrueTrollish/status/1810979650207723813

  102. https://x.com/aiamblichus/status/1810696605760155989

  103. https://x.com/aidangomez/status/1812363593985519838

  104. https://x.com/andrew_n_carr/status/1857262016106520655

  105. https://x.com/anthrupad/status/1623574021651714048

  106. https://x.com/bryancsk/status/1802421384842252293

  107. https://x.com/deedydas/status/1856016096228454791

  108. https://x.com/fofrAI/status/1810729254172381669

  109. https://x.com/fofrAI/status/1841854401717403944

  110. https://x.com/iScienceLuvr/status/1676891218075344896

  111. https://x.com/karpathy/status/1763303767292940684

  112. https://x.com/kliu128/status/1623524370391142405

  113. https://x.com/mattsclancy/status/1860532323080458400

  114. https://x.com/maximelabonne/status/1812066317383442813

  115. https://x.com/paulg/status/1777030573220933716

  116. https://x.com/repligate/status/1614435643475501056

  117. https://x.com/sherjilozair/status/1719665475452592495

  118. https://x.com/skirano/status/1810717536008208385

  119. https://x.com/skirano/status/1810717864451789233

  120. https://x.com/teortaxesTex/status/1817809140028186853

  121. https://x.com/thesephist/status/1801882388437360797

  122. I Quit Teaching Because of ChatGPT

  123. https%253A%252F%252Ftime.com%252F7026050%252Fchatgpt-quit-teaching-ai-essay%252F.html

  124. Thoughts while watching myself be automated

  125. https%253A%252F%252Fdynomight.net%252Fautomated%252F.html

  126. Why AI Isn’t Going to Make Art

  127. https%253A%252F%252Fwww.newyorker.com%252Fculture%252Fthe-weekend-essay%252Fwhy-ai-isnt-going-to-make-art.html

  128. Sonnet or Not, Bot? Poetry Evaluation for Large Models and Datasets

  129. https%253A%252F%252Farxiv.org%252Fabs%252F2406.18906.html

  130. I Wish I Knew How to Force Quit You

  131. https%253A%252F%252Fwww.thisamericanlife.org%252F832%252Ftranscript%2523act2.html

  132. A Tale of Tails: Model Collapse as a Change of Scaling Laws

  133. https%253A%252F%252Farxiv.org%252Fabs%252F2402.07043.html

  134. EQ-Bench: An Emotional Intelligence Benchmark for Large Language Models

  135. https%253A%252F%252Farxiv.org%252Fabs%252F2312.06281.html

  136. When Ruthless Cultural Elitism Is Exactly the Job

  137. https%253A%252F%252Fwww.nytimes.com%252Finteractive%252F2023%252F11%252F12%252Fmagazine%252Fandrew-wylie-interview.html.html

  138. Simple synthetic data reduces sycophancy in large language models

  139. https%253A%252F%252Farxiv.org%252Fabs%252F2308.03958%2523deepmind.html

  140. I’m a Screenwriter. These AI Jokes Give Me Nightmares

  141. https%253A%252F%252Ftime.com%252F6301288%252Fthe-ai-jokes-that-give-me-nightmares%252F.html

  142. The False Promise of Imitating Proprietary LLMs

  143. Sergey Levine

  144. https%253A%252F%252Farxiv.org%252Fabs%252F2305.15717.html

  145. Bits of Grass: Does GPT already know how to write like Whitman?

  146. https%253A%252F%252Farxiv.org%252Fabs%252F2305.11064.html

  147. GPT-4 Technical Report § Limitations: Calibration

  148. https%253A%252F%252Farxiv.org%252Fpdf%252F2303.08774%2523page%253D12%2526org%253Dopenai.html

  149. Mysteries of mode collapse § Inescapable wedding parties

  150. Janus

  151. https%253A%252F%252Fwww.lesswrong.com%252Fposts%252Ft9svvNPNmFf5Qa3TA%252Fmysteries-of-mode-collapse-due-to-rlhf%2523Inescapable_wedding_parties.html

  152. RL with KL penalties is better viewed as Bayesian inference

  153. https%253A%252F%252Farxiv.org%252Fabs%252F2205.11275.html