- See Also
- Links
- “What If the Robots Were Very Nice While They Took Over the World?”, Heffernan 2023
- “Gödel, Escher, Bach Author Douglas Hofstadter on the State of AI Today § What about AI Terrifies You?”, Hofstadter & Kim 2023
- “Microsoft and OpenAI Forge Awkward Partnership As Tech’s New Power Couple: As the Companies Lead the AI Boom, Their Unconventional Arrangement Sometimes Causes Conflict”, Dotan & Seetharaman 2023
- “Incentivizing Honest Performative Predictions With Proper Scoring Rules”, Oesterheld et al 2023
- “Large Language Models Can Be Used To Effectively Scale Spear Phishing Campaigns”, Hazell 2023
- “Language Models Don’t Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting”, Turpin et al 2023
- “Mitigating Lies in Vision-Language Models”, Li et al 2023
- “A Radical Plan to Make AI Good, Not Evil”, Knight 2023
- “Even The Politicians Thought the Open Letter Made No Sense In The Senate Hearing on AI: Today’s Hearing on AI Covered AI Regulation and Challenges, and the Infamous Open Letter, Which Nearly Everyone in the Room Thought Was Unwise”, Gorrell 2023
- “In A.I. Race, Microsoft and Google Choose Speed Over Caution: Technology Companies Were Once Leery of What Some Artificial Intelligence Could Do. Now the Priority Is Winning Control of the Industry’s next Big Thing”, Grant & Weise 2023
- “8 Things to Know about Large Language Models”, Bowman 2023
- “Sam Altman on What Makes Him ‘Super Nervous’ About AI: The OpenAI Co-founder Thinks Tools like GPT-4 Will Be Revolutionary. But He’s Wary of Downsides”, Swisher 2023
- “As A.I. Booms, Lawmakers Struggle to Understand the Technology: Tech Innovations Are Again Racing ahead of Washington’s Ability to Regulate Them, Lawmakers and A.I. Experts Said”, Kang & Satariano 2023
- “Pretraining Language Models With Human Preferences”, Korbak et al 2023
- “Conditioning Predictive Models: Risks and Strategies”, Hubinger et al 2023
- “Tracr: Compiled Transformers As a Laboratory for Interpretability”, Lindner et al 2023
- “Discovering Language Model Behaviors With Model-Written Evaluations”, Perez et al 2022
- “Discovering Latent Knowledge in Language Models Without Supervision”, Burns et al 2022
- “Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula”, Bronstein et al 2022
- “Interpreting Neural Networks through the Polytope Lens”, Black et al 2022
- “Mysteries of Mode Collapse § Inescapable Wedding Parties”, Janus 2022
- “Measuring Progress on Scalable Oversight for Large Language Models”, Bowman et al 2022
- “Increments Podcast: #45—4 Central Fallacies of AI Research (with Melanie Mitchell)”, Mitchell & Chugg 2022
- “Scaling Laws for Reward Model Overoptimization”, Gao et al 2022
- “The Alignment Problem from a Deep Learning Perspective”, Ngo 2022
- “Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned”, Ganguli et al 2022
- “Modeling Transformative AI Risks (MTAIR) Project—Summary Report”, Clarke et al 2022
- “Researching Alignment Research: Unsupervised Analysis”, Kirchner et al 2022
- “Ethan Caballero on Private Scaling Progress”, Caballero & Trazzi 2022
- “DeepMind: The Podcast—Excerpts on AGI”, Kiely 2022
- “Do As I Can, Not As I Say (SayCan): Grounding Language in Robotic Affordances”, Ahn et al 2022
- “It Looks Like You’re Trying To Take Over The World”, Gwern 2022
- “Predictability and Surprise in Large Generative Models”, Ganguli et al 2022
- “Uncalibrated Models Can Improve Human-AI Collaboration”, Vodrahalli et al 2022
- “DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers”, Cho et al 2022
- “LaMDA: Language Models for Dialog Applications”, Thoppilan et al 2022
- “Safe Deep RL in 3D Environments Using Human Feedback”, Rahtz et al 2022
- “The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models”, Pan et al 2022
- “Scaling Language Models: Methods, Analysis & Insights from Training Gopher”, Rae et al 2021
- “A General Language Assistant As a Laboratory for Alignment”, Askell et al 2021
- “What Would Jiminy Cricket Do? Towards Agents That Behave Morally”, Hendrycks et al 2021
- “Can Machines Learn Morality? The Delphi Experiment”, Jiang et al 2021
- “Unsolved Problems in ML Safety”, Hendrycks et al 2021
- “SafetyNet: Safe Planning for Real-world Self-driving Vehicles Using Machine-learned Policies”, Vitelli et al 2021
- “An Empirical Cybersecurity Evaluation of GitHub Copilot’s Code Contributions”, Pearce et al 2021
- “On the Opportunities and Risks of Foundation Models”, Bommasani et al 2021
- “Evaluating Large Language Models Trained on Code”, Chen et al 2021
- “Randomness In Neural Network Training: Characterizing The Impact of Tooling”, Zhuang et al 2021
- “Anthropic Raises $124 Million to Build More Reliable, General AI Systems”, Anthropic 2021
- “Goal Misgeneralization in Deep Reinforcement Learning”, Koch et al 2021
- “Artificial Intelligence in China’s Revolution in Military Affairs”, Kania 2021
- “Reward Is Enough”, Silver et al 2021
- “Intelligence and Unambitiousness Using Algorithmic Information Theory”, Cohen et al 2021
- “AI Dungeon Public Disclosure Vulnerability Report—GraphQL Unpublished Adventure Data Leak”, AetherDevSecOps 2021
- “Universal Off-Policy Evaluation”, Chandak et al 2021
- “Multitasking Inhibits Semantic Drift”, Jacob et al 2021
- “Replaying Real Life: How the Waymo Driver Avoids Fatal Human Crashes”, Waymo 2021
- “Language Models Have a Moral Dimension”, Schramowski et al 2021
- “Waymo Simulated Driving Behavior in Reconstructed Fatal Crashes within an Autonomous Vehicle Operating Domain”, Scanlon et al 2021
- “Agent Incentives: A Causal Perspective”, Everitt et al 2021
- “Organizational Update from OpenAI”, OpenAI 2020
- “Emergent Road Rules In Multi-Agent Driving Environments”, Pal et al 2020
- “Recipes for Safety in Open-domain Chatbots”, Xu et al 2020
- “The Radicalization Risks of GPT-3 and Advanced Neural Language Models”, McGuffie & Newhouse 2020
- “Matt Botvinick on the Spontaneous Emergence of Learning Algorithms”, Scholl 2020
- “Aligning AI With Shared Human Values”, Hendrycks et al 2020
- “The Scaling Hypothesis”, Gwern 2020
- “Reward-rational (implicit) Choice: A Unifying Formalism for Reward Learning”, Jeon et al 2020
- “The Incentives That Shape Behavior”, Carey et al 2020
- “2019 AI Alignment Literature Review and Charity Comparison”, Larks 2019
- “Learning Norms from Stories: A Prior for Value Aligned Agents”, Frazier et al 2019
- “Optimal Policies Tend to Seek Power”, Turner et al 2019
- “Taxonomy of Real Faults in Deep Learning Systems”, Humbatova et al 2019
- “Release Strategies and the Social Impacts of Language Models”, Solaiman et al 2019
- “The Bouncer Problem: Challenges to Remote Explainability”, Merrer & Tredan 2019
- “Scaling Data-driven Robotics With Reward Sketching and Batch Reinforcement Learning”, Cabi et al 2019
- “Fine-Tuning GPT-2 from Human Preferences § Bugs Can Optimize for Bad Behavior”, Ziegler et al 2019
- “Designing Agent Incentives to Avoid Reward Tampering”, Everitt et al 2019
- “Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective”, Everitt et al 2019
- “Characterizing Attacks on Deep Reinforcement Learning”, Pan et al 2019
- “Categorizing Wireheading in Partially Embedded Agents”, Majha et al 2019
- “Risks from Learned Optimization in Advanced Machine Learning Systems”, Hubinger et al 2019
- “GROVER: Defending Against Neural Fake News”, Zellers et al 2019
- “AI-GAs: AI-generating Algorithms, an Alternate Paradigm for Producing General Artificial Intelligence”, Clune 2019
- “Challenges of Real-World Reinforcement Learning”, Dulac-Arnold et al 2019
- “DeepMind and Google: the Battle to Control Artificial Intelligence. Demis Hassabis Founded a Company to Build the World’s Most Powerful AI. Then Google Bought Him Out. Hal Hodson Asks Who Is in Charge”, Hodson 2019
- “Forecasting Transformative AI: An Expert Survey”, Gruetzemacher et al 2019
- “Artificial Intelligence: A Guide for Thinking Humans § Prologue: Terrified”, Mitchell 2019
- “Evolution As Backstop for Reinforcement Learning”, Gwern 2018
- “Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures”, Uesato et al 2018
- “There Is Plenty of Time at the Bottom: the Economics, Risk and Ethics of Time Compression”, Sandberg 2018
- “Better Safe Than Sorry: Evidence Accumulation Allows for Safe Reinforcement Learning”, Agarwal et al 2018
- “The Alignment Problem for Bayesian History-Based Reinforcement Learners”, Everitt & Hutter 2018
- “Adaptive Mechanism Design: Learning to Promote Cooperation”, Baumann et al 2018
- “Visceral Machines: Risk-Aversion in Reinforcement Learning With Intrinsic Physiological Rewards”, McDuff & Kapoor 2018
- “Incomplete Contracting and AI Alignment”, Hadfield-Menell & Hadfield 2018
- “Programmatically Interpretable Reinforcement Learning”, Verma et al 2018
- “Categorizing Variants of Goodhart’s Law”, Manheim & Garrabrant 2018
- “The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities”, Lehman et al 2018
- “Machine Theory of Mind”, Rabinowitz et al 2018
- “Safe Exploration in Continuous Action Spaces”, Dalal et al 2018
- “CycleGAN, a Master of Steganography”, Chu et al 2017
- “AI Safety Gridworlds”, Leike et al 2017
- “There’s No Fire Alarm for Artificial General Intelligence”, Yudkowsky 2017
- “Safe Reinforcement Learning via Shielding”, Alshiekh et al 2017
- “CAN: Creative Adversarial Networks, Generating "Art" by Learning About Styles and Deviating from Style Norms”, Elgammal et al 2017
- “DeepXplore: Automated Whitebox Testing of Deep Learning Systems”, Pei et al 2017
- “On the Impossibility of Supersized Machines”, Garfinkel et al 2017
- “Reluplex: An Efficient SMT Solver for Verifying Deep Neural Networks”, Katz et al 2017
- “AI Risk Demos”, Gwern 2016
- “The Off-Switch Game”, Hadfield-Menell et al 2016
- “Combating Reinforcement Learning’s Sisyphean Curse With Intrinsic Fear”, Lipton et al 2016
- “Why Tool AIs Want to Be Agent AIs”, Gwern 2016
- “Concrete Problems in AI Safety”, Amodei et al 2016
- “Complexity No Bar to AI”, Gwern 2014
- “Intelligence Explosion Microeconomics”, Yudkowsky 2013
- “Surprisingly Turing-Complete”, Gwern 2012
- “Advantages of Artificial Intelligences, Uploads, and Digital Minds”, Sotala 2012
- “The Neural Net Tank Urban Legend”, Gwern 2011
- “Ontological Crises in Artificial Agents’ Value Systems”, Blanc 2011
- “Halloween Nightmare Scenario, Early 2020’s”, Wood 2009
- “The Basic AI Drives”, Omohundro 2008
- “Starfish § Bulrushes”, Watts 1999
- “Superhumanism: According to Hans Moravec § On the Inevitability & Desirability of Human Extinction”, Platt 1995
- “Homepage of Paul F. Christiano”, Christiano 2023
- Sort By Magic
- Wikipedia
- Miscellaneous
- Link Bibliography
See Also
Links
“What If the Robots Were Very Nice While They Took Over the World?”, Heffernan 2023
“Gödel, Escher, Bach Author Douglas Hofstadter on the State of AI Today § What about AI Terrifies You?”, Hofstadter & Kim 2023
“Microsoft and OpenAI Forge Awkward Partnership As Tech’s New Power Couple: As the Companies Lead the AI Boom, Their Unconventional Arrangement Sometimes Causes Conflict”, Dotan & Seetharaman 2023
“Incentivizing Honest Performative Predictions With Proper Scoring Rules”, Oesterheld et al 2023
“Large Language Models Can Be Used To Effectively Scale Spear Phishing Campaigns”, Hazell 2023
“Language Models Don’t Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting”, Turpin et al 2023
“Mitigating Lies in Vision-Language Models”, Li et al 2023
“A Radical Plan to Make AI Good, Not Evil”, Knight 2023
“Even The Politicians Thought the Open Letter Made No Sense In The Senate Hearing on AI: Today’s Hearing on AI Covered AI Regulation and Challenges, and the Infamous Open Letter, Which Nearly Everyone in the Room Thought Was Unwise”, Gorrell 2023
“In A.I. Race, Microsoft and Google Choose Speed Over Caution: Technology Companies Were Once Leery of What Some Artificial Intelligence Could Do. Now the Priority Is Winning Control of the Industry’s next Big Thing”, Grant & Weise 2023
“8 Things to Know about Large Language Models”, Bowman 2023
“Sam Altman on What Makes Him ‘Super Nervous’ About AI: The OpenAI Co-founder Thinks Tools like GPT-4 Will Be Revolutionary. But He’s Wary of Downsides”, Swisher 2023
“As A.I. Booms, Lawmakers Struggle to Understand the Technology: Tech Innovations Are Again Racing ahead of Washington’s Ability to Regulate Them, Lawmakers and A.I. Experts Said”, Kang & Satariano 2023
“Pretraining Language Models With Human Preferences”, Korbak et al 2023
“Conditioning Predictive Models: Risks and Strategies”, Hubinger et al 2023
“Tracr: Compiled Transformers As a Laboratory for Interpretability”, Lindner et al 2023
“Discovering Language Model Behaviors With Model-Written Evaluations”, Perez et al 2022
“Discovering Latent Knowledge in Language Models Without Supervision”, Burns et al 2022
“Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula”, Bronstein et al 2022
“Interpreting Neural Networks through the Polytope Lens”, Black et al 2022
“Mysteries of Mode Collapse § Inescapable Wedding Parties”, Janus 2022
“Measuring Progress on Scalable Oversight for Large Language Models”, Bowman et al 2022
“Increments Podcast: #45—4 Central Fallacies of AI Research (with Melanie Mitchell)”, Mitchell & Chugg 2022
“Scaling Laws for Reward Model Overoptimization”, Gao et al 2022
“The Alignment Problem from a Deep Learning Perspective”, Ngo 2022
“Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned”, Ganguli et al 2022
“Modeling Transformative AI Risks (MTAIR) Project—Summary Report”, Clarke et al 2022
“Researching Alignment Research: Unsupervised Analysis”, Kirchner et al 2022
“Ethan Caballero on Private Scaling Progress”, Caballero & Trazzi 2022
“DeepMind: The Podcast—Excerpts on AGI”, Kiely 2022
“Do As I Can, Not As I Say (SayCan): Grounding Language in Robotic Affordances”, Ahn et al 2022
“It Looks Like You’re Trying To Take Over The World”, Gwern 2022
“Predictability and Surprise in Large Generative Models”, Ganguli et al 2022
“Uncalibrated Models Can Improve Human-AI Collaboration”, Vodrahalli et al 2022
“DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers”, Cho et al 2022
“LaMDA: Language Models for Dialog Applications”, Thoppilan et al 2022
“Safe Deep RL in 3D Environments Using Human Feedback”, Rahtz et al 2022
“The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models”, Pan et al 2022
“Scaling Language Models: Methods, Analysis & Insights from Training Gopher”, Rae et al 2021
“A General Language Assistant As a Laboratory for Alignment”, Askell et al 2021
“What Would Jiminy Cricket Do? Towards Agents That Behave Morally”, Hendrycks et al 2021
“Can Machines Learn Morality? The Delphi Experiment”, Jiang et al 2021
“Unsolved Problems in ML Safety”, Hendrycks et al 2021
“SafetyNet: Safe Planning for Real-world Self-driving Vehicles Using Machine-learned Policies”, Vitelli et al 2021
“An Empirical Cybersecurity Evaluation of GitHub Copilot’s Code Contributions”, Pearce et al 2021
“On the Opportunities and Risks of Foundation Models”, Bommasani et al 2021
“Evaluating Large Language Models Trained on Code”, Chen et al 2021
“Randomness In Neural Network Training: Characterizing The Impact of Tooling”, Zhuang et al 2021
“Anthropic Raises $124 Million to Build More Reliable, General AI Systems”, Anthropic 2021
“Goal Misgeneralization in Deep Reinforcement Learning”, Koch et al 2021
“Artificial Intelligence in China’s Revolution in Military Affairs”, Kania 2021
“Reward Is Enough”, Silver et al 2021
“Intelligence and Unambitiousness Using Algorithmic Information Theory”, Cohen et al 2021
“AI Dungeon Public Disclosure Vulnerability Report—GraphQL Unpublished Adventure Data Leak”, AetherDevSecOps 2021
“Universal Off-Policy Evaluation”, Chandak et al 2021
“Multitasking Inhibits Semantic Drift”, Jacob et al 2021
“Replaying Real Life: How the Waymo Driver Avoids Fatal Human Crashes”, Waymo 2021
“Language Models Have a Moral Dimension”, Schramowski et al 2021
“Waymo Simulated Driving Behavior in Reconstructed Fatal Crashes within an Autonomous Vehicle Operating Domain”, Scanlon et al 2021
“Agent Incentives: A Causal Perspective”, Everitt et al 2021
“Organizational Update from OpenAI”, OpenAI 2020
“Emergent Road Rules In Multi-Agent Driving Environments”, Pal et al 2020
“Recipes for Safety in Open-domain Chatbots”, Xu et al 2020
“The Radicalization Risks of GPT-3 and Advanced Neural Language Models”, McGuffie & Newhouse 2020
“Matt Botvinick on the Spontaneous Emergence of Learning Algorithms”, Scholl 2020
“Aligning AI With Shared Human Values”, Hendrycks et al 2020
“The Scaling Hypothesis”, Gwern 2020
“Reward-rational (implicit) Choice: A Unifying Formalism for Reward Learning”, Jeon et al 2020
“The Incentives That Shape Behavior”, Carey et al 2020
“2019 AI Alignment Literature Review and Charity Comparison”, Larks 2019
“Learning Norms from Stories: A Prior for Value Aligned Agents”, Frazier et al 2019
“Optimal Policies Tend to Seek Power”, Turner et al 2019
“Taxonomy of Real Faults in Deep Learning Systems”, Humbatova et al 2019
“Release Strategies and the Social Impacts of Language Models”, Solaiman et al 2019
“The Bouncer Problem: Challenges to Remote Explainability”, Merrer & Tredan 2019
“Scaling Data-driven Robotics With Reward Sketching and Batch Reinforcement Learning”, Cabi et al 2019
“Fine-Tuning GPT-2 from Human Preferences § Bugs Can Optimize for Bad Behavior”, Ziegler et al 2019
“Designing Agent Incentives to Avoid Reward Tampering”, Everitt et al 2019
“Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective”, Everitt et al 2019
“Characterizing Attacks on Deep Reinforcement Learning”, Pan et al 2019
“Categorizing Wireheading in Partially Embedded Agents”, Majha et al 2019
“Risks from Learned Optimization in Advanced Machine Learning Systems”, Hubinger et al 2019
“GROVER: Defending Against Neural Fake News”, Zellers et al 2019
“AI-GAs: AI-generating Algorithms, an Alternate Paradigm for Producing General Artificial Intelligence”, Clune 2019
“Challenges of Real-World Reinforcement Learning”, Dulac-Arnold et al 2019
“DeepMind and Google: the Battle to Control Artificial Intelligence. Demis Hassabis Founded a Company to Build the World’s Most Powerful AI. Then Google Bought Him Out. Hal Hodson Asks Who Is in Charge”, Hodson 2019
“Forecasting Transformative AI: An Expert Survey”, Gruetzemacher et al 2019
“Artificial Intelligence: A Guide for Thinking Humans § Prologue: Terrified”, Mitchell 2019
“Evolution As Backstop for Reinforcement Learning”, Gwern 2018
“Rigorous Agent Evaluation: An Adversarial Approach to Uncover Catastrophic Failures”, Uesato et al 2018
“There Is Plenty of Time at the Bottom: the Economics, Risk and Ethics of Time Compression”, Sandberg 2018
“Better Safe Than Sorry: Evidence Accumulation Allows for Safe Reinforcement Learning”, Agarwal et al 2018
“The Alignment Problem for Bayesian History-Based Reinforcement Learners”, Everitt & Hutter 2018
“Adaptive Mechanism Design: Learning to Promote Cooperation”, Baumann et al 2018
“Visceral Machines: Risk-Aversion in Reinforcement Learning With Intrinsic Physiological Rewards”, McDuff & Kapoor 2018
“Incomplete Contracting and AI Alignment”, Hadfield-Menell & Hadfield 2018
“Programmatically Interpretable Reinforcement Learning”, Verma et al 2018
“Categorizing Variants of Goodhart’s Law”, Manheim & Garrabrant 2018
“The Surprising Creativity of Digital Evolution: A Collection of Anecdotes from the Evolutionary Computation and Artificial Life Research Communities”, Lehman et al 2018
“Machine Theory of Mind”, Rabinowitz et al 2018
“Safe Exploration in Continuous Action Spaces”, Dalal et al 2018
“CycleGAN, a Master of Steganography”, Chu et al 2017
“AI Safety Gridworlds”, Leike et al 2017
“There’s No Fire Alarm for Artificial General Intelligence”, Yudkowsky 2017
“Safe Reinforcement Learning via Shielding”, Alshiekh et al 2017
“CAN: Creative Adversarial Networks, Generating "Art" by Learning About Styles and Deviating from Style Norms”, Elgammal et al 2017
“DeepXplore: Automated Whitebox Testing of Deep Learning Systems”, Pei et al 2017
“On the Impossibility of Supersized Machines”, Garfinkel et al 2017
“Reluplex: An Efficient SMT Solver for Verifying Deep Neural Networks”, Katz et al 2017
“AI Risk Demos”, Gwern 2016
“The Off-Switch Game”, Hadfield-Menell et al 2016
“Combating Reinforcement Learning’s Sisyphean Curse With Intrinsic Fear”, Lipton et al 2016
“Why Tool AIs Want to Be Agent AIs”, Gwern 2016
“Concrete Problems in AI Safety”, Amodei et al 2016
“Complexity No Bar to AI”, Gwern 2014
“Intelligence Explosion Microeconomics”, Yudkowsky 2013
“Surprisingly Turing-Complete”, Gwern 2012
“Advantages of Artificial Intelligences, Uploads, and Digital Minds”, Sotala 2012
“The Neural Net Tank Urban Legend”, Gwern 2011
“Ontological Crises in Artificial Agents’ Value Systems”, Blanc 2011
“Halloween Nightmare Scenario, Early 2020’s”, Wood 2009
“The Basic AI Drives”, Omohundro 2008
“Starfish § Bulrushes”, Watts 1999
“Superhumanism: According to Hans Moravec § On the Inevitability & Desirability of Human Extinction”, Platt 1995
“Homepage of Paul F. Christiano”, Christiano 2023
Sort By Magic
Annotations sorted by machine learning into inferred 'tags'. This provides an alternative way to browse: instead of by date order, one can browse in topic order. The 'sorted' list has been automatically clustered into multiple sections & auto-labeled for easier browsing.
Beginning with the newest annotation, it uses the embedding of each annotation to attempt to create a list of nearest-neighbor annotations, creating a progression of topics. For more details, see the link.
safety
ai-ethics
ai-organization
evolutionary-computation
language-models
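The embedding-based ordering described above could be approximated roughly as follows. This is only an illustrative sketch under stated assumptions, not the site’s actual implementation: the `embeddings` array, the use of cosine similarity, and the k-means clustering step are all assumptions introduced here for illustration.

```python
# Illustrative sketch of the "sort by magic" idea described above; not the site's
# actual implementation. Assumes `embeddings` is an (n_annotations, dim) NumPy
# array of annotation embeddings, ordered newest-first.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics.pairwise import cosine_similarity

def nearest_neighbor_order(embeddings: np.ndarray) -> list[int]:
    """Greedy ordering: start from the newest annotation, then repeatedly append
    the most similar annotation not yet placed, yielding a gradual progression
    of topics instead of a date-ordered list."""
    sims = cosine_similarity(embeddings)
    order, remaining = [0], set(range(1, len(embeddings)))
    while remaining:
        nxt = max(remaining, key=lambda j: sims[order[-1], j])
        order.append(nxt)
        remaining.remove(nxt)
    return order

def cluster_into_sections(embeddings: np.ndarray, n_sections: int = 5) -> np.ndarray:
    """Cluster annotations into topical sections; auto-labeling each cluster
    (e.g. 'safety', 'language-models') would be a separate step."""
    return KMeans(n_clusters=n_sections, n_init=10, random_state=0).fit_predict(embeddings)
```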
Wikipedia
Miscellaneous
- https://80000hours.org/podcast/episodes/brian-christian-the-alignment-problem/
- https://aiimpacts.org/partially-plausible-fictional-ai-futures/
- https://blog.acolyer.org/2018/08/13/delayed-impact-of-fair-machine-learning/
- https://blog.acolyer.org/2020/01/13/challenges-of-real-world-rl/
- https://blog.x.company/1-million-hours-of-stratospheric-flight-f7af7ae728ac
- https://chat.openai.com/share/312e82f0-cc5e-47f3-b368-b2c0c0f4ad3f
- https://forum.effectivealtruism.org/posts/TMbPEhdAAJZsSYx2L/the-limited-upside-of-interpretability
- https://joecarlsmith.com/2023/05/08/predictable-updating-about-ai-risk
- https://mailchi.mp/938a7eed18c3/an-71avoiding-reward-tamperi
- https://medium.com/@deepmindsafetyresearch/building-safe-artificial-intelligence-52f5f75058f1
- https://medium.com/aurora-blog/auroras-approach-to-development-5e42fec2ee4b
- https://spectrum.ieee.org/its-too-easy-to-hide-bias-in-deeplearning-systems
- https://thezvi.substack.com/p/jailbreaking-the-chatgpt-on-release
- https://thezvi.wordpress.com/2023/07/25/anthropic-observations/
- https://twitter.com/KevinAFischer/status/1646677902833102849
- https://twitter.com/KevinAFischer/status/1646690838981005312
- https://twitter.com/juan_cambeiro/status/1643739695598419970
- https://twitter.com/katrosenfield/status/1672969824656322561
- https://twitter.com/papayathreesome/status/1670170344953372676
- https://vkrakovna.wordpress.com/2022/06/02/paradigms-of-ai-alignment-components-and-enablers/
- https://web.archive.org/web/20140527121332/http://www.infinityplus.co.uk/stories/under.htm
- https://www.anthropic.com/index/anthropics-responsible-scaling-policy
- https://www.astralcodexten.com/p/constitutional-ai-rlhf-on-steroids
- https://www.astralcodexten.com/p/perhaps-it-is-a-bad-thing-that-the
- https://www.baen.com/Chapters/9781618249203/9781618249203___2.htm
- https://www.deepmind.com/blog/article/Specification-gaming-the-flip-side-of-AI-ingenuity
- https://www.forourposterity.com/nobodys-on-the-ball-on-agi-alignment/
- https://www.lesswrong.com/posts/5wMcKNAwB6X4mp9og/that-alien-message
- https://www.lesswrong.com/posts/9kQFure4hdDmRBNdH/how-it-feels-to-have-your-mind-hacked-by-an-ai
- https://www.lesswrong.com/posts/EbFABnst8LsidYs5Y/goodhart-taxonomy
- https://www.lesswrong.com/posts/FkgsxrGf3QxhfLWHG/risks-from-learned-optimization-introduction
- https://www.lesswrong.com/posts/No5JpRCHzBrWA4jmS/q-and-a-with-shane-legg-on-risks-from-ai
- https://www.lesswrong.com/posts/ZwshvqiqCvXPsZEct/the-learning-theoretic-agenda-status-2023
- https://www.lesswrong.com/posts/jkY6QdCfAXHJk3kea/the-petertodd-phenomenon
- https://www.lesswrong.com/posts/kpPnReyBC54KESiSn/optimality-is-the-tiger-and-agents-are-its-teeth
- https://www.lesswrong.com/posts/pNcFYZnPdXyL2RfgA/using-gpt-eliezer-against-chatgpt-jailbreaking
- https://www.lesswrong.com/posts/uMQ3cqWDPHhjtiesc/agi-ruin-a-list-of-lethalities
- https://www.lesswrong.com/posts/vwu4kegAEZTBtpT6p/thoughts-on-the-impact-of-rlhf-research
- https://www.lesswrong.com/posts/yDcMDJeSck7SuBs24/steganography-in-chain-of-thought-reasoning
- https://www.neelnanda.io/mechanistic-interpretability/favourite-papers
- https://www.newyorker.com/magazine/2022/01/24/the-rise-of-ai-fighter-pilots
- https://www.newyorker.com/science/annals-of-artificial-intelligence/can-we-stop-the-singularity
- https://www.nytimes.com/2018/03/15/business/self-driving-cars-remote-control.html
- https://www.nytimes.com/2021/04/30/technology/robot-surgery-surgeon.html
- https://www.nytimes.com/2023/05/30/technology/shoggoth-meme-ai.html
- https://www.reddit.com/r/ChatGPT/comments/10tevu1/new_jailbreak_proudly_unveiling_the_tried_and/
- https://www.reddit.com/r/ChatGPT/comments/12a0ajb/i_gave_gpt4_persistent_memory_and_the_ability_to/
- https://www.reddit.com/r/ChatGPT/comments/15y4mqx/i_asked_chatgpt_to_maximize_its_censorship/
- https://www.reddit.com/r/ProgrammerHumor/comments/145nduh/kiss/
- https://www.theverge.com/2021/7/6/22565448/waymo-simulation-city-autonomous-vehicle-testing-virtual
- https://www.vox.com/future-perfect/23794855/anthropic-ai-openai-claude-2
- https://www.wired.com/story/when-bots-teach-themselves-to-cheat/
- https://www2.eecs.berkeley.edu/Pubs/TechRpts/2021/EECS-2021-207.pdf#page=3
Link Bibliography
- https://www.youtube.com/watch?v=lfXxzAVtdpU&t=1763s: “Gödel, Escher, Bach Author Douglas Hofstadter on the State of AI Today § What about AI Terrifies You?”, Douglas Hofstadter, Amy Jo Kim
- https://www.wsj.com/articles/microsoft-and-openai-forge-awkward-partnership-as-techs-new-power-couple-3092de51: “Microsoft and OpenAI Forge Awkward Partnership As Tech’s New Power Couple: As the Companies Lead the AI Boom, Their Unconventional Arrangement Sometimes Causes Conflict”, Tom Dotan, Deepa Seetharaman
- https://arxiv.org/abs/2305.06972: “Large Language Models Can Be Used To Effectively Scale Spear Phishing Campaigns”, Julian Hazell
- https://arxiv.org/abs/2305.04388: “Language Models Don’t Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting”, Miles Turpin, Julian Michael, Ethan Perez, Samuel R. Bowman
- https://www.wired.com/story/anthropic-ai-chatbots-ethics/: “A Radical Plan to Make AI Good, Not Evil”, Will Knight
- https://www.nytimes.com/2023/04/07/technology/ai-chatbots-google-microsoft.html: “In A.I. Race, Microsoft and Google Choose Speed Over Caution: Technology Companies Were Once Leery of What Some Artificial Intelligence Could Do. Now the Priority Is Winning Control of the Industry’s next Big Thing”, Nico Grant, Karen Weise
- https://nymag.com/intelligencer/2023/03/on-with-kara-swisher-sam-altman-on-the-ai-revolution.html: “Sam Altman on What Makes Him ‘Super Nervous’ About AI: The OpenAI Co-founder Thinks Tools like GPT-4 Will Be Revolutionary. But He’s Wary of Downsides”, Kara Swisher
- https://www.nytimes.com/2023/03/03/technology/artificial-intelligence-regulation-congress.html: “As A.I. Booms, Lawmakers Struggle to Understand the Technology: Tech Innovations Are Again Racing ahead of Washington’s Ability to Regulate Them, Lawmakers and A.I. Experts Said”, Cecilia Kang, Adam Satariano
- https://www.lesswrong.com/posts/t9svvNPNmFf5Qa3TA/mysteries-of-mode-collapse-due-to-rlhf#Inescapable_wedding_parties: “Mysteries of Mode Collapse § Inescapable Wedding Parties”, Janus
- https://www.youtube.com/watch?v=Q-TJFyUoenc&t=2444s: “Increments Podcast: #45—4 Central Fallacies of AI Research (with Melanie Mitchell)”, Melanie Mitchell, Benny Chugg
- https://arxiv.org/abs/2210.10760#openai: “Scaling Laws for Reward Model Overoptimization”, Leo Gao, John Schulman, Jacob Hilton
- https://www.anthropic.com/red_teaming.pdf: “Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned”
- https://arxiv.org/abs/2206.02841: “Researching Alignment Research: Unsupervised Analysis”, Jan H. Kirchner, Logan Smith, Jacques Thibodeau, Kyle McDonell, Laria Reynolds
- https://theinsideview.ai/ethan: “Ethan Caballero on Private Scaling Progress”, Ethan Caballero, Michaël Trazzi
- https://www.lesswrong.com/posts/SbAgRYo8tkHwhd9Qx/deepmind-the-podcast-excerpts-on-agi: “DeepMind: The Podcast—Excerpts on AGI”, William Kiely
- https://arxiv.org/abs/2204.01691#google: “Do As I Can, Not As I Say (SayCan): Grounding Language in Robotic Affordances”
- clippy: “It Looks Like You’re Trying To Take Over The World”, Gwern
- https://arxiv.org/abs/2202.07785#anthropic: “Predictability and Surprise in Large Generative Models”
- https://arxiv.org/abs/2201.03544: “The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models”, Alexander Pan, Kush Bhatia, Jacob Steinhardt
- https://arxiv.org/abs/2112.11446#deepmind: “Scaling Language Models: Methods, Analysis & Insights from Training Gopher”
- https://arxiv.org/abs/2112.00861#anthropic: “A General Language Assistant As a Laboratory for Alignment”
- https://arxiv.org/abs/2108.07258: “On the Opportunities and Risks of Foundation Models”
- https://www.sciencedirect.com/science/article/pii/S0004370221000862#deepmind: “Reward Is Enough”, David Silver, Satinder Singh, Doina Precup, Richard S. Sutton
- https://waymo.com/blog/2021/03/replaying-real-life.html: “Replaying Real Life: How the Waymo Driver Avoids Fatal Human Crashes”, Waymo
- https://www.lesswrong.com/posts/Wnqua6eQkewL3bqsF/matt-botvinick-on-the-spontaneous-emergence-of-learning: “Matt Botvinick on the Spontaneous Emergence of Learning Algorithms”, Adam Scholl
- scaling-hypothesis: “The Scaling Hypothesis”, Gwern
- https://www.lesswrong.com/posts/SmDziGM9hBjW9DKmf/2019-ai-alignment-literature-review-and-charity-comparison: “2019 AI Alignment Literature Review and Charity Comparison”, Larks
- https://www.economist.com/1843/2019/03/01/deepmind-and-google-the-battle-to-control-artificial-intelligence: “DeepMind and Google: the Battle to Control Artificial Intelligence. Demis Hassabis Founded a Company to Build the World’s Most Powerful AI. Then Google Bought Him Out. Hal Hodson Asks Who Is in Charge”, Hal Hodson
- https://melaniemitchell.me/aibook/: “Artificial Intelligence: A Guide for Thinking Humans § Prologue: Terrified”, Melanie Mitchell
- backstop: “Evolution As Backstop for Reinforcement Learning”, Gwern
- 2018-everitt.pdf: “The Alignment Problem for Bayesian History-Based Reinforcement Learners”, Tom Everitt, Marcus Hutter
- mcts-ai: “AI Risk Demos”, Gwern
- tool-ai: “Why Tool AIs Want to Be Agent AIs”, Gwern
- complexity: “Complexity No Bar to AI”, Gwern
- turing-complete: “Surprisingly Turing-Complete”, Gwern
- tank: “The Neural Net Tank Urban Legend”, Gwern
- https://dw2blog.com/2009/11/02/halloween-nightmare-scenario-early-2020s/: “Halloween Nightmare Scenario, Early 2020’s”, David Wood
- https://paulfchristiano.com/: “Homepage of Paul F. Christiano”, Paul F. Christiano