‘GPT-5’ directory
- See Also
- Gwern
- “Oh, This Old Thing?”, Gwern et al 2026
- “Apawcalypse Meow”, Gwern et al 2026
- “Fine Art versus Fiiine Art”, Gwern 2026
- “Spoilage”, Gwern & Pro 2026
- “Spoilage”, Pro et al 2026
- “Some 2025 LLM System Prompts”, Gwern et al 2025
- “Apollonian #1: The Counted & the Crowned”, Gwern et al 2025
- “LLM Challenge: Write Non-Biblical Sentences”, Gwern 2024
- “[GPT-5 Free Association Experiment for Autonomous Image Generation]”, Gwern & GPT-5 2025
- “Explain Free Energy Minimization Right Now, You Piece of S—T!”, Gwern & GPT-5 2025
- “O3 Is Full of Crimes”, Gwern 2025
- “Scaling ‘Diminishing Returns’”, Gwern 2024
- Links
- “Cream of Can”, Gwern et al 2026
- “[Poetry Typography Design Experiment: Side-By-Side Pindaric Ode]”, Pro 2026
- “Introducing GPT-5.4-Mini and GPT-5.4-Nano: Fast and Efficient Models Optimized for Coding and Subagents”, OpenAI 2026
- “GPT-5.4 Is A Substantial Upgrade”, Mowshowitz 2026
- “Inside OpenAI’s Race to Catch Up to Claude Code: Why Is the Biggest Name in AI Late to the AI Coding Revolution?”, Zeff 2026
- “Introducing GPT-5.4 [GPT-5.4 Pro]”, OpenAI 2026
- “Introducing GPT-5.4 [GPT-5.4 Thinking]”, OpenAI 2026
- “[Unique-8 That Xor to 2^16 − 1]”, Pro 2026
- “ChatGPT-5.3-Codex Is Also Good At Coding”, Mowshowitz 2026
- “Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?”, Gloaguen et al 2026
- “Introducing GPT-5.3-Codex: Expanding Codex across the Full Spectrum of Professional Work on a Computer”, OpenAI 2026
- “LLM Poetry and the ‘Greatness’ Question: Experiments by Gwern and Mercor”, Robbins 2026
- “Shipping at Inference-Speed”, Steinberger 2025
- “Introducing GPT-5.2-Codex: The Most Advanced Agentic Coding Model for Professional Software Engineering and Defensive Cybersecurity.”, OpenAI 2025
- “I Ported JustHTML from Python to JavaScript With Codex CLI & GPT-5.2 in 4.5 Hours”, Willison 2025
- “GPT-5.2 Is Frontier Only For The Frontier”, Mowshowitz 2025
- “GPT-5.2-Thinking-20251213 System Prompt”, Walls & GPT-5.2 2025
- “Introducing GPT-5.2: The Most Advanced Frontier Model for Professional Work & Long-Running Agents”, OpenAI 2025
- “Introducing GPT-5.2 Pro § Science and Math”, OpenAI 2025
- “ARTEMIS: Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing”, Lin et al 2025
- “How Well Can Gemini 3 Make a Henry James Simulator? Finally, a Benchmark for LLMs With Real-World Value”, Breen 2025
- “GPT-5.1: A Smarter, More Conversational ChatGPT § GPT-5.1 Thinking”, OpenAI 2025
- “ImpossibleBench: Measuring LLMs’ Propensity of Exploiting Test Cases”, Zhong et al 2025
- “We Tested Claude Sonnet 4.5 for Writing and Editing: 5 Tests across Blind Comparisons, Editorial Standards, and Deadlines—Here’s What Changed Our Setup”, Parrott 2025
- “Evaluating Long Context (Reasoning) Ability”
- “The QMA Singularity [GPT-5-Thinking Proves a Key Lemma]”, Aaronson 2025
- “I Talked to Sam Altman about the GPT-5 Launch Fiasco: Over Dinner, OpenAI’s CEO Addressed Criticism of GPT-5’s Rollout, the AI Bubble, Brain-Computer Interfaces, Buying Google Chrome, and More”, Heath 2025
- andonlabs @ "2025-08-13"
- “GPT-5s Are Alive: Outside Reactions, the Router and the Resurrection of GPT-4o”, Mowshowitz 2025
- “GPT-5s Are Alive: Basic Facts, Benchmarks and the Model Card”, Mowshowitz 2025
- “GPT-5 AMA With OpenAI’s Sam Altman and Some of the GPT-5 Team”, Altman 2025
- “Details about METR’s Evaluation of OpenAI GPT-5”, METR 2025
- “GPT-5 Is Here: Our Smartest, Fastest, and Most Useful Model Yet, With Thinking Built In. Available to Everyone”, OpenAI 2025
- “Introducing GPT-5 for Developers: The Best Model for Coding and Agentic Tasks [API]”, OpenAI 2025
- “GPT-5 Pro: Scaled but Efficient Parallel Test-Time Compute, to Provide the Highest Quality and Most Comprehensive Answers”, OpenAI 2025
- “GPT-5: It Just Does Stuff—Putting the AI in Charge”, Mollick 2025
- khoomeik @ "2025-08-07"
- “OpenAI Is Expected to Release a ‘Materially Better’ GPT-5 for Its Chatbot Mid-Year, Sources Say”, Hays & Rafieyan 2024
- “Microsoft Swallows OpenAI’s Core Team § Compute Is King”, Patel & Nishball 2023
- “Inside the Chaos at OpenAI: Sam Altman’s Weekend of Shock and Drama Began a Year Ago, With the Release of ChatGPT”, Hao & Warzel 2023
- “OpenAI Chief Seeks New Microsoft Funds to Build ‘Superintelligence’: Sam Altman Expects Big Tech Group Will Back Start-Up’s Mission to Create Software As Intelligent As Humans”, Murgia 2023
- wagieeacc @ "2023-10-17"
- “Altman on Scaling”, Thibs 2023
- “In Sudden Alarm, Tech Doyens Call for a Pause on ChatGPT: Tech Luminaries, Renowned Scientists, and Elon Musk Warn of an ‘Out-Of-Control Race’ to Develop and Deploy Ever-More-Powerful AI Systems § GPT-5”, Knight & Dave 2023
- “GPT-5 Scheduled To Complete Training December”, Chen 2023
- davidtayar5 @ "2023-02-10"
- “LLMs Predict My Coffee”, Dynomight 2026
- “AI Progress Is about to Speed Up”
- “ImpossibleBench”
- “Ladybird Adopts Rust, With Help from AI [GPT-5 & Claude-4]”
- “Rohan Pandey Homepage”, Pandey 2026
- “Erdős Problem #783”, Tao 2026
- “The Current SOTA Model Was Released without Safety Evals”
- “How Well Do Models Follow Their Constitutions?”
- “Did Claude 3 Opus Align Itself via Gradient Hacking?”
- “Models Have Some Pretty Funny Attractor States”
- “Microsoft Prepares for OpenAI’s GPT-5 Model”
- “OpenAI Fires an Employee for Prediction Market Insider Trading”
- sama
- Miscellaneous
- Bibliography
See Also
Gwern
“Oh, This Old Thing?”, Gwern et al 2026
“Apawcalypse Meow”, Gwern et al 2026
“Fine Art versus Fiiine Art”, Gwern 2026
“Spoilage”, Gwern & Pro 2026
“Spoilage”, Pro et al 2026
“Some 2025 LLM System Prompts”, Gwern et al 2025
“Apollonian #1: The Counted & the Crowned”, Gwern et al 2025
“LLM Challenge: Write Non-Biblical Sentences”, Gwern 2024
“[GPT-5 Free Association Experiment for Autonomous Image Generation]”, Gwern & GPT-5 2025
“Explain Free Energy Minimization Right Now, You Piece of S—T!”, Gwern & GPT-5 2025
“O3 Is Full of Crimes”, Gwern 2025
“Scaling ‘Diminishing Returns’”, Gwern 2024
Links
“Cream of Can”, Gwern et al 2026
“[Poetry Typography Design Experiment: Side-By-Side Pindaric Ode]”, Pro 2026
“Introducing GPT-5.4-Mini and GPT-5.4-Nano: Fast and Efficient Models Optimized for Coding and Subagents”, OpenAI 2026
“GPT-5.4 Is A Substantial Upgrade”, Mowshowitz 2026
https://thezvi.wordpress.com/2026/03/11/gpt-5-4-is-a-substantial-upgrade/
“Inside OpenAI’s Race to Catch Up to Claude Code: Why Is the Biggest Name in AI Late to the AI Coding Revolution?”, Zeff 2026
“Introducing GPT-5.4 [GPT-5.4 Pro]”, OpenAI 2026
“Introducing GPT-5.4 [GPT-5.4 Thinking]”, OpenAI 2026
“[Unique-8 That Xor to 2^16 − 1]”, Pro 2026
“ChatGPT-5.3-Codex Is Also Good At Coding”, Mowshowitz 2026
“Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?”, Gloaguen et al 2026
“Introducing GPT-5.3-Codex: Expanding Codex across the Full Spectrum of Professional Work on a Computer”, OpenAI 2026
“LLM Poetry and the ‘Greatness’ Question: Experiments by Gwern and Mercor”, Robbins 2026
“Shipping at Inference-Speed”, Steinberger 2025
“Introducing GPT-5.2-Codex: The Most Advanced Agentic Coding Model for Professional Software Engineering and Defensive Cybersecurity.”, OpenAI 2025
“I Ported JustHTML from Python to JavaScript With Codex CLI & GPT-5.2 in 4.5 Hours”, Willison 2025
“GPT-5.2 Is Frontier Only For The Frontier”, Mowshowitz 2025
“GPT-5.2-Thinking-20251213 System Prompt”, Walls & GPT-5.2 2025
“Introducing GPT-5.2: The Most Advanced Frontier Model for Professional Work & Long-Running Agents”, OpenAI 2025
“Introducing GPT-5.2 Pro § Science and Math”, OpenAI 2025
“ARTEMIS: Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing”, Lin et al 2025
“How Well Can Gemini 3 Make a Henry James Simulator? Finally, a Benchmark for LLMs With Real-World Value”, Breen 2025
“GPT-5.1: A Smarter, More Conversational ChatGPT § GPT-5.1 Thinking”, OpenAI 2025
“ImpossibleBench: Measuring LLMs’ Propensity of Exploiting Test Cases”, Zhong et al 2025
“We Tested Claude Sonnet 4.5 for Writing and Editing: 5 Tests across Blind Comparisons, Editorial Standards, and Deadlines—Here’s What Changed Our Setup”, Parrott 2025
“Evaluating Long Context (Reasoning) Ability”
“The QMA Singularity [GPT-5-Thinking Proves a Key Lemma]”, Aaronson 2025
“I Talked to Sam Altman about the GPT-5 Launch Fiasco: Over Dinner, OpenAI’s CEO Addressed Criticism of GPT-5’s Rollout, the AI Bubble, Brain-Computer Interfaces, Buying Google Chrome, and More”, Heath 2025
andonlabs @ "2025-08-13"
“GPT-5s Are Alive: Outside Reactions, the Router and the Resurrection of GPT-4o”, Mowshowitz 2025
“GPT-5s Are Alive: Basic Facts, Benchmarks and the Model Card”, Mowshowitz 2025
“GPT-5 AMA With OpenAI’s Sam Altman and Some of the GPT-5 Team”, Altman 2025
“Details about METR’s Evaluation of OpenAI GPT-5”, METR 2025
“GPT-5 Is Here: Our Smartest, Fastest, and Most Useful Model Yet, With Thinking Built In. Available to Everyone”, OpenAI 2025
“Introducing GPT-5 for Developers: The Best Model for Coding and Agentic Tasks [API]”, OpenAI 2025
“GPT-5 Pro: Scaled but Efficient Parallel Test-Time Compute, to Provide the Highest Quality and Most Comprehensive Answers”, OpenAI 2025
“GPT-5: It Just Does Stuff—Putting the AI in Charge”, Mollick 2025
khoomeik @ "2025-08-07"
“OpenAI Is Expected to Release a ‘Materially Better’ GPT-5 for Its Chatbot Mid-Year, Sources Say”, Hays & Rafieyan 2024
“Microsoft Swallows OpenAI’s Core Team § Compute Is King”, Patel & Nishball 2023
“Inside the Chaos at OpenAI: Sam Altman’s Weekend of Shock and Drama Began a Year Ago, With the Release of ChatGPT”, Hao & Warzel 2023
“OpenAI Chief Seeks New Microsoft Funds to Build ‘Superintelligence’: Sam Altman Expects Big Tech Group Will Back Start-Up’s Mission to Create Software As Intelligent As Humans”, Murgia 2023
wagieeacc @ "2023-10-17"
“Altman on Scaling”, Thibs 2023
“In Sudden Alarm, Tech Doyens Call for a Pause on ChatGPT: Tech Luminaries, Renowned Scientists, and Elon Musk Warn of an ‘Out-Of-Control Race’ to Develop and Deploy Ever-More-Powerful AI Systems § GPT-5”, Knight & Dave 2023
“GPT-5 Scheduled To Complete Training December”, Chen 2023
davidtayar5 @ "2023-02-10"
Context on the NVIDIA ChatGPT opportunity—and ramifications of large language model enthusiasm
“LLMs Predict My Coffee”, Dynomight 2026
“AI Progress Is about to Speed Up”
“ImpossibleBench”
“Ladybird Adopts Rust, With Help from AI [GPT-5 & Claude-4]”
“Rohan Pandey Homepage”, Pandey 2026
“Erdős Problem #783”, Tao 2026
“The Current SOTA Model Was Released without Safety Evals”
“How Well Do Models Follow Their Constitutions?”
“Did Claude 3 Opus Align Itself via Gradient Hacking?”
“Models Have Some Pretty Funny Attractor States”
“Microsoft Prepares for OpenAI’s GPT-5 Model”
“OpenAI Fires an Employee for Prediction Market Insider Trading”
sama
Miscellaneous
Bibliography
https://www.theverge.com/command-line-newsletter/759897/sam-altman-chatgpt-openai-social-media-google-chrome-interview: “I Talked to Sam Altman about the GPT-5 Launch Fiasco: Over Dinner, OpenAI’s CEO Addressed Criticism of GPT-5’s Rollout, the AI Bubble, Brain-Computer Interfaces, Buying Google Chrome, and More”
https://openai.com/index/introducing-gpt-5/#gpt-5-pro: “GPT-5 Pro: Scaled but Efficient Parallel Test-Time Compute, to Provide the Highest Quality and Most Comprehensive Answers”
https://x.com/khoomeik/status/1953560406381015259: “[GPT-5 Was a <100× GPT-4 Scaleup]”
https://www.semianalysis.com/p/microsoft-swallows-openais-core-team#%C2%A7compute-is-king: “Microsoft Swallows OpenAI’s Core Team § Compute Is King”
https://www.theatlantic.com/technology/archive/2023/11/sam-altman-open-ai-chatgpt-chaos/676050/: “Inside the Chaos at OpenAI: Sam Altman’s Weekend of Shock and Drama Began a Year Ago, With the Release of ChatGPT”
https://www.lesswrong.com/posts/CfpAXccrBvWpQw9xj/algorithmic-improvement-is-probably-faster-than-scaling-now?commentId=LnyB6PDhazjSXQbAY: “Altman on Scaling”
https://x.com/davidtayar5/status/1627690520456691712: “Context on the NVIDIA ChatGPT Opportunity—And Ramifications of Large Language Model Enthusiasm”