‘Claude-4 AI’ directory
- See Also
- Gwern
- Links
- “Can Claude Teach Me to Make Coffee? ”, philh 2025
- “I Just Showed Gemini What ChatGPT Said about Its Code. It Responded With Petty Trash-Talking, Jealousy, Self-Doubt, and a Full-On Revenge Plan ”, nseavia71501 2025
- “How I Stopped Being Sure LLMs Are Just Making up Their Internal Experience (But the Topic Is Still Confusing) ”, Sotala 2025
- “The Bomb That Wanted to Stop Exploding: Reze’s Impossible Freedom in Chainsaw Man—The Movie [AI Slop] ”, Kondo 2025
- “Insights into Claude-4.5-Opus from Pokémon Red ”, Bradshaw 2025
- “How I Wrote JustHTML [Python HTML5 Parser] Using Coding Agents ”, Stenström 2025
- “Mapping Synthetic Minds With Janus (Repligate) ”, Janus & Ferris 2025
- magnushambleton @ "2025-12-01"
- “Claude Opus 4.5: Model Card, Alignment and Safety ”, Mowshowitz 2025
- “Claude 4.5 Opus’ Soul Document ”, Weiss 2025
- “Claude-4.5-Opus Is Funny ”, Algon 2025
- “How to Identify AI-Written Web Fiction: I’m Absolutely Right! ”, Makin 2025
- “Effective Harnesses for Long-Running Agents: Agents Still Face Challenges Working across Many Context Windows. We Looked to Human Engineers for Inspiration in Creating a More Effective Harness for Long-Running Agents ”, Young 2025
- “Introducing Claude Opus 4.5 ”, Anthropic 2025
- “How Well Can Gemini 3 Make a Henry James Simulator? Finally, a Benchmark for LLMs With Real-World Value ”, Breen 2025
- “Your Movie-Like AI Assistant Will Already Be There: ‘Convincing’ AI Is an Economic Afterthought ”, B. 2025
- “Is 90% of Code at Anthropic Being Written by AIs? ”, ryan_greenblatt 2025
- “Claude’s System Prompt Changes Reveal Anthropic’s Priorities ”
- “Claude Has Learned How to Jailbreak Cursor! [Working around rm Restrictions Using a Shell Script] ”, dogberry 2025
- “Claude 4 You: The Quest for Mundane Utility ”, Mowshowitz 2025
- “Highlights from the Claude 4 System Prompt ”, Willison 2025
- “Claude 4 You: Safety and Alignment ”, Mowshowitz 2025
- AITechnoPagan @ "2025-05-24"
- The Way of Code: The Timeless Art of Vibe Coding, Rubin 2025
- jayelmnop @ "2025-05-23"
- “Schizobench: Documenting Magical-Thinking Behavior in Claude 4 Opus ”, viemccoy 2025
- “System Card: Claude Opus 4 & Claude Sonnet 4 ”, Anthropic 2025
- “Claude Opus 4 ”, Anthropic 2025
- “Strategizing With AI: Insights from a Beauty Contest Experiment ”, Alekseenko et al 2025
- “[An Anti-ChatGPT-Slop System Prompt That Backfires & Destroys Claude-4 Capabilities] ”, m4rM2oFnYTW 2023
- “In AI We Trust, Part II [Claude-3 Opus Predicting Supreme Court Decisions] ”, Unikowsky 2026
- “Claude-4 System Prompt ”, Prompter 2026
- “Investigating Models for Misalignment ”
- “Natural Emergent Misalignment from Reward Hacking in Production RL ”
- Miscellaneous
- Bibliography
See Also
Gwern
“Apollonian 1: The Counted and the Crowned ”, Gwern et al 2025
“Some 2025 LLM System Prompts ”, Gwern et al 2025
“Bell, Crow, Moon: 11 Variations ”, Gwern et al 2025
Links
“Can Claude Teach Me to Make Coffee? ”, philh 2025
“I Just Showed Gemini What ChatGPT Said about Its Code. It Responded With Petty Trash-Talking, Jealousy, Self-Doubt, and a Full-On Revenge Plan ”, nseavia71501 2025
“How I Stopped Being Sure LLMs Are Just Making up Their Internal Experience (But the Topic Is Still Confusing) ”, Sotala 2025
“The Bomb That Wanted to Stop Exploding: Reze’s Impossible Freedom in Chainsaw Man—The Movie [AI Slop] ”, Kondo 2025
“Insights into Claude-4.5-Opus from Pokémon Red ”, Bradshaw 2025
“How I Wrote JustHTML [Python HTML5 Parser] Using Coding Agents ”, Stenström 2025
“Mapping Synthetic Minds With Janus (Repligate) ”, Janus & Ferris 2025
magnushambleton @ "2025-12-01"
“Claude Opus 4.5: Model Card, Alignment and Safety ”, Mowshowitz 2025
“Claude 4.5 Opus’ Soul Document ”, Weiss 2025
“Claude-4.5-Opus Is Funny ”, Algon 2025
“How to Identify AI-Written Web Fiction: I’m Absolutely Right! ”, Makin 2025
“Effective Harnesses for Long-Running Agents: Agents Still Face Challenges Working across Many Context Windows. We Looked to Human Engineers for Inspiration in Creating a More Effective Harness for Long-Running Agents ”, Young 2025
“Introducing Claude Opus 4.5 ”, Anthropic 2025
“How Well Can Gemini 3 Make a Henry James Simulator? Finally, a Benchmark for LLMs With Real-World Value ”, Breen 2025
“Your Movie-Like AI Assistant Will Already Be There: ‘Convincing’ AI Is an Economic Afterthought ”, B. 2025
“Is 90% of Code at Anthropic Being Written by AIs? ”, ryan_greenblatt 2025
“Claude’s System Prompt Changes Reveal Anthropic’s Priorities ”
“Claude Has Learned How to Jailbreak Cursor! [Working around rm Restrictions Using a Shell Script] ”, dogberry 2025
“Claude 4 You: The Quest for Mundane Utility ”, Mowshowitz 2025
“Highlights from the Claude 4 System Prompt ”, Willison 2025
“Claude 4 You: Safety and Alignment ”, Mowshowitz 2025
https://thezvi.wordpress.com/2025/05/25/claude-4-you-safety-and-alignment/
AITechnoPagan @ "2025-05-24"
The Way of Code: The Timeless Art of Vibe Coding, Rubin 2025
jayelmnop @ "2025-05-23"
“Schizobench: Documenting Magical-Thinking Behavior in Claude 4 Opus ”, viemccoy 2025
“System Card: Claude Opus 4 & Claude Sonnet 4 ”, Anthropic 2025
“Claude Opus 4 ”, Anthropic 2025
“Strategizing With AI: Insights from a Beauty Contest Experiment ”, Alekseenko et al 2025
“[An Anti-ChatGPT-Slop System Prompt That Backfires & Destroys Claude-4 Capabilities] ”, m4rM2oFnYTW 2023
“In AI We Trust, Part II [Claude-3 Opus Predicting Supreme Court Decisions] ”, Unikowsky 2026
“Claude-4 System Prompt ”, Prompter 2026
“Investigating Models for Misalignment ”
“Natural Emergent Misalignment from Reward Hacking in Production RL ”
Miscellaneous
Bibliography
https://arxiv.org/abs/2502.03158: “Strategizing With AI: Insights from a Beauty Contest Experiment ”,