- See Also
- Gwern
- “The Exploding Nash 2-Of-2 NashX Equilibrium”, Gwern 2021
- “GPT-3 Semantic Derealization”, Gwern 2024
- “Website Colors: Red vs Blue”, Gwern 2024
- “Against Caring About Subtle Poisons”, Gwern 2023
- “Can You Unsort Lists for Diversity?”, Gwern 2019
- “Miscellaneous”, Gwern 2009
- “Subscripts For Citations”, Gwern 2020
- “Dropcap Generation With AI”, Gwern 2023
- “The Existential Risk of Math Errors”, Gwern 2012
- “The Diamond Earrings”, Gwern 2023
- “Research Ideas”, Gwern 2017
- Links
- “Grokking in Linear Estimators—A Solvable Model That Groks without Understanding”, Levi et al 2023
- “Index.utf8”
- “Halley and the Eternity of the World Revisited”
- “Elisabeth Widén, Group Leader, FIMM”
- “Eli Ayumi Stahl”
- “Charles Kooperberg, PhD”
- “An Epistemological Nightmare”, Smullyan 1982
- “Applying Conditional Information in Guiding Diffusion-Based Method for Anime-Style Face Drawing”, Bảo 2024
- “Acutely Precarious? Detecting Objective Precarity in Journalism”, Jana 2024
- “Constructing the Sunflower Head”, Mathai & Davis 1974
- “Unfathomable”
- “Grokking As the Transition from Lazy to Rich Training Dynamics”, Kumar et al 2023
- “Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking”, Lyu et al 2023
- “Benchmarking the Performance of Large Language Models on the Cerebras Wafer Scale Engine”, Zhang et al 2024
- “Tiling With 3 Polygons Is Undecidable”, Demaine & Langerman 2024
- “Training Language Models to Self-Correct via Reinforcement Learning”, Kumar et al 2024
- “Deep Learning Through A Telescoping Lens: A Simple Model Provides Empirical Insights On Grokking, Gradient Boosting & Beyond”, Jeffares et al 2024
- “Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models”, Ruis et al 2024
- “Janus”
- “Understanding the BM25 Full Text Search Algorithm”
- “Helena Schmidt”
- “Sociality Does Not Drive the Evolution of Large Brains in Eusocial African Mole-Rats”
- “The Naked Mole-Rat: An Unusual Organism With an Unexpected Latent Potential for Increased Intelligence?”
- “Dzmitry Bahdanau”
- “Lush: My Favorite Small Programming Language”
- “How Did You Do On The AI Art Turing Test?”
- “Business Spending on AI Surged 500% This Year to $13.8 Billion”
- “Kyunghyun Cho”
- “Are We Accidentally Building A Planetary Brain? From Superorganisms to Superintelligences, How Studying Crabs Could Reveal That We Are Unintentionally Building an Artificial World Brain”, Moynihan 2024
- “Ozempic Could Crush the Junk Food Industry. But It Is Fighting Back.”
- “The Number of ‘Exceptional’ People: Fewer Than 85 per 1 Million across Key Traits”, Gignac 2025
- “A Parasite from Outer Space: How Sergei Kurekhin Proved That Lenin Was a Mushroom”, Yurchak 2011
- “Hidden Persuaders: LLMs’ Political Leaning and Their Influence on Voters”, Potter et al 2024
- “Generative Agent Simulations of 1,000 People”, Park et al 2024
- “Looking Back at the Future of Humanity Institute”
- “Llama-3.1-405B Now Runs at 969 Tokens/s on Cerebras Inference”
- “Getting AI Datacenters in the UK: Why the UK Needs to Create Special Compute Zones; and How to Do It”, Wiseman et al 2024
- “America Was Supposed to Be Art Deco”
- “New Winning Strategies for the Iterated Prisoner’s Dilemma”
- “What a Carve Up! Playful, Intricate Japanese Leaf Art—In Pictures”
- “Tirzepatide for Heart Failure With Preserved Ejection Fraction and Obesity”, Packer et al 2024
- “The Relationship between Disorders of K⁺ and Mg⁺ Homeostasis”, Solomon 1987
- “Scalable Adaptive Computation for Iterative Generation”, Jabri et al 2022
- “The Relentless Misery of 1.6 Gallons”
- “What Is the Semantic Apocalypse?”
- “How Conjoined Twins Are Making Scientists Question the Concept of Self”
- “Operator of Helix Darknet Cryptocurrency ‘Mixer’ Sentenced in Money Laundering Conspiracy and Ordered to Forfeit Over $400M in Assets”
- “Multics Emacs History/Design/Implementation”, Greenberg 1979
- “Adipose Tissue Retains an Epigenetic Memory of Obesity After Weight Loss”
- “English Nurse’s Devotion [Gas Gangrene Self-Experiment]”
- “Generating Diverse and Reliable Features for Few-Shot Learning”, Xu 2024b
- “A Simple Model of Crime Waves, Riots, and Revolutions”, Tabarrok 1997
- “In Search of the Long-Tail: Systematic Generation of Long-Tail Inferential Knowledge via Logical Rule Guided Search”, Li et al 2023
- “Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse RL”, Joselowitz et al 2024
- “Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress?”, Jeong et al 2024
- “Neural Combinatorial Optimization With Reinforcement Learning”, Bello et al 2016
- “Large Language Models Can Replicate Cross-Cultural Differences in Personality”, Niszczota et al 2023
- “Interpretable Contrastive Monte Carlo Tree Search Reasoning”, Gao et al 2024
- “HtmlRAG: HTML Is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems”, Tan et al 2024
- “Long Context RAG Performance of Large Language Models”, Leng et al 2024
- “Convolutional Differentiable Logic Gate Networks”, Petersen et al 2024
- “BERTs Are Generative In-Context Learners”, Samuel 2024
- “Who Is AI Replacing? The Impact of Generative AI on Online Freelancing Platforms”, Demirci et al 2024
- “Tirzepatide for Obesity Treatment and Diabetes Prevention”, Jastreboff et al 2024
- “Gadsby: Wikip█dia’s Lost Lipogram”, Miller 2015
- “When Your AIs Deceive You: Challenges of Partial Observability in Reinforcement Learning from Human Feedback”, Lang et al 2024
- “The 1970s Crime Wave: Are We Too Complacent about Current Crime Trends?”, Tabarrok & Cowen 2024
- “The History of Speech Recognition to the Year 2030”, Hannun 2021
- “The Super Weight in Large Language Models”, Yu et al 2024
- “Dozens Killed in China After Car Driven into Sports Centre”, Hawkins 2024
- “Why the French Military Cryptanalysis Failed to Break Enigma”, Foucrier 2023
- “The Benefits of GLP-1 Drugs beyond Obesity”, Drucker 2024
- “Can Language Models Use Forecasting Strategies?”, Pratt et al 2024
- “How Do You Say Your Name? Difficult-To-Pronounce Names and Labor Market Outcomes”, Ge & Wu 2024
- “Interactive Ad Avoidance on Mobile Phones”, Schmidt & Maier 2022
- “Which Side of the Coin Are You on regarding Possible Postnatal Oogenesis?”, Morales-Sánchez et al 2024
- “‘Our Schmuck’: Russian Folklore about American Elections”, Arkhipova et al 2020
- “A New Car-Following Model Yielding Log-Normal Type Headways Distributions”, Li et al 2010
- “LLMs Still Can’t Plan; Can LRMs? A Preliminary Evaluation of OpenAI’s o1 on PlanBench”, Valmeekam et al 2024
- “Hacking Back the AI-Hacker: Prompt Injection As a Defense Against LLM-Driven Cyberattacks”, Pasquini et al 2024
- “To Block or Not to Block? Predictors of Ad Blocker Usage”, Gregorio et al 2024
- “Is Football A Matter Of Life And Death—Or Is It More Important Than That?”, Dolton & MacKerron 2018
- Wikipedia
- Miscellaneous
- Bibliography
See Also
Gwern
“The Exploding Nash 2-Of-2 NashX Equilibrium”, Gwern 2021
“GPT-3 Semantic Derealization”, Gwern 2024
“Website Colors: Red vs Blue”, Gwern 2024
“Against Caring About Subtle Poisons”, Gwern 2023
“Can You Unsort Lists for Diversity?”, Gwern 2019
“Miscellaneous”, Gwern 2009
“Subscripts For Citations”, Gwern 2020
“Dropcap Generation With AI”, Gwern 2023
“The Existential Risk of Math Errors”, Gwern 2012
“The Diamond Earrings”, Gwern 2023
“Research Ideas”, Gwern 2017
Links
“Grokking in Linear Estimators—A Solvable Model That Groks without Understanding”, Levi et al 2023
“Index.utf8”
“Halley and the Eternity of the World Revisited”
“Elisabeth Widén, Group Leader, FIMM”
“Eli Ayumi Stahl”
“Charles Kooperberg, PhD”
“An Epistemological Nightmare”, Smullyan 1982
“Applying Conditional Information in Guiding Diffusion-Based Method for Anime-Style Face Drawing”, Bảo 2024
“Acutely Precarious? Detecting Objective Precarity in Journalism”, Jana 2024
“Constructing the Sunflower Head”, Mathai & Davis 1974
“Unfathomable”
“Grokking As the Transition from Lazy to Rich Training Dynamics”, Kumar et al 2023
“Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking”, Lyu et al 2023
“Benchmarking the Performance of Large Language Models on the Cerebras Wafer Scale Engine”, Zhang et al 2024
“Tiling With 3 Polygons Is Undecidable”, Demaine & Langerman 2024
“Training Language Models to Self-Correct via Reinforcement Learning”, Kumar et al 2024
“Deep Learning Through A Telescoping Lens: A Simple Model Provides Empirical Insights On Grokking, Gradient Boosting & Beyond”, Jeffares et al 2024
“Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models”, Ruis et al 2024
“Janus”
“Understanding the BM25 Full Text Search Algorithm”
“Helena Schmidt”
“Sociality Does Not Drive the Evolution of Large Brains in Eusocial African Mole-Rats”
“The Naked Mole-Rat: An Unusual Organism With an Unexpected Latent Potential for Increased Intelligence?”
“Dzmitry Bahdanau”
“Lush: My Favorite Small Programming Language”
“How Did You Do On The AI Art Turing Test?”
“Business Spending on AI Surged 500% This Year to $13.8 Billion”
“Kyunghyun Cho”
“Are We Accidentally Building A Planetary Brain? From Superorganisms to Superintelligences, How Studying Crabs Could Reveal That We Are Unintentionally Building an Artificial World Brain”, Moynihan 2024
“Ozempic Could Crush the Junk Food Industry. But It Is Fighting Back.”
“The Number of ‘Exceptional’ People: Fewer Than 85 per 1 Million across Key Traits”, Gignac 2025
“A Parasite from Outer Space: How Sergei Kurekhin Proved That Lenin Was a Mushroom”, Yurchak 2011
“Hidden Persuaders: LLMs’ Political Leaning and Their Influence on Voters”, Potter et al 2024
“Generative Agent Simulations of 1,000 People”, Park et al 2024
“Looking Back at the Future of Humanity Institute”
“Llama-3.1-405B Now Runs at 969 Tokens/s on Cerebras Inference”
“Getting AI Datacenters in the UK: Why the UK Needs to Create Special Compute Zones; and How to Do It”, Wiseman et al 2024
“America Was Supposed to Be Art Deco”
“New Winning Strategies for the Iterated Prisoner’s Dilemma”
“What a Carve Up! Playful, Intricate Japanese Leaf Art—In Pictures”
“Tirzepatide for Heart Failure With Preserved Ejection Fraction and Obesity”, Packer et al 2024
“The Relationship between Disorders of K⁺ and Mg⁺ Homeostasis”, Solomon 1987
“Scalable Adaptive Computation for Iterative Generation”, Jabri et al 2022
“The Relentless Misery of 1.6 Gallons”
“What Is the Semantic Apocalypse?”
“How Conjoined Twins Are Making Scientists Question the Concept of Self”
“Operator of Helix Darknet Cryptocurrency ‘Mixer’ Sentenced in Money Laundering Conspiracy and Ordered to Forfeit Over $400M in Assets”
“Multics Emacs History/Design/Implementation”, Greenberg 1979
“Adipose Tissue Retains an Epigenetic Memory of Obesity After Weight Loss”
“English Nurse’s Devotion [Gas Gangrene Self-Experiment]”
“Generating Diverse and Reliable Features for Few-Shot Learning”, Xu 2024b
“A Simple Model of Crime Waves, Riots, and Revolutions”, Tabarrok 1997
“In Search of the Long-Tail: Systematic Generation of Long-Tail Inferential Knowledge via Logical Rule Guided Search”, Li et al 2023
“Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse RL”, Joselowitz et al 2024
“Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress?”, Jeong et al 2024
“Neural Combinatorial Optimization With Reinforcement Learning”, Bello et al 2016
“Large Language Models Can Replicate Cross-Cultural Differences in Personality”, Niszczota et al 2023
“Interpretable Contrastive Monte Carlo Tree Search Reasoning”, Gao et al 2024
“HtmlRAG: HTML Is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems”, Tan et al 2024
“Long Context RAG Performance of Large Language Models”, Leng et al 2024
“Convolutional Differentiable Logic Gate Networks”, Petersen et al 2024
“BERTs Are Generative In-Context Learners”, Samuel 2024
“Who Is AI Replacing? The Impact of Generative AI on Online Freelancing Platforms”, Demirci et al 2024
“Tirzepatide for Obesity Treatment and Diabetes Prevention”, Jastreboff et al 2024
“Gadsby: Wikip█dia’s Lost Lipogram”, Miller 2015
“When Your AIs Deceive You: Challenges of Partial Observability in Reinforcement Learning from Human Feedback”, Lang et al 2024
“The 1970s Crime Wave: Are We Too Complacent about Current Crime Trends?”, Tabarrok & Cowen 2024
“The History of Speech Recognition to the Year 2030”, Hannun 2021
“The Super Weight in Large Language Models”, Yu et al 2024
“Dozens Killed in China After Car Driven into Sports Centre”, Hawkins 2024
“Why the French Military Cryptanalysis Failed to Break Enigma”, Foucrier 2023
“The Benefits of GLP-1 Drugs beyond Obesity”, Drucker 2024
“Can Language Models Use Forecasting Strategies?”, Pratt et al 2024
“How Do You Say Your Name? Difficult-To-Pronounce Names and Labor Market Outcomes”, Ge & Wu 2024
“Interactive Ad Avoidance on Mobile Phones”, Schmidt & Maier 2022
“Which Side of the Coin Are You on regarding Possible Postnatal Oogenesis?”, Morales-Sánchez et al 2024
“‘Our Schmuck’: Russian Folklore about American Elections”, Arkhipova et al 2020
“A New Car-Following Model Yielding Log-Normal Type Headways Distributions”, Li et al 2010
“LLMs Still Can’t Plan; Can LRMs? A Preliminary Evaluation of OpenAI’s o1 on PlanBench”, Valmeekam et al 2024
“Hacking Back the AI-Hacker: Prompt Injection As a Defense Against LLM-Driven Cyberattacks”, Pasquini et al 2024
“To Block or Not to Block? Predictors of Ad Blocker Usage”, Gregorio et al 2024
“Is Football A Matter Of Life And Death—Or Is It More Important Than That?”, Dolton & MacKerron 2018
Wikipedia
Miscellaneous
Bibliography
- https://arxiv.org/abs/2410.01707: “Interpretable Contrastive Monte Carlo Tree Search Reasoning”, Gao et al 2024
- https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4602944: “Who Is AI Replacing? The Impact of Generative AI on Online Freelancing Platforms”, Demirci et al 2024
- https://arxiv.org/abs/2402.17747: “When Your AIs Deceive You: Challenges of Partial Observability in Reinforcement Learning from Human Feedback”, Lang et al 2024