‘newest links’ directory
- See Also
-
Gwern
- “Your LLM Can Be Faster Than You (Can) Think”, Gwern 2025
- “Research Bounties On Fulltexts”, Gwern 2018
- “Miscellaneous”, Gwern 2009
- “A Christmas Protestation”, o1-pro et al 2024
- “Georgism”, Gwern 2019
- “Self-Blinded Mineral Water Taste Test”, Gwern 2017
- “Towards Benchmarking LLM Diversity & Creativity”, Gwern 2024
- “Second Life Sentences”, Gwern 2024
- “Hierarchical Embeddings for Text Search”, Gwern 2024
- “Why ‘Tools For Thought’ Work Fails: Lots of Tools Work, Not Thought”, Gwern 2024
-
Links
- “Mike Stay”
- “Pamela Madden, PhD—Psychiatry”
- “Ozren Polašek”
- “Peter Holmans”
- “Items—Munroe, Patricia B—The William Harvey Research Institute—Faculty of Medicine and Dentistry”
- “The GF-11 Parallel Computer”, Kumar et al 1993
- “The Limits of Platforms: Why Disintermediation Has Failed in the Art Market”, Ricucci & Blank 2025
- “Science Needs Reason to Be Trusted”, Hossenfelder 2017
- “Reinforcement Learning for Improving Agent Design”, Ha 2018
- “SWE-Bench+: Enhanced Coding Benchmark for LLMs”, Aleithan et al 2024
- “Reinforcement Learning for Improving Agent Design [Homepage]”, Ha 2018
- “Sarah Bergen”
- “The 1⁄8th Sleep”
- “Sean Wharton”
- “Scott Gray”
- “The CRPG Renaissance, Part 3: TSR Is Dead…”
- “Laboratory Layered Latte”
- “The Crackdown on Compounded GLP-1 Meds Has Begun”
- “The Ozempic Shortage Is Over”
- “Early Cognitive Development and Psychopathology in Children at Familial High Risk for Schizophrenia”, Stephens et al 2024
- “Algorithmic Thermodynamics”, Baez & Stay 2010
- “Foundations of Algorithmic Thermodynamics”, Ebtekar & Hutter 2023
- “Idiosyncrasies in Large Language Models”, Sun et al 2025
- “Do Ambiguous Images Provide Psychological Insights? Testing a Popular Claim [No & Yes]”
- “An 83-Year-Old Short Story [‘The Library of Babel’] by Jorge Luis Borges Portends a Bleak Future for the Internet”, Kreuz 2024
- “6 Years of Meditation Data Reveals a Depressing Picture: A Mini Meta-Analysis Showing Meditation Made Me More Anxious, Frustrated, Depressed, and Less Social”, Syler 2025
- “Meditating More Made Me Sleep Better and Feel Worse: Findings from a Year’s worth of N = 1 Experiments on Meditation”, Syler 2025
- “The Irish in England”
- “Microsoft Prepares for OpenAI’s GPT-5 Model”
- “Average Case Complexity under the Universal Distribution Equals Worst-Case Complexity”, Li & Vitányi 1992
- “Catalytic Approaches to the Tree Evaluation Problem”, Cook & Mertz 2020
- “The Impact of the 2018 Tariffs on Prices and Welfare”, Amiti et al 2019
- “Effect of Glucagon-Like Peptide-1 Receptor Agonists on All-Cause Mortality and Cardiovascular Outcomes: A Meta-Analysis”, Peterson & Barry 2018b
- “A Bivariate Approach to Meta-Analysis”, Houwelingen et al 1993
- “Undersea Nuclear Forces: Survivability of Chinese, Russian, and US SSBNs”, Stefanick 2025
- “Pulling Out The Big Guns For Needle Phobia In An Insane World Where Nobody Seems To Take It Seriously”, aellagirl 2025
- “OpenAI O1-Pro Deep Research Generated Age of Empires 2 [And WH40k Etc] Playthroughs”, Simfish 2025
- “Don’t Trust: Verify—Grounding LLM Quantitative Reasoning With Autoformalization”, Zhou et al 2024
- “DeepSeek-V3 Technical Report”, DeepSeek et al 2024
- “Gravitational Effects of a Small Primordial Black Hole Passing Through the Human Body”, Scherrer 2025
- “NaturalReasoning: Reasoning in the Wild With 2.8M Challenging Questions”, Yuan et al 2025
- “The Long-Term Future of Extragalactic Astronomy”, Loeb 2001
- “Where Does In-Context Learning Happen in Large Language Models?”, Sia et al 2025
- “Optic Neuropathy in an Autistic Child With Vitamin A Deficiency: A Case Report and Literature Review”, Cheah et al 2022
- “Flame-Free Candles Are Not Pollution-Free: Scented Wax Melts As a Substantial Source of Atmospheric Nanoparticles”, Patra et al 2025
- “Radiance, Carter Scholz (2002)”, bormgans 2020
- “DS R1 Is Not on Par With OA O1, and the Difference Is Qualitative, Not Quantitative: Long-Tail Benchmarks Reveal Gaps”, Polshkov 2025
- “Trillions of Viruses Live in Your Body. A.I. Is Trying to Find Them.”
- “The Ecology of Fear: Optimal Foraging, Game Theory, and Trophic Interactions”, Brown et al 1999
- “Computing With a Full Memory: Catalytic Space”, Buhrman et al 2014
- “The Impact of Short-Lived Controls on the Interpretation of Lifespan Experiments and Progress in Geroscience—Through the Lens of the ‘900-Day Rule’”, Pabis et al 2024
- “What’s Wrong With This Lagrangean?”, Mermin 1988
- “Fixing the Internet for Real Time Applications: Part I”
- “Fixing the Internet for Real Time Applications: Part II”
- “Won’t vs. Can’t: Sandbagging-Like Behavior from Claude Models”
- “Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets”, Blattmann et al 2023
- “Lossy Catalytic Computation”, Gupta et al 2024
- “Fully Characterizing Lossy Catalytic Computation”, Folkertsma et al 2024
- “Are DeepSeek R1 And Other Reasoning Models More Faithful?”, Chua & Evans 2025
- “Native Sparse Attention (NSA): Hardware-Aligned and Natively Trainable Sparse Attention”, Yuan et al 2025
- “Meta-Statistical Learning: Supervised Learning of Statistical Inference”, Peyrard & Cho 2025
-
“
unix-In-Lisp
: Mount Unix System into Common Lisp Image” - “Hunt Allcott”
- “Fixing the Internet for Real-Time Applications: Part III”
- “What Makes Mathematicians Believe Unproven Mathematical Statements?”, Gowers 2023
- “Breaking the Barrier? Children Fail to Translate Their Preverbal Memories into Language”, Simcock & Hayne 2002
- “Portable Magnetometry for Detection of Biomagnetism in Ambient Environments”, Limes et al 2020
- “On Layer Normalization in the Transformer Architecture”, Xiong et al 2020
- “Password Similarity Using Probabilistic Data Structures”, Berardi et al 2020
- “RADD: Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data”, Ou et al 2024
- “How Far Is Video Generation from World Model: A Physical Law Perspective”, Kang et al 2024
- “Do Generative Video Models Learn Physical Principles from Watching Videos?”, Motamed et al 2025
- “SycEval: Evaluating LLM Sycophancy”, Fanous et al 2025
- “ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models”, Roberts et al 2025
- “LLaDA: Large Language Diffusion Models”, Nie et al 2025
- “Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model”, Ma et al 2025
- “Building a Princess Saving App: How to Build Learning and Fun into Your Applications”, Cook 2008
- “Once-Weekly Semaglutide in Adults With Alcohol Use Disorder: A Randomized Clinical Trial”, Hendershot et al 2025
- “Parenthood, Mental Disorders, and Symptoms Through Adulthood: A Total Population Study”, Andersen et al 2024
- “The Case against Ethics Review in the Social Sciences”, Schrag 2011
- “Geons”, Wheeler 1955
- “The Retention of Manual Flying Skills in the Automated Cockpit”, Casner et al 2014
- “Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs”, Saxena et al 2025
- Wikipedia
- Miscellaneous
- Bibliography
See Also
Gwern
“Your LLM Can Be Faster Than You (Can) Think”, Gwern 2025
“Research Bounties On Fulltexts”, Gwern 2018
“Miscellaneous”, Gwern 2009
“A Christmas Protestation”, o1-pro et al 2024
“Georgism”, Gwern 2019
“Self-Blinded Mineral Water Taste Test”, Gwern 2017
“Towards Benchmarking LLM Diversity & Creativity”, Gwern 2024
“Second Life Sentences”, Gwern 2024
“Hierarchical Embeddings for Text Search”, Gwern 2024
“Why ‘Tools For Thought’ Work Fails: Lots of Tools Work, Not Thought”, Gwern 2024
Why ‘Tools For Thought’ Work Fails: Lots of Tools Work, Not Thought
Links
“Mike Stay”
“Pamela Madden, PhD—Psychiatry”
“Ozren Polašek”
“Peter Holmans”
“Items—Munroe, Patricia B—The William Harvey Research Institute—Faculty of Medicine and Dentistry”
Items—Munroe, Patricia B—The William Harvey Research Institute—Faculty of Medicine and Dentistry
“The GF-11 Parallel Computer”, Kumar et al 1993
“The Limits of Platforms: Why Disintermediation Has Failed in the Art Market”, Ricucci & Blank 2025
The limits of platforms: Why disintermediation has failed in the art market
“Science Needs Reason to Be Trusted”, Hossenfelder 2017
“Reinforcement Learning for Improving Agent Design”, Ha 2018
“SWE-Bench+: Enhanced Coding Benchmark for LLMs”, Aleithan et al 2024
“Reinforcement Learning for Improving Agent Design [Homepage]”, Ha 2018
Reinforcement Learning for Improving Agent Design [homepage] :
View External Link:
“Sarah Bergen”
“The 1⁄8th Sleep”
“Sean Wharton”
“Scott Gray”
“The CRPG Renaissance, Part 3: TSR Is Dead…”
The CRPG Renaissance, Part 3: TSR is Dead… :
View External Link:
https://www.filfre.net/2025/02/the-crpg-renaissance-part-3-tsr-is-dead/
“Laboratory Layered Latte”
“The Crackdown on Compounded GLP-1 Meds Has Begun”
The Crackdown on Compounded GLP-1 Meds Has Begun :
View External Link:
https://www.wired.com/story/crackdown-compounded-glp-1-lilly-mounjaro-zepbound/
“The Ozempic Shortage Is Over”
The Ozempic Shortage Is Over :
View External Link:
https://www.wired.com/story/end-of-semaglutide-shortage-glp1-ozempic-wegovy/
“Early Cognitive Development and Psychopathology in Children at Familial High Risk for Schizophrenia”, Stephens et al 2024
Early cognitive development and psychopathology in children at familial high risk for schizophrenia
“Algorithmic Thermodynamics”, Baez & Stay 2010
“Foundations of Algorithmic Thermodynamics”, Ebtekar & Hutter 2023
“Idiosyncrasies in Large Language Models”, Sun et al 2025
“Do Ambiguous Images Provide Psychological Insights? Testing a Popular Claim [No & Yes]”
Do ambiguous images provide psychological insights? Testing a popular claim [no & yes]
“An 83-Year-Old Short Story [‘The Library of Babel’] by Jorge Luis Borges Portends a Bleak Future for the Internet”, Kreuz 2024
“6 Years of Meditation Data Reveals a Depressing Picture: A Mini Meta-Analysis Showing Meditation Made Me More Anxious, Frustrated, Depressed, and Less Social”, Syler 2025
“Meditating More Made Me Sleep Better and Feel Worse: Findings from a Year’s worth of N = 1 Experiments on Meditation”, Syler 2025
“The Irish in England”
“Microsoft Prepares for OpenAI’s GPT-5 Model”
“Average Case Complexity under the Universal Distribution Equals Worst-Case Complexity”, Li & Vitányi 1992
Average case complexity under the universal distribution equals worst-case complexity
“Catalytic Approaches to the Tree Evaluation Problem”, Cook & Mertz 2020
Catalytic Approaches to the Tree Evaluation Problem :
View PDF:
“The Impact of the 2018 Tariffs on Prices and Welfare”, Amiti et al 2019
“Effect of Glucagon-Like Peptide-1 Receptor Agonists on All-Cause Mortality and Cardiovascular Outcomes: A Meta-Analysis”, Peterson & Barry 2018b
“A Bivariate Approach to Meta-Analysis”, Houwelingen et al 1993
“Undersea Nuclear Forces: Survivability of Chinese, Russian, and US SSBNs”, Stefanick 2025
Undersea nuclear forces: Survivability of Chinese, Russian, and US SSBNs
“Pulling Out The Big Guns For Needle Phobia In An Insane World Where Nobody Seems To Take It Seriously”, aellagirl 2025
“OpenAI O1-Pro Deep Research Generated Age of Empires 2 [And WH40k Etc] Playthroughs”, Simfish 2025
OpenAI o1-pro Deep Research generated Age of Empires 2 [and WH40k etc] playthroughs
“Don’t Trust: Verify—Grounding LLM Quantitative Reasoning With Autoformalization”, Zhou et al 2024
Don’t Trust: Verify—Grounding LLM Quantitative Reasoning with Autoformalization
“DeepSeek-V3 Technical Report”, DeepSeek et al 2024
“Gravitational Effects of a Small Primordial Black Hole Passing Through the Human Body”, Scherrer 2025
Gravitational Effects of a Small Primordial Black Hole Passing Through the Human Body
“NaturalReasoning: Reasoning in the Wild With 2.8M Challenging Questions”, Yuan et al 2025
NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions
“The Long-Term Future of Extragalactic Astronomy”, Loeb 2001
“Where Does In-Context Learning Happen in Large Language Models?”, Sia et al 2025
Where does In-context Learning Happen in Large Language Models?
“Optic Neuropathy in an Autistic Child With Vitamin A Deficiency: A Case Report and Literature Review”, Cheah et al 2022
Optic Neuropathy in an Autistic Child With Vitamin A Deficiency: A Case Report and Literature Review
“Flame-Free Candles Are Not Pollution-Free: Scented Wax Melts As a Substantial Source of Atmospheric Nanoparticles”, Patra et al 2025
“Radiance, Carter Scholz (2002)”, bormgans 2020
“DS R1 Is Not on Par With OA O1, and the Difference Is Qualitative, Not Quantitative: Long-Tail Benchmarks Reveal Gaps”, Polshkov 2025
“Trillions of Viruses Live in Your Body. A.I. Is Trying to Find Them.”
Trillions of Viruses Live in Your Body. A.I. Is Trying to Find Them.
“The Ecology of Fear: Optimal Foraging, Game Theory, and Trophic Interactions”, Brown et al 1999
The Ecology of Fear: Optimal Foraging, Game Theory, and Trophic Interactions
“Computing With a Full Memory: Catalytic Space”, Buhrman et al 2014
“The Impact of Short-Lived Controls on the Interpretation of Lifespan Experiments and Progress in Geroscience—Through the Lens of the ‘900-Day Rule’”, Pabis et al 2024
“What’s Wrong With This Lagrangean?”, Mermin 1988
“Fixing the Internet for Real Time Applications: Part I”
“Fixing the Internet for Real Time Applications: Part II”
“Won’t vs. Can’t: Sandbagging-Like Behavior from Claude Models”
Won’t vs. Can’t: Sandbagging-like Behavior from Claude Models
“Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets”, Blattmann et al 2023
Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
“Lossy Catalytic Computation”, Gupta et al 2024
“Fully Characterizing Lossy Catalytic Computation”, Folkertsma et al 2024
“Are DeepSeek R1 And Other Reasoning Models More Faithful?”, Chua & Evans 2025
“Native Sparse Attention (NSA): Hardware-Aligned and Natively Trainable Sparse Attention”, Yuan et al 2025
Native Sparse Attention (NSA): Hardware-Aligned and Natively Trainable Sparse Attention
“Meta-Statistical Learning: Supervised Learning of Statistical Inference”, Peyrard & Cho 2025
Meta-Statistical Learning: Supervised Learning of Statistical Inference
“unix-In-Lisp
: Mount Unix System into Common Lisp Image”
“Hunt Allcott”
“Fixing the Internet for Real-Time Applications: Part III”
“What Makes Mathematicians Believe Unproven Mathematical Statements?”, Gowers 2023
What Makes Mathematicians Believe Unproven Mathematical Statements?
“Breaking the Barrier? Children Fail to Translate Their Preverbal Memories into Language”, Simcock & Hayne 2002
Breaking the Barrier? Children Fail to Translate Their Preverbal Memories into Language
“Portable Magnetometry for Detection of Biomagnetism in Ambient Environments”, Limes et al 2020
Portable magnetometry for detection of biomagnetism in ambient environments
“On Layer Normalization in the Transformer Architecture”, Xiong et al 2020
“Password Similarity Using Probabilistic Data Structures”, Berardi et al 2020
“RADD: Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data”, Ou et al 2024
RADD: Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data
“How Far Is Video Generation from World Model: A Physical Law Perspective”, Kang et al 2024
How Far is Video Generation from World Model: A Physical Law Perspective
“Do Generative Video Models Learn Physical Principles from Watching Videos?”, Motamed et al 2025
Do generative video models learn physical principles from watching videos?
“SycEval: Evaluating LLM Sycophancy”, Fanous et al 2025
“ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models”, Roberts et al 2025
ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models
“LLaDA: Large Language Diffusion Models”, Nie et al 2025
“Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model”, Ma et al 2025
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model
“Building a Princess Saving App: How to Build Learning and Fun into Your Applications”, Cook 2008
Building a Princess Saving App: How to build learning and fun into your applications
“Once-Weekly Semaglutide in Adults With Alcohol Use Disorder: A Randomized Clinical Trial”, Hendershot et al 2025
Once-Weekly Semaglutide in Adults With Alcohol Use Disorder: A Randomized Clinical Trial
“Parenthood, Mental Disorders, and Symptoms Through Adulthood: A Total Population Study”, Andersen et al 2024
Parenthood, Mental Disorders, and Symptoms Through Adulthood: A Total Population Study
“The Case against Ethics Review in the Social Sciences”, Schrag 2011
“Geons”, Wheeler 1955
“The Retention of Manual Flying Skills in the Automated Cockpit”, Casner et al 2014
The Retention of Manual Flying Skills in the Automated Cockpit
“Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs”, Saxena et al 2025
Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs
Wikipedia
Miscellaneous
Bibliography
-
https://arxiv.org/abs/1810.03779#google
: “Reinforcement Learning for Improving Agent Design”, -
https://arxiv.org/abs/2403.18120#google
: “Don’t Trust: Verify—Grounding LLM Quantitative Reasoning With Autoformalization”, -
https://toloka.ai/blog/r1-is-not-on-par-with-o1-and-the-difference-is-qualitative-not-quantitative/
: “DS R1 Is Not on Par With OA O1, and the Difference Is Qualitative, Not Quantitative: Long-Tail Benchmarks Reveal Gaps”, -
https://arxiv.org/abs/2501.08156
: “Are DeepSeek R1 And Other Reasoning Models More Faithful?”, -
https://arxiv.org/abs/2501.09038#deepmind
: “Do Generative Video Models Learn Physical Principles from Watching Videos?”, -
https://arxiv.org/abs/2502.09992
: “LLaDA: Large Language Diffusion Models”, -
https://arxiv.org/abs/2502.10248#stepfun
: “Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model”, -
2008-cook.pdf
: “Building a Princess Saving App: How to Build Learning and Fun into Your Applications”,