newest links tag

See Also

Gwern

“Miscellaneous”, Gwern 2009

“Novelty Nets: Classifier Anti-Guidance”, Gwern 2024

“InvertOrNot.com Proposal”, Gwern 2021

“The Second Apocalypse: Freedom In An Unfree Universe”, Gwern 2017

“Review Of The Quantum Thief Trilogy”, Gwern 2022

“Interpreting ‘Suzanne Delage’ As Dracula”, Gwern 2009

“Research Ideas”, Gwern 2017

“What Is The Morning Writing Effect?”, Gwern 2011

“Absolute Unit NNs: Regression-Based MLPs for Everything”, Gwern 2023

“Danbooru2021: A Large-Scale Crowdsourced & Tagged Anime Illustration Dataset”, Gwern 2015

Links

“VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time”, Xu et al 2024

“InterNyet: Why the Soviet Union Did Not Build a Nationwide Computer Network”, Gerovitch 2008

“Comparative Study of Model-Based and Model-Free Reinforcement Learning Control Performance in HVAC Systems”, Gao & Wang 2023

“Learning Transformer Programs”, Friedman et al 2023

“Demystifying RCE Vulnerabilities in LLM-Integrated Apps”, Liu et al 2023

“How to Train Data-Efficient LLMs”, Sachdeva et al 2024

“CATS: Contextually-Aware Thresholding for Sparsity in Large Language Models”, Lee et al 2024

“Foundational Challenges in Assuring Alignment and Safety of Large Language Models”, Anwar et al 2024

“Chronic Consumption of Fructose Rich Soft Drinks Alters Tissue Lipids of Rats”, Botezelli et al 2010

“Community Surveillance of Respiratory Viruses Among Families in the Utah Better Identification of Germs-Longitudinal Viral Epidemiology (BIG-LoVE) Study”, Byington et al 2015

“From Here to There; Or, If Cooperative Ownership Is So Desirable, Why Are There So Few Cooperatives?”, Elster 1989

“Why Are Worker Cooperatives So Rare?”, Kremer 1997

“Equality Under Threat by the Talented: Evidence from Worker-Managed Firms”, Burdín 2016

“Are Children Spending Too Much Time on Enrichment Activities?”, Caetano et al 2024

“IJON: Exploring Deep State Spaces via Fuzzing”, Aschermann et al 2020

“Artificial Intelligence for Retrosynthetic Planning Needs Both Data and Expert Knowledge”, Strieth-Kalthoff et al 2024

“A Test of Exogeneity Without Instrumental Variables in Models With Bunching”, Caetano 2015

“Correcting for Endogeneity in Models With Bunching”, Caetano et al 2023

“MiniCPM: Unveiling the Potential of Small Language Models With Scalable Training Strategies”, Hu et al 2024

“Emergence of Large-Scale Mechanical Spiral Waves in Bacterial Living Matter”, Liu et al 2024

“Chinchilla Scaling: A Replication Attempt”, Besiroglu et al 2024

“Long-Form Music Generation With Latent Diffusion”, Evans et al 2024

“Jake Sullivan: Minneapolis Native among Those to Hatch Iranian Nuclear Deal”, Henry 2013

“Brain Changes Associated With Long-Term Ketamine Abuse, A Systematic Review”, Strous et al 2022

“Target Specificity of the Felixer Grooming ‘Trap’”, Read et al 2019

“Effectiveness of the Felixer Grooming Trap for the Control of Feral Cats: a Field Trial in Arid South Australia”, Moseby et al 2020

“A Dead Reckoning/Map Correlation System for Automatic Vehicle Tracking”, Lezniak et al 1977

“Taxonomy and Nomenclature for the Stone Domain in New England”, Thorson 2023

“Mixture-Of-Depths: Dynamically Allocating Compute in Transformer-Based Language Models”, Raposo et al 2024

“RecurrentGemma: Moving Past Transformers for Efficient Open Language Models”, Botev et al 2024

“Megalodon: Efficient LLM Pretraining and Inference With Unlimited Context Length”, Ma et al 2024

“The Illusion of State in State-Space Models”, Merrill et al 2024

“Construction and Characterization of an Effector Strain of Streptococcus mutans for Replacement Therapy of Dental Caries”, Hillman et al 2000

“Salivary pH and Glucose After Consuming Various Beverages, Including Sugar-Containing Drinks”, Meurman et al 1987

“The Long-Run Impacts of Adolescent Drinking: Evidence from Zero Tolerance Laws”, Abboud et al 2024

“Sciama’s Argument on Life in a Random Universe: Distinguishing Apples from Oranges”, Wang & Braunstein 2023

“MetaAligner: Conditional Weak-To-Strong Correction for Generalizable Multi-Objective Alignment of Language Models”, Yang et al 2024

“JetMoE: Reaching Llama2 Performance With 0.1M Dollars”, Shen et al 2024

“Why Do Small Language Models Underperform? Studying Language Model Saturation via the Softmax Bottleneck”, Godey et al 2024

“Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies”, Li et al 2024

“Dataset Reset Policy Optimization for RLHF”, Chang et al 2024

“Is ChatGPT Transforming Academics’ Writing Style?”, Geng & Trotta 2024

“Pre-Training Small Base LMs With Fewer Tokens”, Sanyal et al 2024

“What If There Were No Personality Factors? Comparing the Predictability of Behavioral Act Frequencies from a Big-Five and a Maximal-Dimensional Item Set”, Altgassen et al 2023

“Genetic Gains Underpinning a Little-Known Strawberry Green Revolution”, Feldmann et al 2024

“Essay Content and Style Are Strongly Related to Household Income and SAT Scores: Evidence from 60,000 Undergraduate Applications”, Alvero et al 2021

“CartoonizeDiff: Diffusion-Based Photo Cartoonization Scheme”, Jeon et al 2024

“Union Reform, Performance Pay, and New Teacher Supply: Evidence from Wisconsin’s Act 10”, Baron 2021

“Suicidal Thoughts and Behaviors Among Children and Adolescents With Autism Spectrum Disorder”, Schindel et al 2024

“Comparing Predictive Validity in a Community Sample: High-Dimensionality and Traditional Domain-And-Facet Structures of Personality Variation”, Saucier et al 2020

Link Bibliography

2008-gerovitch.pdf: “InterNyet: Why the Soviet Union Did Not Build a Nationwide Computer Network”, Slava Gerovitch

2023-gao.pdf: “Comparative Study of Model-Based and Model-Free Reinforcement Learning Control Performance in HVAC Systems”, Cheng Gao, Dan Wang

1989-elster.pdf: “From Here to There; Or, If Cooperative Ownership Is So Desirable, Why Are There So Few Cooperatives?”, Jon Elster

2024-caetano.pdf: “Are Children Spending Too Much Time on Enrichment Activities?”, Carolina Caetano, Gregorio Caetano, Eric Nielsen

2024-striethkalthoff.pdf: “Artificial Intelligence for Retrosynthetic Planning Needs Both Data and Expert Knowledge”, Felix Strieth-Kalthoff, Sara Szymkuć, Karol Molga, Alán Aspuru-Guzik, Frank Glorius, Bartosz A. Grzybowski

https://arxiv.org/abs/2404.10102: “Chinchilla Scaling: A Replication Attempt”, Tamay Besiroglu, Ege Erdil, Matthew Barnett, Josh You

https://arxiv.org/abs/2404.08801#facebook: “Megalodon: Efficient LLM Pretraining and Inference With Unlimited Context Length”, Xuezhe Ma, Xiaomeng Yang, Wenhan Xiong, Beidi Chen, Lili Yu, Hao Zhang, Jonathan May, Luke Zettlemoyer

https://arxiv.org/abs/2404.08495: “Dataset Reset Policy Optimization for RLHF”, Jonathan D. Chang, Wenhao Shan, Owen Oertell, Kianté Brantley, Dipendra Misra, Jason D. Lee, Wen Sun

https://arxiv.org/abs/2404.08634: “Pre-Training Small Base LMs With Fewer Tokens”, Sunny Sanyal, Sujay Sanghavi, Alexandros G. Dimakis
