See Also
Gwern
- “Miscellaneous”, Gwern 2009
- “Novelty Nets: Classifier Anti-Guidance”, Gwern 2024
- “InvertOrNot.com Proposal”, Gwern 2021
- “The Second Apocalypse: Freedom In An Unfree Universe”, Gwern 2017
- “Review Of The Quantum Thief Trilogy”, Gwern 2022
- “Interpreting ‘Suzanne Delage’ As Dracula”, Gwern 2009
- “Research Ideas”, Gwern 2017
- “What Is The Morning Writing Effect?”, Gwern 2011
- “Absolute Unit NNs: Regression-Based MLPs for Everything”, Gwern 2023
- “Danbooru2021: A Large-Scale Crowdsourced & Tagged Anime Illustration Dataset”, Gwern 2015
Links
- “VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time”, Xu et al 2024
- “InterNyet: Why the Soviet Union Did Not Build a Nationwide Computer Network”, Gerovitch 2008
- “Comparative Study of Model-Based and Model-Free Reinforcement Learning Control Performance in HVAC Systems”, Gao & Wang 2023
- “Learning Transformer Programs”, Friedman et al 2023
- “Demystifying RCE Vulnerabilities in LLM-Integrated Apps”, Liu et al 2023
- “How to Train Data-Efficient LLMs”, Sachdeva et al 2024
- “CATS: Contextually-Aware Thresholding for Sparsity in Large Language Models”, Lee et al 2024
- “Foundational Challenges in Assuring Alignment and Safety of Large Language Models”, Anwar et al 2024
- “Chronic Consumption of Fructose Rich Soft Drinks Alters Tissue Lipids of Rats”, Botezelli et al 2010
- “Community Surveillance of Respiratory Viruses Among Families in the Utah Better Identification of Germs-Longitudinal Viral Epidemiology (BIG-LoVE) Study”, Byington et al 2015
- “From Here to There; Or, If Cooperative Ownership Is So Desirable, Why Are There So Few Cooperatives?”, Elster 1989
- “Why Are Worker Cooperatives So Rare?”, Kremer 1997
- “Equality Under Threat by the Talented: Evidence from Worker-Managed Firms”, Burdín 2016
- “Are Children Spending Too Much Time on Enrichment Activities?”, Caetano et al 2024
- “IJON: Exploring Deep State Spaces via Fuzzing”, Aschermann et al 2020
- “Artificial Intelligence for Retrosynthetic Planning Needs Both Data and Expert Knowledge”, Strieth-Kalthoff et al 2024
- “A Test of Exogeneity Without Instrumental Variables in Models With Bunching”, Caetano 2015
- “Correcting for Endogeneity in Models With Bunching”, Caetano et al 2023
- “MiniCPM: Unveiling the Potential of Small Language Models With Scalable Training Strategies”, Hu et al 2024
- “Emergence of Large-Scale Mechanical Spiral Waves in Bacterial Living Matter”, Liu et al 2024
- “Chinchilla Scaling: A Replication Attempt”, Besiroglu et al 2024
- “Long-Form Music Generation With Latent Diffusion”, Evans et al 2024
- “Jake Sullivan: Minneapolis Native among Those to Hatch Iranian Nuclear Deal”, Henry 2013
- “Brain Changes Associated With Long-Term Ketamine Abuse, A Systematic Review”, Strous et al 2022
- “Target Specificity of the Felixer Grooming ‘Trap’”, Read et al 2019
- “Effectiveness of the Felixer Grooming Trap for the Control of Feral Cats: A Field Trial in Arid South Australia”, Moseby et al 2020
- “A Dead Reckoning/Map Correlation System for Automatic Vehicle Tracking”, Lezniak et al 1977
- “Taxonomy and Nomenclature for the Stone Domain in New England”, Thorson 2023
- “Mixture-of-Depths: Dynamically Allocating Compute in Transformer-Based Language Models”, Raposo et al 2024
- “RecurrentGemma: Moving Past Transformers for Efficient Open Language Models”, Botev et al 2024
- “Megalodon: Efficient LLM Pretraining and Inference With Unlimited Context Length”, Ma et al 2024
- “The Illusion of State in State-Space Models”, Merrill et al 2024
- “Construction and Characterization of an Effector Strain of Streptococcus Mutans for Replacement Therapy of Dental Caries”, Hillman et al 2000
- “Salivary pH and Glucose After Consuming Various Beverages, Including Sugar-Containing Drinks”, Meurman et al 1987
- “The Long-Run Impacts of Adolescent Drinking: Evidence from Zero Tolerance Laws”, Abboud et al 2024
- “Sciama’s Argument on Life in a Random Universe: Distinguishing Apples from Oranges”, Wang & Braunstein 2023
- “MetaAligner: Conditional Weak-to-Strong Correction for Generalizable Multi-Objective Alignment of Language Models”, Yang et al 2024
- “JetMoE: Reaching Llama2 Performance With 0.1M Dollars”, Shen et al 2024
- “Why Do Small Language Models Underperform? Studying Language Model Saturation via the Softmax Bottleneck”, Godey et al 2024
- “Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies”, Li et al 2024
- “Dataset Reset Policy Optimization for RLHF”, Chang et al 2024
- “Is ChatGPT Transforming Academics’ Writing Style?”, Geng & Trotta 2024
- “Pre-Training Small Base LMs With Fewer Tokens”, Sanyal et al 2024
- “What If There Were No Personality Factors? Comparing the Predictability of Behavioral Act Frequencies from a Big-Five and a Maximal-Dimensional Item Set”, Altgassen et al 2023
- “Genetic Gains Underpinning a Little-Known Strawberry Green Revolution”, Feldmann et al 2024
- “Essay Content and Style Are Strongly Related to Household Income and SAT Scores: Evidence from 60,000 Undergraduate Applications”, Alvero et al 2021
- “CartoonizeDiff: Diffusion-Based Photo Cartoonization Scheme”, Jeon et al 2024
- “Union Reform, Performance Pay, and New Teacher Supply: Evidence from Wisconsin’s Act 10”, Baron 2021
- “Suicidal Thoughts and Behaviors Among Children and Adolescents With Autism Spectrum Disorder”, Schindel et al 2024
- “Comparing Predictive Validity in a Community Sample: High-Dimensionality and Traditional Domain-and-Facet Structures of Personality Variation”, Saucier et al 2020
Miscellaneous
- /doc/biology/2015-byington-figure1-percentageoftimewithatleastoneinfectedpersoninhousehold.jpg
- https://twitter.com/alexalbert__/status/1780707227130863674
- https://twitter.com/davis_yoshida/status/1780733741457088759
- https://arstechnica.com/information-technology/2017/07/winamp-how-greatest-mp3-player-undid-itself/
- https://github.com/azzamsa/awesome-lisp-companies/?tab=readme-ov-file#awesome-lisp-company
- https://github.com/curiousjp/toy_sd_genetics?tab=readme-ov-file#toy_sd_genetics
- https://github.com/spawnmason/randar-explanation/blob/master/README
- https://lilianweng.github.io/posts/2024-04-12-diffusion-video/
- https://longreads.com/2024/04/18/crohns-life-without-eating/
- https://squareallworthy.tumblr.com/post/163790039847/everyone-will-not-just
- https://www.architecturaldigest.com/story/step-inside-all-new-art-deco-orient-express-train
- https://www.wired.com/story/yahoo-boys-real-time-deepfake-scams/
Link Bibliography
- 2008-gerovitch.pdf: “InterNyet: Why the Soviet Union Did Not Build a Nationwide Computer Network”, Slava Gerovitch
- 2023-gao.pdf: “Comparative Study of Model-Based and Model-Free Reinforcement Learning Control Performance in HVAC Systems”, Cheng Gao, Dan Wang
- 1989-elster.pdf: “From Here to There; Or, If Cooperative Ownership Is So Desirable, Why Are There So Few Cooperatives?”, Jon Elster
- 2024-caetano.pdf: “Are Children Spending Too Much Time on Enrichment Activities?”, Carolina Caetano, Gregorio Caetano, Eric Nielsen
- 2024-striethkalthoff.pdf: “Artificial Intelligence for Retrosynthetic Planning Needs Both Data and Expert Knowledge”, Felix Strieth-Kalthoff, Sara Szymkuć, Karol Molga, Alán Aspuru-Guzik, Frank Glorius, Bartosz A. Grzybowski
- https://arxiv.org/abs/2404.10102: “Chinchilla Scaling: A Replication Attempt”, Tamay Besiroglu, Ege Erdil, Matthew Barnett, Josh You
- https://arxiv.org/abs/2404.08801#facebook: “Megalodon: Efficient LLM Pretraining and Inference With Unlimited Context Length”
- https://arxiv.org/abs/2404.08495: “Dataset Reset Policy Optimization for RLHF”, Jonathan D. Chang, Wenhao Shan, Owen Oertell, Kianté Brantley, Dipendra Misra, Jason D. Lee, Wen Sun
- https://arxiv.org/abs/2404.08634: “Pre-Training Small Base LMs With Fewer Tokens”, Sunny Sanyal, Sujay Sanghavi, Alexandros G. Dimakis