Mixture of Parrots: Experts improve memorization more than reasoning
Industrious Dice [Minimizing Pip Counts on Still-Functional Dice]
An Intuitive Explanation of Black-Scholes: I Explain the Black-Scholes Formula Using Only Basic Probability Theory and Calculus, With a Focus on the Big Picture and Intuition over Technical Details
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
I Have Played a Little Bit With OpenAI’s New Iteration, GPT-4 O1
‘He Was in Mystic Delirium’: Was This Hermit Mathematician Alexander Grothendieck a Forgotten Genius Whose Ideas Could Transform AI—Or a Lonely Madman?
Statistical Patterns in the Equations of Physics and the Emergence of a Meta-Law of Nature
Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process
MCTSr: Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMA-3-8B
AI Will Become Mathematicians’ ‘Co-Pilot’: Fields Medalist Terence Tao explains how proof checkers and AI programs are dramatically changing mathematics
OmegaPRM: Improve Mathematical Reasoning in Language Models by Automated Process Supervision
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark
The Lessons of Hermann Grassmann and the Nature of Abstractions
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data
GSM1k: A Careful Examination of Large Language Model Performance on Grade School Arithmetic
Wu’s Method can Boost Symbolic AI to Rival Silver Medalists and AlphaGeometry to Outperform Gold Medalists at IMO Geometry
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap
Tokenization counts: the impact of tokenization on arithmetic in frontier LLMs
Autonomous Data Selection with Language Models for Mathematical Texts
Hamiltonicity of expanders: optimal bounds and applications
Leveraging Large Language Models to Boost Dafny’s Developers Productivity
Generative AI for Math: Part I—MathPile: A Billion-Token-Scale Pretraining Corpus for Math
PRER: Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent
TinyGSM: achieving >80% on GSM8k with small language models
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models (ReSTEM)
Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning
Universal Self-Consistency for Large Language Model Generation
Why Won’t OpenAI Say What the Q✱ Algorithm Is? Supposed AI breakthroughs are frequently veiled in secrecy, hindering scientific consensus
Positional Description Matters for Transformers Arithmetic
The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4
Implicit Chain-of-Thought Reasoning via Knowledge Distillation
Let Models Speak Ciphers: Multiagent Debate through Embeddings
OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text
Distinct neuronal representation of small and large numbers in the human medial temporal lobe
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
FIMO: A Challenge Formal Dataset for Automated Theorem Proving
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification
Testing GPT-4 with Wolfram Alpha and Code Interpreter plug-ins on math and science problems
Solid-body trajectoids shaped to roll along desired pathways
LeanDojo: Theorem Proving with Retrieval-Augmented Language Models
FERMAT: An Alternative to Accuracy for Numerical Reasoning
What Number Comes Next? The Encyclopedia of Integer Sequences Knows. The ‘Mathematical Equivalent to the FBI’s Voluminous Fingerprint Files’ Turns 50 This Year, With 362,765 Entries (and Counting)
How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model
Evaluating Transformer Language Models on Arithmetic Operations Using Number Decomposition
How well do Large Language Models perform in Arithmetic tasks?
ProofNet: Autoformalizing and Formally Proving Undergraduate-Level Mathematics
Solving math word problems with process & outcome-based feedback
What is my math transformer doing? – 3 results on interpretability and generalization
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning
Connecting the Scientific and Industrial Revolutions: The Role of Practical Mathematics
NaturalProver: Grounded Mathematical Proof Generation with Language Models
The sexes do not differ in general intelligence, but they do in some specifics
Impact of Pretraining Term Frequencies on Few-Shot Reasoning
Exact Number Concepts Are Limited to the Verbal Count Range
A Neural Network Solves and Generates Mathematics Problems by Program Synthesis: Calculus, Differential Equations, Linear Algebra, and More
What is the point of computers? A question for pure mathematicians
Scaling Language Models: Methods, Analysis & Insights from Training Gopher
MiniF2F: a cross-system benchmark for formal Olympiad-level mathematics
A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers
SymbolicGPT: A Generative Transformer Model for Symbolic Regression
Behavioral and Neuronal Representation of Numerosity Zero in the Crow
MathBERT: A Pre-Trained Model for Mathematical Formula Understanding
NaturalProofs: Mathematical Theorem Proving in Natural Language
Are NLP Models really able to Solve Simple Math Word Problems?
Measuring Mathematical Problem Solving With the MATH Dataset
TacticZero: Learning to Prove Theorems from Scratch with Deep Reinforcement Learning
Proof Artifact Co-training for Theorem Proving with Language Models
LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning
How the Slowest Computer Programs Illuminate Math’s Fundamental Limits: The goal of the ‘busy beaver’ game is to find the longest-running computer program. Its pursuit has surprising connections to some of the most profound questions and concepts in mathematics
Generative Language Modeling for Automated Theorem Proving
A Promising Path Towards Autoformalization and General Artificial Intelligence
Mathematical Reasoning via Self-supervised Skip-tree Training
Remembering John Conway’s FRACTRAN, a ridiculous, yet surprisingly deep language
Radical Solutions: French mathematician Évariste Galois lived a full life. When he wasn’t trying to overthrow the government, he was reinventing algebra
Learning to Prove Theorems by Learning to Generate Theorems
Generative Language Modeling for Automated Theorem Proving § Experiments
Do NLP Models Know Numbers? Probing Numeracy in Embeddings
Ternary circuits: why R=3 is not the Optimal Radix for Computation
Analysing Mathematical Reasoning Abilities of Neural Models
A randomized controlled trial of interleaved mathematics practice
Reinventing the Wheel: Discovering the Optimal Rolling Shape With PyTorch
Best Practices: Formal Proofs, the Fine Print and Side Effects
Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm
From boiling lead and black art: An essay on the history of mathematical typography
Program Induction by Rationale Generation: Learning to Solve and Explain Algebraic Word Problems
The Doodle Theorem, and Beyond: Colin Wright Juggles Euler, Doodling and Millennium Problems
A Relatively Small Turing Machine Whose Behavior Is Independent of Set Theory
Too good to be true: when overwhelming evidence fails to convince
Probabilistic Integration: A Role in Statistical Computation?
Prizes and Productivity: How Winning the Fields Medal Affects Scientific Output
The Misfortunes of a Trio of Mathematicians Using Computer Algebra Systems—Can We Trust in Them?
The case of the Case of Benny: Elucidating the influence of a landmark study in mathematics education
Finite time blowup for an averaged three-dimensional Navier-Stokes equation
Homotopy groups of suspended classifying spaces: An experimental approach
The Algebraic Combinatorial Approach for Low-Rank Matrix Completion
On the distribution of time-to-proof of mathematical conjectures
Probing the Improbable: Methodological Challenges for Risks with Low Probabilities and High Stakes
Béla Bollobás: Graphs Extremal and Random [Interview of Béla Bollobás by Y. K. Leong]
Comment on a Paper by Yucai Su On the Jacobian Conjecture (2005-12-30)
Online Convex Programming and Generalized Infinitesimal Gradient Ascent
The War of the Frogs and the Mice, or the Crisis of the Mathematische Annalen
On the computational complexity of the Jones and Tutte polynomials
Envisioning Information: chapter 5, ‘Color and Information’, pg83-86 [on Oliver Byrne’s color diagram version of Euclid’s Elements]
Discussion: John von Neumann—A Case Study of Scientific Creativity
Dynamical systems that sort lists, diagonalize matrices and solve linear programming problems
The Emergence of Princeton As a World Center for Mathematical Research, 1896–1939
Review of Yuri I. Manin Yu, A Course in Mathematical Logic 1997
Bi-continuous extensions of invertible combinatorial functions
Heaviside's Operational Calculus and the Attempts to Rigorise It
Benny’s conception of rules and answers in IPI mathematics
Fidelity in Mathematical Discourse: Is One and One Really Two?
Singular Extremals In Lawden’s Problem Of Optimal Rocket Flight
A Steepest-Ascent Method for Solving Optimum Programming Problems
Stable Predictor-Corrector Methods for Ordinary Differential Equations
The Printing of Mathematics: Aids for Authors and Editors and Rules for Compositors and Readers at the University Press, Oxford
An Essay On The Psychology Of Invention In The Mathematical Field
A More Symmetrical Fourier Analysis Applied to Transmission Problems
Extreme D&D DIY: Adventures in Hypergeometry, Procedural Generation, and Software Development (part 1)
Math: OpenAI API Can Do Some Math out of the Gate, but Most Math It Seems It Has to Learn. Many Times, the Numbers That It Spits out Are Just Random. However, including Different Priming Prompts Can Result in Decent Results.
Solid Objects: 16th-Century Geometric and Perspective Drawings
Optimized, Individualized Spaced Repetition in Hierarchical Knowledge Structures
Best-Of-n With Misaligned Reward Models for Math Reasoning
A Mentor Challenged Bright Math Students And Changed Their Lives
The Examples Are Indeed Extremely Simple on Purpose (otherwise It’s Hard to Communicate Efficiently What’s Happening to Non-Metamath Experts). That Being Said, We’re Still Pretty Far Away from IMOs; but This Is Definitely a Goal for Us, and One We’re Actively Working Towards!
2020-10-26-dyqz-rdataisbeautiful-iasked16kpeoplehowmanyholescertainobjectshave.html
1985-tukey-theprincetonmathematicscommunityinthe1930s-pmc41-interview.html
1980-euler-rationalmechanicsflexibleelasticbodies16381788.pdf
1966-mathematicalassociationofamerica-documentary-maavideoclassics2-johnvonneumanadocumentary.mkv
1931-ramsey-foundationsofmathematicsandotherlogicalessays.epub
http://math.andrej.com/2007/09/28/seemingly-impossible-functional-programs/
https://aperiodical.com/2019/09/reimagining-byrnes-euclid/
https://blog.ploeh.dk/2017/10/04/from-design-patterns-to-category-theory/
https://eli.thegreenplace.net/2023/demystifying-tuppers-formula/
https://marckhoury.github.io/blog/counterintuitive-properties-of-high-dimensional-space/
https://math.dartmouth.edu/~matc/MathDrama/reading/Hamming.html
https://mathoverflow.net/questions/19930/writing-papers-in-pre-latex-era
https://nunosempere.com/blog/2023/01/30/an-in-progress-experiment-to-test-how-laplace-s-rule-of/
https://paperswithcode.com/sota/math-word-problem-solving-on-math
https://pershmail.substack.com/p/questions-and-answers-about-multiplication
https://pro.univ-lille.fr/fileadmin/user_upload/pages_pros/lorenzo_ramero/CoursAG.pdf
https://terrytao.wordpress.com/2023/06/19/ai-anthology/#comment-678803
https://terrytao.wordpress.com/about/ai-generated-versions-of-the-ai-anthology-article/
https://terrytao.wordpress.com/career-advice/theres-more-to-mathematics-than-rigour-and-proofs/
https://thehighergeometer.wordpress.com/2023/08/09/no-order-10-projective-planes-via-sat/
https://thepalindrome.org/p/how-does-the-japanese-multiplication-work
https://worksinprogress.co/issue/how-mathematics-built-the-modern-world/
https://writings.stephenwolfram.com/2023/03/chatgpt-gets-its-wolfram-superpowers/
https://www.amazon.com/Turings-Cathedral-Origins-Digital-Universe/dp/1400075998/
https://www.deepakg.com/bringing-19th-century-ornamental-tile-illustrations-into-a-21st-century-web-app
https://www.kroneckerwallis.com/product/euclids-elements-completing-oliver-byrnes-work/
https://www.lesswrong.com/posts/ZwshvqiqCvXPsZEct/the-learning-theoretic-agenda-status-2023
https://www.lesswrong.com/posts/t9qvdjY5385MbzoYp/chatgpt-4-solved-all-the-gotcha-problems-i-posed-that
https://www.newyorker.com/culture/annals-of-inquiry/the-man-who-invented-fifteen-hundred-necktie-knots
https://www.novaspivack.com/science/we-have-discovered-a-new-pattern-in-the-prime-numbers-parallax-compression
https://www.npr.org/2023/11/08/1209932614/jungle-gym-playground-monkey-bars-maths-hinton-fourth-dimension
https://www.nytimes.com/2022/03/22/science/geometry-math-brain-primmates.html
https://www.nytimes.com/2023/12/29/science/puzzles-mechanical-miller.html
https://www.quantamagazine.org/a-new-agenda-for-low-dimensional-topology-20240222/
https://www.quantamagazine.org/a-team-of-math-proves-a-critical-link-between-addition-and-sets-20231206/
https://www.quantamagazine.org/amateur-mathematicians-find-fifth-busy-beaver-turing-machine-20240702/
https://www.quantamagazine.org/how-isaac-newton-discovered-the-binomial-power-series-20220831/
https://www.quantamagazine.org/how-mathematical-curves-power-cryptography-20220919/
https://www.quantamagazine.org/in-highly-connected-networks-theres-always-a-loop-20240607/
https://www.quantamagazine.org/in-new-math-proofs-artificial-intelligence-plays-to-win-20220307/
https://www.quantamagazine.org/mathematicians-find-hidden-structure-in-a-common-type-of-space-20230412/
https://www.quantamagazine.org/the-mysterious-math-of-billiards-tables-20240215/
https://www.quantamagazine.org/the-quest-to-decode-the-mandelbrot-set-maths-famed-fractal-20240126/
https://www.science.org/content/article/citation-cartels-help-some-mathematicians-and-their-universities-climb-rankings
https://www.theatlantic.com/magazine/archive/2016/03/the-math-revolution/426855/
https://www.unicode.org/notes/tn28/UTN28-PlainTextMath-v3.2.pdf
https://xenaproject.wordpress.com/2022/09/12/beyond-the-liquid-tensor-experiment/
MCTSr: Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMA-3-8B
GSM1k: A Careful Examination of Large Language Model Performance on Grade School Arithmetic
https://arxiv.org/abs/2405.00332#scale
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap
Tokenization counts: the impact of tokenization on arithmetic in frontier LLMs
Autonomous Data Selection with Language Models for Mathematical Texts
PRER: Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models (ReSTEM)
https://arxiv.org/abs/2312.06585#deepmind
OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
https://cr.yp.to/papers/pwccp-20230907.pdf
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification
LeanDojo: Theorem Proving with Retrieval-Augmented Language Models
https://arxiv.org/abs/2305.20050#openai
How well do Large Language Models perform in Arithmetic tasks?
https://arxiv.org/abs/2304.02015#alibaba
ProofNet: Autoformalizing and Formally Proving Undergraduate-Level Mathematics
http://neilsloane.com/doc/HIS50.pdf
NaturalProver: Grounded Mathematical Proof Generation with Language Models
https://arxiv.org/abs/2205.12910#allen
https://arxiv.org/abs/2205.11491#facebook
The sexes do not differ in general intelligence, but they do in some specifics
https://arxiv.org/abs/2204.02311#google
https://arxiv.org/abs/2202.01344#openai
A Neural Network Solves and Generates Mathematics Problems by Program Synthesis: Calculus, Differential Equations, Linear Algebra, and More
Scaling Language Models: Methods, Analysis & Insights from Training Gopher
https://arxiv.org/abs/2112.11446#deepmind
https://arxiv.org/abs/2110.14168#openai
How the Slowest Computer Programs Illuminate Math’s Fundamental Limits: The goal of the ‘busy beaver’ game is to find the longest-running computer program. Its pursuit has surprising connections to some of the most profound questions and concepts in mathematics
https://www.quantamagazine.org/how-the-slowest-computer-programs-illuminate-maths-fundamental-limits-20201210/
Generative Language Modeling for Automated Theorem Proving
https://arxiv.org/abs/2009.03393#openai
A randomized controlled trial of interleaved mathematics practice
/doc/psychology/spaced-repetition/2019-rohrer.pdf
https://www.c82.net/blog/?id=79
Homotopy groups of suspended classifying spaces: An experimental approach
Online Convex Programming and Generalized Infinitesimal Gradient Ascent
Envisioning Information: chapter 5, ‘Color and Information’, pg83-86 [on Oliver Byrne’s color diagram version of Euclid’s Elements]
/doc/design/typography/rubrication/1990-tufte-envisioninginformation-ch5-byrneseuclid.pdf
/doc/iq/high/smpy/1984-clements.pdf
Benny’s conception of rules and answers in IPI mathematics
An Essay On The Psychology Of Invention In The Mathematical Field
https://archive.org/details/eassayonthepsych006281mbp