Bibliography (63):

  1. https://x.com/DimitrisPapail/status/1678407512637284352

  2. https://github.com/lee-ny/teaching_arithmetic

  3. https://github.com/karpathy/nanoGPT

  4. Language Models are Unsupervised Multitask Learners

  5. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

  6. https://arxiv.org/pdf/2307.03381.pdf#page=3

  7. https://arxiv.org/pdf/2307.03381.pdf#page=4

  8. https://arxiv.org/pdf/2307.03381.pdf#page=5

  9. https://arxiv.org/pdf/2307.03381.pdf#page=7

  10. https://arxiv.org/pdf/2307.03381.pdf#page=8

  11. https://x.com/BlinkDL_AI/status/1677593798531223552

  12. https://arxiv.org/pdf/2307.03381.pdf#page=9

  13. https://arxiv.org/pdf/2307.03381.pdf#page=11

  14. https://arxiv.org/pdf/2307.03381.pdf#page=13

  15. https://arxiv.org/pdf/2307.03381.pdf#page=15

  16. https://arxiv.org/pdf/2307.03381.pdf#page=16

  17. https://arxiv.org/pdf/2307.03381.pdf#page=17

  18. https://arxiv.org/pdf/2307.03381.pdf#page=18

  19. https://arxiv.org/pdf/2307.03381.pdf#page=19

  20. https://arxiv.org/pdf/2307.03381.pdf#page=20

  21. https://arxiv.org/pdf/2307.03381.pdf#page=21

  22. https://arxiv.org/pdf/2307.03381.pdf#page=23

  23. https://arxiv.org/pdf/2307.03381.pdf#page=26

  24. https://arxiv.org/pdf/2307.03381.pdf#page=27

  25. https://arxiv.org/pdf/2307.03381.pdf#page=30

  26. https://arxiv.org/pdf/2307.03381.pdf#page=31

  27. FLAN: Scaling Instruction-Finetuned Language Models

  28. Attention Is All You Need

  29. Program Induction by Rationale Generation: Learning to Solve and Explain Algebraic Word Problems

  30. Training Verifiers to Solve Math Word Problems

  31. Show Your Work: Scratchpads for Intermediate Computation with Language Models

  32. MultiArith: Solving General Arithmetic Word Problems

  33. Neural Programmer-Interpreters

  34. Towards Synthesizing Complex Programs from Input-Output Examples

  35. Efficient Architecture Search by Network Transformation

  36. Investigating the Limitations of Transformers with Simple Arithmetic Tasks

  37. How well do Large Language Models perform in Arithmetic tasks?

  38. Impact of Pretraining Term Frequencies on Few-Shot Reasoning

  39. GPT-3: Language Models are Few-Shot Learners

  40. Least-to-Most Prompting Enables Complex Reasoning in Large Language Models

  41. Learning to Execute

  42. https://aclanthology.org/2021.emnlp-main.563.pdf

  43. There Once Was a Really Bad Poet, It Was Automated but You Didn’t Know It

  44. Limitations of Language Models in Arithmetic and Symbolic Induction

  45. Let’s Verify Step by Step

  46. Solving math word problems with process- and outcome-based feedback

  47. Do NLP Models Know Numbers? Probing Numeracy in Embeddings

  48. What is my math transformer doing? – 3 results on interpretability and generalization

  49. Linear algebra with transformers

  50. Solving Linear Algebra by Program Synthesis

  51. How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model

  52. The Shape of Learning Curves: a Review

  53. The Algebraic Combinatorial Approach for Low-Rank Matrix Completion

  54. Exploring Length Generalization in Large Language Models

  55. Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?

  56. 2023-lee-figure1-numberformattingforgpt2arithmetic.jpg

  57. https://arxiv.org/pdf/2307.03381.pdf#page=14

  58. GPT-3 Creative Fiction § BPEs

  59. https://github.com/openai/tiktoken

  60. Evaluating Transformer Language Models on Arithmetic Operations Using Number Decomposition

  61. It’s Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

  62. TinyStories: How Small Can Language Models Be and Still Speak Coherent English?

  63. Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens