https://x.com/fredahshi/status/1579858716257460225
Training Verifiers to Solve Math Word Problems
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
https://github.com/google-research/url-nlp
GPT-3: Language Models are Few-Shot Learners
PaLM: Scaling Language Modeling with Pathways
Emergent Abilities of Large Language Models
InstructGPT: Training language models to follow instructions with human feedback
https://arxiv.org/pdf/2210.03057.pdf#page=17&org=google