Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling
Language Models are Unsupervised Multitask Learners
The LAMBADA dataset: Word prediction requiring a broad discourse context
LLaMa-1: Open and Efficient Foundation Language Models
PaLM: Scaling Language Modeling with Pathways
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models