PaLM: Scaling Language Modeling with Pathways
Training Verifiers to Solve Math Word Problems
Language Models are Multilingual Chain-of-Thought Reasoners
TyDiQA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages
MMLU: Measuring Massive Multitask Language Understanding