Bibliography (8):

  1. GPT-3: Language Models are Few-Shot Learners

  2. MMLU: Measuring Massive Multitask Language Understanding

  3. PaLM: Scaling Language Modeling with Pathways

  4. Measuring Mathematical Problem Solving With the MATH Dataset

  5. PubMedQA: A Dataset for Biomedical Research Question Answering

  6. Bigscience/bloom

  7. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

  8. Wikipedia Bibliography:

    1. LaTeX