Bibliography (14):

  1. https://deepmind.google/discover/blog/language-modelling-at-scale-gopher-ethical-considerations-and-retrieval/

  2. https://x.com/drjwrae/status/1488557776754417664

  3. Attention Is All You Need

  4. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

  5. TruthfulQA: Measuring How Models Mimic Human Falsehoods

  6. GPT-J-6B: 6B JAX-Based Transformer

  7. Language Models are Unsupervised Multitask Learners

  8. T5: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

  9. GPT-3: Language Models are Few-Shot Learners

  10. https://arxiv.org/pdf/2112.11446#page=81&org=deepmind

  11. A Multi-Level Attention Model for Evidence-Based Fact Checking

  12. FEVER: a large-scale dataset for Fact Extraction and VERification

  13. BPEs: Neural Machine Translation of Rare Words with Subword Units

  14. GPT-1: Improving Language Understanding by Generative Pre-Training ยง Model specifications