Bibliography (16):

  1. Broken Neural Scaling Laws

  2. https://ethancaballero.github.io/

  3. https://scholar.google.com/citations?user=KvLJAf0AAAAJ

  4. https://x.com/ethanCaballero

  5. GPT-3: Language Models are Few-Shot Learners

  6. Scaling Language Models: Methods, Analysis & Insights from Training Gopher

  7. Chinchilla: Training Compute-Optimal Large Language Models

  8. Introducing Adept

  9. https://www.cnbc.com/2022/03/29/inflection-ai-reid-hoffmans-start-up-poaches-staff-from-google-meta.html

  10. https://bmk.sh/

  11. Attention Is All You Need

  12. Scaling Laws for Autoregressive Generative Modeling