Bibliography (5):

  1. Chinchilla: Training Compute-Optimal Large Language Models

  2. https://arxiv.org/pdf/2209.14958#page=30&org=deepmind

  3. https://arxiv.org/pdf/2209.14958#page=4&org=deepmind

  4. https://arxiv.org/pdf/2209.14958#page=3&org=deepmind

  5. https://arxiv.org/pdf/2209.14958#page=46&org=deepmind