Bibliography (6):

  1. The Unusual Effectiveness of Averaging in GAN Training

  2. LLaMA: Open and Efficient Foundation Language Models

  3. https://github.com/mnoukhov/elastic-reset