Bibliography (6):

The Unusual Effectiveness of Averaging in GAN Training
LLaMa-1: Open and Efficient Foundation Language Models
https://github.com/mnoukhov/elastic-reset
Wikipedia Bibliography: