Bibliography (4):
Attention Is All You Need
The Unusual Effectiveness of Averaging in GAN Training
LLaMA-2: Open Foundation and Fine-Tuned Chat Models
https://github.com/XuezheMax/megalodon