Bibliography (12):

  1. https://academictorrents.com/details/0d366035664fdf51cfbe9f733953ba325776e667/tech

  2. EleutherAI

  3. https://pile.eleuther.ai/

  4. https://openwebtext2.readthedocs.io/en/latest/

  5. Compressive Transformers for Long-Range Sequence Modeling

  6. Language Models are Unsupervised Multitask Learners

  7. GPT-3: Language Models are Few-Shot Learners