https://academictorrents.com/details/0d366035664fdf51cfbe9f733953ba325776e667/tech
EleutherAI
https://pile.eleuther.ai/
https://openwebtext2.readthedocs.io/en/latest/
Compressive Transformers for Long-Range Sequence Modeling
Language Models are Unsupervised Multitask Learners
GPT-3: Language Models are Few-Shot Learners
The Pile: An 800GB Dataset of Diverse Text for Language Modeling