-
https://github.com/socialfoundations/tttlm
-
Large-Scale Retrieval for Reinforcement Learning
-
In-Context Pretraining (ICP): Language Modeling Beyond Document Boundaries
-
GPT-3 Creative Fiction § Prompts As Programming
-
https://github.com/EleutherAI/The-Pile
-
The Pile: An 800GB Dataset of Diverse Text for Language Modeling
-
EleutherAI/gpt-neo: An Implementation of Model Parallel GPT-2 and GPT-3-Style Models Using the Mesh-TensorFlow Library.
-
Recurrent Neural Network Based Language Model § Dynamic Evaluation
-
Language Models are Unsupervised Multitask Learners
-
2023-hardt-figure6-perplexitiesdecreasewhentrainingonincreasinglymoreneighborsusinggpt2onthepile.jpg
-