Bibliography:
Language Models are Unsupervised Multitask Learners
ConnorJL/GPT2: An Implementation of Training for GPT-2, Supports TPUs
https://github.com/ConnorJL/GPT2/tree/master/samples
https://github.com/ConnorJL/GPT2/tree/master/samples
A domain-specific supercomputer for training deep neural networks
Wikipedia Bibliography:
OpenAI