Bibliography (9):

  1. Language Models are Unsupervised Multitask Learners

  2. GPT-2: 1.5B Release

  3. https://old.reddit.com/user/gwern

  4. Gwern Branwen Creating Essays on Gwern.net

  5. https://x.com/theshawwn

  6. https://old.reddit.com/user/shawwwn

  7. A domain-specific supercomputer for training deep neural networks