Language Models are Unsupervised Multitask Learners
GPT-2: 1.5B Release
https://old.reddit.com/user/gwern
Gwern Branwen Creating Essays on Gwern.net
https://x.com/theshawwn
https://old.reddit.com/user/shawwwn
A domain-specific supercomputer for training deep neural networks