Bibliography (6):
Attention Is All You Need
GPT-3: Language Models are Few-Shot Learners
GPT-3 Creative Fiction § BPEs
Language Models are Unsupervised Multitask Learners
Draft Report on AI Timelines
Wikipedia Bibliography:
Neural scaling law :
https://en.wikipedia.org/wiki/Neural_scaling_law