Bibliography (6):
Language Models are Unsupervised Multitask Learners
Generating Long Sequences with Sparse Transformers
https://mattmahoney.net/dc/textdata.html
ImageNet Large Scale Visual Recognition Challenge
Wikipedia Bibliography:
MIDI
CIFAR-10