Bibliography (6):

  1. Language Models are Unsupervised Multitask Learners

  2. Generating Long Sequences with Sparse Transformers

  3. https://mattmahoney.net/dc/textdata.html

  4. ImageNet Large Scale Visual Recognition Challenge

  5. Wikipedia Bibliography:

    1. MIDI

    2. CIFAR-10