Bibliography (7):

MuseNet: a deep neural network that can generate 4-minute musical compositions with 10 different instruments, and can combine styles from country to Mozart to the Beatles
Language Models are Unsupervised Multitask Learners
Generating Long Sequences with Sparse Transformers
https://mattmahoney.net/dc/textdata.html
ImageNet Large Scale Visual Recognition Challenge
Wikipedia Bibliography:
1. MIDI
2. CIFAR-10