Bibliography (5):
Attention Is All You Need
Long Range Arena (LRA): A Benchmark for Efficient Transformers
ImageNet Large Scale Visual Recognition Challenge
https://mattmahoney.net/dc/textdata.html
https://github.com/NVIDIA/transformer-ls