Bibliography (5):

  1. Attention Is All You Need

  2. Long Range Arena (LRA): A Benchmark for Efficient Transformers

  3. ImageNet Large Scale Visual Recognition Challenge

  4. https://mattmahoney.net/dc/textdata.html

  5. https://github.com/NVIDIA/transformer-ls