Bibliography (9):

  1. Attention Is All You Need

  2. Revisiting Unreasonable Effectiveness of Data in Deep Learning Era

  3. ImageNet Large Scale Visual Recognition Challenge

  4. CIFAR-10 and CIFAR-100 Datasets

  5. Vision Transformer: An Image is Worth 16×16 Words: Transformers for Image Recognition at Scale

  6. https://research.google/blog/transformers-for-image-recognition-at-scale/

  7. Not All Images are Worth 16×16 Words: Dynamic Transformers for Efficient Image Recognition