Revisiting Unreasonable Effectiveness of Data in Deep Learning Era
Vision Transformer: An Image is Worth 16ร16 Words: Transformers for Image Recognition at Scale
https://research.google/blog/transformers-for-image-recognition-at-scale/
Not All Images are Worth 16ร16 Words: Dynamic Transformers for Efficient Image Recognition