Bibliography (7):

  1. Contrastive Representation Learning: A Framework and Review

  2. Deep Residual Learning for Image Recognition

  3. An Image is Worth 16×16 Words: Transformers for Image Recognition at Scale

  4. MLP-Mixer: An all-MLP Architecture for Vision

  5. Reaching 80% Zero-Shot Accuracy with OpenCLIP: ViT-G/14 Trained on LAION-2B

  6. ImageNet Large Scale Visual Recognition Challenge

  7. ObjectNet: A Large-Scale Bias-Controlled Dataset for Pushing the Limits of Object Recognition Models