Bibliography (7):

  1. MAE: Masked Autoencoders Are Scalable Vision Learners

  2. Contrastive Representation Learning: A Framework and Review

  3. ImageNet Large Scale Visual Recognition Challenge

  4. https://paperswithcode.com/dataset/ade20k