Bibliography (5):

  1. Microsoft COCO: Common Objects in Context

  2. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

  3. A Simple Framework for Contrastive Learning of Visual Representations

  4. https://github.com/OFA-Sys/OFA

  5. Wikipedia Bibliography:

    1. Fréchet inception distance