Bibliography (5):
Microsoft COCO: Common Objects in Context
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
A Simple Framework for Contrastive Learning of Visual Representations
https://github.com/OFA-Sys/OFA
Wikipedia Bibliography:
Fréchet inception distance