Bibliography (4):

  1. Microsoft COCO: Common Objects in Context

  2. Conceptual Captions: A Cleaned, Hypernymed, Image Alt-text Dataset For Automatic Image Captioning

  3. UNITER (code repository): https://github.com/ChenRocks/UNITER