Bibliography (4):
https://arxiv.org/abs/1908.02265
https://arxiv.org/abs/1908.07490
UNITER: UNiversal Image-TExt Representation Learning
Wikipedia Bibliography:
https://en.wikipedia.org/wiki/Visual_semantics :
https://en.wikipedia.org/wiki/Visual_semantics