Bibliography (4):
CM3: A Causal Masked Multimodal Model of the Internet
https://github.com/facebookresearch/chameleon
Llama 2: Open Foundation and Fine-Tuned Chat Models
https://openai.com/index/gpt-4v-system-card/