Bibliography (5):

  1. https://github.com/salesforce/LAVIS/tree/main/projects/blip2

  2. ‘end-to-end’ directory

  3. BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

  4. Attention Is All You Need

  5. Flamingo: a Visual Language Model for Few-Shot Learning