Bibliography (7):

  1. Attention Is All You Need

  2. MLP-Mixer: An all-MLP Architecture for Vision

  3. An Image is Worth 16×16 Words: Transformers for Image Recognition at Scale

  4. https://github.com/okojoalg/raft-mlp