Bibliography (5):

  1. MLP-Mixer: An all-MLP Architecture for Vision

  2. HyperNetworks

  3. Attention Is All You Need