Bibliography (5):

  1. https://research.google/blog/scaling-vision-with-sparse-mixture-of-experts/

  2. https://github.com/google-research/vmoe

  3. Vision Transformer: An Image is Worth 16×16 Words: Transformers for Image Recognition at Scale

  4. Revisiting Unreasonable Effectiveness of Data in Deep Learning Era

  5. ImageNet Large Scale Visual Recognition Challenge