Bibliography (7):

  1. Attention Is All You Need

  2. RoBERTa: A Robustly Optimized BERT Pretraining Approach

  3. ImageNet Large Scale Visual Recognition Challenge

  4. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks