-
→ “end-to-end” tag
-
→ Attention Is All You Need
-
→ fully-connected#mlp-mixer
[Transclude the forward-link's context]
-
→ MLP-Mixer: An all-MLP Architecture for Vision
-
→ DETR: End-to-End Object Detection with Transformers
-
→ Focal Loss for Dense Object Detection
-
→ Mask R-CNN
-
→ Deep Residual Learning for Image Recognition
-
→ Training data-efficient image transformers & distillation through attention
-
→ DINO: Emerging Properties in Self-Supervised Vision Transformers
-