Bibliography (5):

  1. Training Transformers Together

  2. SwAV: Unsupervised Learning of Visual Features by Contrasting Cluster Assignments

  3. ALBERT: A Lite BERT for Self-supervised Learning of Language Representations