Bibliography (3):

  1. Attention Is All You Need

  2. ‘end-to-end’ directory

  3. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding