Bibliography (7):

  1. https://jiachenzhu.github.io/DyT/

  2. https://github.com/jiachenzhu/DyT

  3. Attention Is All You Need

  4. Layer Normalization