Bibliography (7):
https://jiachenzhu.github.io/DyT/
https://github.com/jiachenzhu/DyT
Attention Is All You Need
Layer Normalization
Wikipedia Bibliography:
https://en.wikipedia.org/wiki/Normalization_(machine_learning)#Layer_normalizationโโ:
https://en.wikipedia.org/wiki/Normalization_(machine_learning)#Layer_normalization
https://en.wikipedia.org/wiki/Tanhโโ:
https://en.wikipedia.org/wiki/Tanh
Weak supervision ยง Semi-supervised learning