Bibliography (5):
Neural Ordinary Differential Equations
Linear Transformers Are Secretly Fast Weight Programmers
Attention Is All You Need
Wikipedia Bibliography:
https://en.wikipedia.org/wiki/Oja%27s_rule :
https://en.wikipedia.org/wiki/Oja%27s_rule
Principal component analysis