Wikipedia Bibliography:

https://en.wikipedia.org/wiki/Prompt_engineering#In-context_learning :

https://en.wikipedia.org/wiki/Prompt_engineering#In-context_learning
https://en.wikipedia.org/wiki/Attention_(machine_learning)#Self-attention :

https://en.wikipedia.org/wiki/Attention_(machine_learning)#Self-attention
Gradient descent
https://en.wikipedia.org/wiki/Linear_regression :

https://en.wikipedia.org/wiki/Linear_regression
Independence (probability theory)
https://en.wikipedia.org/wiki/Preconditioner#Preconditioning_in_optimization :

https://en.wikipedia.org/wiki/Preconditioner#Preconditioning_in_optimization
https://en.wikipedia.org/wiki/Attention_(machine_learning)#Softmax_attention :

https://en.wikipedia.org/wiki/Attention_(machine_learning)#Softmax_attention