What learning algorithm is in-context learning? Investigations with linear models
Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers
An Explanation of In-context Learning as Implicit Bayesian Inference
What Can Transformers Learn In-Context? A Case Study of Simple Function Classes
Schema-learning and rebinding as mechanisms of in-context learning and emergence
Wikipedia Bibliography:
https://en.wikipedia.org/wiki/Prompt_engineering#In-context_learning
https://en.wikipedia.org/wiki/Attention_(machine_learning)#Self-attention
https://en.wikipedia.org/wiki/Preconditioner#Preconditioning_in_optimization
https://en.wikipedia.org/wiki/Attention_(machine_learning)#Softmax_attention