Bibliography (6):

  1. https://x.com/NeelNanda5/status/1616590960066203648

  2. https://www.lesswrong.com/posts/N6WM6hs7RQMKDhYjB/a-mechanistic-interpretability-analysis-of-grokking?commentId=TMTrScsM5sTErwXtm

  3. Grokking: Generalization Beyond Overfitting On Small Algorithmic Datasets