The Goldilocks zone: Towards better understanding of neural network loss landscapes
Figure (Liu 2022, Figure 1): Goldilocks zone of initialization and relationship to grokking
Figure (Liu 2022, Figure 7): Transformer grokking vs. weight norm for modular addition
Progress measures for grokking via mechanistic interpretability
Wikipedia Bibliography: