Bibliography (5):
Training Verifiers to Solve Math Word Problems
The Reversal Curse: LLMs trained on A-is-B fail to learn B-is-A
GPT-3: Language Models are Few-Shot Learners
https://github.com/ML-GSAI/SMDM
Wikipedia Bibliography:
Neural scaling law :
https://en.wikipedia.org/wiki/Neural_scaling_law