Bibliography (5):

  1. Measuring Mathematical Problem Solving With the MATH Dataset

  2. Training Verifiers to Solve Math Word Problems

  3. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

  4. AutoMathText (Hugging Face dataset), https://huggingface.co/datasets/math-ai/AutoMathText

  5. AutoMathText (GitHub repository), https://github.com/yifanzhang-pro/AutoMathText