Bibliography (3):

  1. https://github.com/openai/miniF2F

  2. Measuring Mathematical Problem Solving With the MATH Dataset

  3. https://openai.com/index/gpt-4-research/