Bibliography (3):
https://github.com/openai/miniF2F
Measuring Mathematical Problem Solving With the MATH Dataset
https://openai.com/index/gpt-4-research/