Bibliography (6):

  1. https://x.com/avisingh599/status/1734603680933192089

  2. Reinforced Self-Training (ReST) for Language Modeling

  3. Measuring Mathematical Problem Solving With the MATH Dataset

  4. PaLM: Scaling Language Modeling with Pathways