Bibliography (4):

  1. https://openai.com/index/gpt-4-research/

  2. Introducing the next generation of Claude

  3. Proximal Policy Optimization Algorithms

  4. https://github.com/Cornell-RL/drpo