Bibliography (3):

  1. https://openai.com/index/gpt-4-research/

  2. Proximal Policy Optimization Algorithms

  3. https://github.com/OpenBMB/UltraFeedback