Bibliography (5):

  1. Proximal Policy Optimization Algorithms

  2. Human-level performance in 3D multiplayer games with population-based reinforcement learning

  3. https://www.youtube.com/watch?v=Xh-FKD0AAKE