Bibliography (5):
Proximal Policy Optimization Algorithms
Human-level performance in 3D multiplayer games with population-based reinforcement learning
https://www.youtube.com/watch?v=Xh-FKD0AAKE
Wikipedia Bibliography:
Reinforcement learning
Entropy (information theory)