Bibliography (7):

  1. Proximal Policy Optimization Algorithms

  2. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

  3. Evolution Strategies as a Scalable Alternative to Reinforcement Learning

  4. openai/gym: A Toolkit for Developing and Comparing Reinforcement Learning Algorithms

  5. https://mujoco.org/