- Proximal Policy Optimization Algorithms
- Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
- Evolution Strategies as a Scalable Alternative to Reinforcement Learning
- Openai/gym: A Toolkit for Developing and Comparing Reinforcement Learning Algorithms.
- https://mujoco.org/
-