Bibliography (7):

  1. MuZero: Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

  2. https://mujoco.org/

  3. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor