Bibliography (4):

  1. MuZero: Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

  2. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

  3. https://github.com/YeWR/EfficientZero

  4. Wikipedia Bibliography:

    1. Monte Carlo tree search