Bibliography (5):

  1. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

  2. A Review of the Gumbel-max Trick and its Extensions for Discrete Stochasticity in Machine Learning

  3. MuZero: Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model