Bibliography (3):

  1. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

  2. MuZero: Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

  3. https://www.lesswrong.com/tag/aixi