Bibliography (4):

  1. MuZero: Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

  2. Procgen Benchmark: We’re releasing Procgen Benchmark, 16 simple-to-use procedurally-generated environments which provide a direct measure of how quickly a reinforcement learning agent learns generalizable skills

  3. Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

  4. Wikipedia Bibliography:

    1. Reinforcement learning