Bibliography (5):

Agent57: Outperforming the Atari Human Benchmark
Playing Atari with Deep Reinforcement Learning
R2D2: Recurrent Experience Replay in Distributed Reinforcement Learning
MuZero: Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Wikipedia Bibliography:
1. Reinforcement learning