Bibliography (13):

A domain-specific supercomputer for training deep neural networks
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
MuZero: Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
Discovering Reinforcement Learning Algorithms
DeepMind Lab
https://mujoco.org/
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference
Playing Atari with Deep Reinforcement Learning
Wikipedia Bibliography: