A domain-specific supercomputer for training deep neural networks
IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
MuZero: Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference
Wikipedia Bibliography: