-
Value Iteration Networks
-
The Predictron: End-To-End Learning and Planning
-
Value Prediction Network
-
TreeQN & ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning
-
MuZero: Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model
-
Proper Value Equivalence
-