Bibliography (9):

  1. The Value Equivalence Principle for Model-Based Reinforcement Learning

  2. Value Iteration Networks

  3. The Predictron: End-To-End Learning and Planning

  4. Value Prediction Network

  5. TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning

  6. MuZero: Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

  7. Proper Value Equivalence