The MineRL 2019 Competition on Sample Efficient Reinforcement Learning using Human Priors
The MineRL 2020 Competition on Sample Efficient Reinforcement Learning using Human Priors
Deep DPG (DDPG): Continuous control with deep reinforcement learning
D4PG: Distributed Distributional Deterministic Policy Gradients
SPR: Data-Efficient Reinforcement Learning with Self-Predictive Representations
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Rainbow: Combining Improvements in Deep Reinforcement Learning
Learning to Generalize with Object-centric Agents in the Open World Survival Game Crafter
Improving Variational Inference with Inverse Autoregressive Flow
Figure 1 (Hafner, 2023): DreamerV3 outperforms baselines in sample efficiency on many tasks
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos