“Why Generalization in RL Is Difficult: Epistemic POMDPs and Implicit Partial Observability [Blog]” (RL exploration, meta-learning, Bayes)