Human-AI Coordination via Human-Regularized Search and Learning
Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi
On-the-fly Strategy Adaptation for ad-hoc Agent Coordination
Any-Play: An Intrinsic Augmentation for Zero-Shot Coordination
Reinforcement Learning on Human Decision Models for Uniquely Collaborative AI Teammates
Scalable Online Planning via Reinforcement Learning Fine-Tuning
Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi
Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings
On the Critical Role of Conventions in Adaptive Human-AI Collaboration
Continuous Coordination As a Realistic Scenario for Lifelong Learning
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
Evaluating the Rainbow DQN Agent in Hanabi with Unseen Partners
Generating and Adapting to Diverse Ad-Hoc Cooperation Agents in Hanabi
Improving Policies via Search in Cooperative Partially Observable Games
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning
Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
State-Of-The-Art Hanabi Bots + Simulation Framework in Rust
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
Wikipedia Bibliography: