Bibliography:

  1. ‘hidden-information game’ tag

  2. JaxMARL: Multi-Agent RL Environments in JAX

  3. Human-AI Coordination via Human-Regularized Search and Learning

  4. Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi

  5. On-the-fly Strategy Adaptation for ad-hoc Agent Coordination

  6. Any-Play: An Intrinsic Augmentation for Zero-Shot Coordination

  7. Conditional Imitation Learning for Multi-Agent Games

  8. Reinforcement Learning on Human Decision Models for Uniquely Collaborative AI Teammates

  9. Scalable Online Planning via Reinforcement Learning Fine-Tuning

  10. Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi

  11. Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings

  12. On the Critical Role of Conventions in Adaptive Human-AI Collaboration

  13. Off-Belief Learning

  14. Continuous Coordination As a Realistic Scenario for Lifelong Learning

  15. The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games

  16. Theory of Mind for Deep Reinforcement Learning in Hanabi

  17. Evaluating the Rainbow DQN Agent in Hanabi with Unseen Partners

  18. Generating and Adapting to Diverse Ad-Hoc Cooperation Agents in Hanabi

  19. "Other-Play" for Zero-Shot Coordination

  20. Improving Policies via Search in Cooperative Partially Observable Games

  21. Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning

  22. The Hanabi Challenge: A New Frontier for AI Research

  23. Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning

  24. State-Of-The-Art Hanabi Bots + Simulation Framework in Rust

  25. design#future-tag-features

    [Transclude the forward-link's context]

  26. JaxMARL: Multi-Agent RL Environments in JAX

  27. https%253A%252F%252Farxiv.org%252Fabs%252F2311.10090.html

  28. The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games

  29. https%253A%252F%252Farxiv.org%252Fabs%252F2103.01955.html