- See Also
- Links
- “Human-AI Coordination via Human-Regularized Search and Learning”, Et Al 2022
- “Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi”, Et Al 2022
- “On-the-fly Strategy Adaptation for Ad-hoc Agent Coordination”, Et Al 2022
- “Any-Play: An Intrinsic Augmentation for Zero-Shot Coordination”, 2022
- “Conditional Imitation Learning for Multi-Agent Games”, Et Al 2022
- “Reinforcement Learning on Human Decision Models for Uniquely Collaborative AI Teammates”, 2021
- “Scalable Online Planning via Reinforcement Learning Fine-Tuning”, Et Al 2021
- “Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi”, Et Al 2021
- “Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings”, Et Al 2021
- “On the Critical Role of Conventions in Adaptive Human-AI Collaboration”, Et Al 2021
- “Off-Belief Learning”, Et Al 2021
- “Continuous Coordination As a Realistic Scenario for Lifelong Learning”, Et Al 2021
- “The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games”, Et Al 2021
- “Theory of Mind for Deep Reinforcement Learning in Hanabi”, Et Al 2021
- “Generating and Adapting to Diverse Ad-Hoc Cooperation Agents in Hanabi”, Et Al 2020
- “Evaluating the Rainbow DQN Agent in Hanabi With Unseen Partners”, Et Al 2020
- “"Other-Play" for Zero-Shot Coordination”, Et Al 2020
- “Improving Policies via Search in Cooperative Partially Observable Games”, Et Al 2019
- “Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning”, 2019
- “The Hanabi Challenge: A New Frontier for AI Research”, Et Al 2019
- “Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning”, Et Al 2018
- Wikipedia
- Miscellaneous
See Also
Links
“Human-AI Coordination via Human-Regularized Search and Learning”, 2022-10-11
“Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi”, 2022-03-22
“On-the-fly Strategy Adaptation for ad-hoc Agent Coordination”, 2022-03-08
“Any-Play: An Intrinsic Augmentation for Zero-Shot Coordination”, 2022-01-28
“Conditional Imitation Learning for Multi-Agent Games”, 2022-01-05
“Reinforcement Learning on Human Decision Models for Uniquely Collaborative AI Teammates”, 2021-11-18
“Scalable Online Planning via Reinforcement Learning Fine-Tuning”, 2021-09-30
“Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi”, 2021-07-15
“Learned Belief Search: Efficiently Improving Policies in Partially Observable Settings”, 2021-06-16
“On the Critical Role of Conventions in Adaptive Human-AI Collaboration”, 2021-04-07
“Off-Belief Learning”, 2021-03-06
“Continuous Coordination As a Realistic Scenario for Lifelong Learning”, 2021-03-04
“The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games”, 2021-03-02
“Theory of Mind for Deep Reinforcement Learning in Hanabi”, 2021-01-22
“Generating and Adapting to Diverse Ad-Hoc Cooperation Agents in Hanabi”, 2020-04-28
“Evaluating the Rainbow DQN Agent in Hanabi with Unseen Partners”, 2020-04-28
“"Other-Play" for Zero-Shot Coordination”, 2020-03-06
“Improving Policies via Search in Cooperative Partially Observable Games”, 2019-12-05
“Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning”, 2019-12-04
“The Hanabi Challenge: A New Frontier for AI Research”, 2019-02-01
“Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning”, 2018-11-04