BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned Approximations
Posterior sampling for multi-agent reinforcement learning: solving extensive games with imperfect information
AlphaZe∗∗: AlphaZero-like baselines for imperfect information games are surprisingly strong
DeepNash: Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization
Monte Carlo Neural Fictitious Self-Play: Approach to Approximate Nash equilibrium of Imperfect-Information Games
A Survey and Critique of Multiagent Deep Reinforcement Learning
Solving Imperfect-Information Games via Discounted Regret Minimization
ExIt-OOS: Towards Learning from Planning in Imperfect Information Games
Regret Minimization for Partially Observable Deep Reinforcement Learning
LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions
One Writer Enters International Competition to Play the World-Conquering Game That Redefines What It Means to Be a Geek (and a Person)
Artificial Intelligence Beats Eight World Champions at Bridge
2022-perolat-figure1b-deepnashstrategoselfplayarchitecture.png
https://intapi.sciendo.com/pdf/10.2478/ijasitels-2020-0003
https://www.reddit.com/r/reinforcementlearning/comments/cdwzp3/pluribus_superhuman_ai_for_multiplayer_poker/etwu82u/
DeepNash: Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
https%253A%252F%252Farxiv.org%252Fabs%252F2206.15378%2523deepmind.html
https%253A%252F%252Farxiv.org%252Fabs%252F2106.04615%2523deepmind.html
%252Fdoc%252Freinforcement-learning%252Fmodel%252F2010-silver.pdf.html
Wikipedia Bibliography: