Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Decision Transformer: Reinforcement Learning via Sequence Modeling
The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games
Trust Region Policy Optimization in Multi-Agent Reinforcement Learning