Foundational Challenges in Assuring Alignment and Safety of Large Language Models
From reinforcement learning to agency: Frameworks for understanding basal cognition
Classical Sorting Algorithms as a Model of Morphogenesis: self-sorting arrays reveal unexpected competencies in a minimal model of basal intelligence
PRER: Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent
Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia
Large Language Models can Strategically Deceive their Users when Put Under Pressure
Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning
Let Models Speak Ciphers: Multiagent Debate through Embeddings
AI Deception: A Survey of Examples, Risks, and Potential Solutions
Diversifying AI: Towards Creative Chess with AlphaZero (AZdb)
Hoodwinked: Deception and Cooperation in a Text-Based Game for Language Models
Combining Human Expertise with Artificial Intelligence: Experimental Evidence from Radiology
Posterior sampling for multi-agent reinforcement learning: solving extensive games with imperfect information
Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Multi-Party Chat (MultiLIGHT): Conversational Agents in Group Settings with Humans and Models
Off-the-Grid MARL (OG-MARL): Datasets with Baselines for Offline Multi-Agent Reinforcement Learning
Learning to Control and Coordinate Mixed Traffic Through Robot Vehicles at Complex and Unsignalized Intersections
CICERO: Human-level play in the game of Diplomacy by combining language models with strategic reasoning
Over-communicate no more: Situated RL agents learn concise communication protocols
Human-AI Coordination via Human-Regularized Search and Learning
Game Theoretic Rating in N-player general-sum games with Equilibria
Modeling Bounded Rationality in Multi-Agent Simulations Using Rationally Inattentive Reinforcement Learning
Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members
Social Simulacra: Creating Populated Prototypes for Social Computing Systems
DeepNash: Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
MAT: Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization
Emergent Bartering Behavior in Multi-Agent Reinforcement Learning
Hidden Agenda: a Social Deduction Game with Diverse Learned Equilibria
Finding General Equilibria in Many-Agent Economic Simulations Using Deep Reinforcement Learning
Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination
Modeling Strong and Human-Like Gameplay with KL-Regularized Search
Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Collective Intelligence for Deep Learning: A Survey of Recent Developments
Learning to Ground Multi-Agent Communication with Autoencoders
Meta-learning, social cognition and consciousness in brains and machines
Trust Region Policy Optimization in Multi-Agent Reinforcement Learning
WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU
The AI Economist: Optimal Economic Policy Design via Two-level Deep Reinforcement Learning
Megaverse: Simulating Embodied Agents at One Million Experiences per Second
Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
From Motor Control to Team Play in Simulated Humanoid Football
baller2vec++: A Look-Ahead Multi-Entity Transformer For Modeling Coordinated Agents
Neural Tree Expansion for Multi-Robot Planning in Non-Cooperative Environments
Asymmetric self-play for automatic goal discovery in robotic manipulation
baller2vec: A Multi-Entity Transformer For Multi-Agent Spatiotemporal Modeling
UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers
Towards Playing Full MOBA Games with Deep Reinforcement Learning
TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
Reinforcement Learning for Optimization of COVID-19 Mitigation policies
Human-Level Performance in No-Press Diplomacy via Equilibrium Search
Emergent Social Learning via Multi-agent Reinforcement Learning
ReBeL: Combining Deep Reinforcement Learning and Search for Imperfect-Information Games
Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions [blog]
One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control
Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions
Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
Approximate exploitability: Learning a best response in large games
Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and their Solutions
Social diversity and social preferences in mixed-motive reinforcement learning
Effective Diversity in Population Based Reinforcement Learning
Smooth markets: A basic mechanism for organizing gradient-based learners
microbatchGAN: Stimulating Diversity with Multi-Adversarial Discrimination
Increasing Generality in Machine Learning through Procedural Content Generation
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Grandmaster level in StarCraft II using multi-agent reinforcement learning
Emergent Tool Use from Multi-Agent Interaction § Surprising behavior
A Review of Cooperative Multi-Agent Deep Reinforcement Learning
Hierarchical Decision Making by Generating and Following Natural Language Instructions
Human-level performance in 3D multiplayer games with population-based reinforcement learning
AI-GAs: AI-generating algorithms, an alternate paradigm for producing general artificial intelligence
Adversarial Policies: Attacking Deep Reinforcement Learning
LIGHT: Learning to Speak and Act in a Fantasy Text Adventure Game
Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research
Hierarchical Reinforcement Learning for Multi-agent MOBA Game
Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions
Continual Match Based Training in Pommerman: Technical Report
Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning
A Survey and Critique of Multiagent Deep Reinforcement Learning
Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation
Fully Distributed Multi-Robot Collision Avoidance via Deep Reinforcement Learning for Safe and Efficient Navigation in Complex Scenarios
Human-level performance in first-person multiplayer games with population-based deep reinforcement learning
Construction of arbitrarily strong amplifiers of natural selection using evolutionary graph theory
Adaptive Mechanism Design: Learning to Promote Cooperation
Sim-to-Real Optimization of Complex Real World Mobile Network with Imperfect Information via Deep Reinforcement Learning from Self-play
Trust-Aware Decision Making for Human-Robot Collaboration: Model Learning and Planning
LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions
CAN: Creative Adversarial Networks, Generating "Art" by Learning About Styles and Deviating from Style Norms
Supervision via Competition: Robot Adversaries for Learning Tasks
Reflective Oracles: A Foundation for Classical Game Theory
Homo Moralis-Preference Evolution Under Incomplete Information and Assortative Matching
If multi-agent learning is the answer, what is the question?
Market-Based Reinforcement Learning in Partially Observable Worlds
Computer-Aided Gas Pipeline Operation Using Genetic Algorithms And Rule Learning
The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games
Efficient Large-Scale Fleet Management via Multi-Agent Deep Reinforcement Learning
One Writer Enters International Competition to Play the World-Conquering Game That Redefines What It Means to Be a Geek (and a Person)
The Pommerman Team Competition Or: How We Learned to Stop Worrying and Love the Battle
New Winning Strategies for the Iterated Prisoner’s Dilemma
How Much Compute Was Used to Train DeepMind's Generally Capable Agents?
DeepMind: Generally Capable Agents Emerge from Open-Ended Play
Artificial Intelligence Beats Eight World Champions at Bridge
Open-Ended Learning Leads to Generally Capable Agents [Video]
https://deepmind.google/discover/blog/learning-robust-real-time-cultural-transmission-without-human-data/
https://research.facebook.com/publications/control-strategies-for-physically-simulated-characters-performing-two-player-competitive-sports/
https://research.google/blog/introducing-google-research-football-a-novel-reinforcement-learning-environment/
https://research.google/blog/leveraging-machine-learning-for-game-development/
https://www.lesswrong.com/posts/65qmEJHDw3vw69tKm/proposal-scaling-laws-for-rl-generalization
https://www.lesswrong.com/posts/FbSAuJfCxizZGpcHc/interpreting-the-learning-of-deceit
https://www.lesswrong.com/posts/bwyKCQD7PFWKhELMr/by-default-gpts-think-in-plain-sight#zfzHshctWZYo8JkLe
https://www.nature.com/articles/s41467-020-19244-4#deepmind
https://www.nature.com/articles/s41598-019-45619-9#deepmind
https://www.quantamagazine.org/computers-evolve-a-new-path-toward-human-intelligence-20191106/
https://www.reddit.com/r/reinforcementlearning/comments/cdwzp3/pluribus_superhuman_ai_for_multiplayer_poker/etwu82u/
PRER: Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent
https%253A%252F%252Fwww.nature.com%252Farticles%252Fs41467-023-42875-2%2523deepmind.html
Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning
Diversifying AI: Towards Creative Chess with AlphaZero (AZdb)
https%253A%252F%252Farxiv.org%252Fabs%252F2308.09175%2523deepmind.html
Hoodwinked: Deception and Cooperation in a Text-Based Game for Language Models
Combining Human Expertise with Artificial Intelligence: Experimental Evidence from Radiology
https%253A%252F%252Fwww.nber.org%252Fpapers%252Fw31422.html
Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
https%253A%252F%252Farxiv.org%252Fabs%252F2304.13653%2523deepmind.html
CICERO: Human-level play in the game of Diplomacy by combining language models with strategic reasoning
%252Fdoc%252Freinforcement-learning%252Fimperfect-information%252Fdiplomacy%252F2022-bakhtin.pdf.html
Modeling Bounded Rationality in Multi-Agent Simulations Using Rationally Inattentive Reinforcement Learning
https%253A%252F%252Fopenreview.net%252Fforum%253Fid%253DDY1pMrmDkm.html
Social Simulacra: Creating Populated Prototypes for Social Computing Systems
DeepNash: Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
https%253A%252F%252Farxiv.org%252Fabs%252F2206.15378%2523deepmind.html
Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
MAT: Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
https%253A%252F%252Farxiv.org%252Fabs%252F2202.07415%2523deepmind.html
Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination
https%253A%252F%252Farxiv.org%252Fabs%252F2112.11701%2523tencent.html
https%253A%252F%252Farxiv.org%252Fabs%252F2112.03178%2523deepmind.html
From Motor Control to Team Play in Simulated Humanoid Football
https%253A%252F%252Farxiv.org%252Fabs%252F2105.12196%2523deepmind.html
baller2vec++: A Look-Ahead Multi-Entity Transformer For Modeling Coordinated Agents
https%253A%252F%252Farxiv.org%252Fabs%252F2012.05672%2523deepmind.html
Towards Playing Full MOBA Games with Deep Reinforcement Learning
https%253A%252F%252Farxiv.org%252Fabs%252F2011.12692%2523tencent.html
TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning
https%253A%252F%252Farxiv.org%252Fabs%252F2011.12895%2523tencent.html
Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions [blog]
https%253A%252F%252Fbair.berkeley.edu%252Fblog%252F2020%252F07%252F11%252Fauction%252F.html
Grandmaster level in StarCraft II using multi-agent reinforcement learning
%252Fdoc%252Freinforcement-learning%252Fmodel-free%252Falphastar%252F2019-vinyals.pdf%2523deepmind.html
Emergent Tool Use from Multi-Agent Interaction § Surprising behavior
https%253A%252F%252Fopenai.com%252Fresearch%252Femergent-tool-use%2523surprisingbehaviors.html
https%253A%252F%252Fdavid-abel.github.io%252Fnotes%252Ficml_2019.pdf.html
Human-level performance in 3D multiplayer games with population-based reinforcement learning
%252Fdoc%252Freinforcement-learning%252Fexploration%252F2019-jaderberg.pdf%2523deepmind.html
https%253A%252F%252Farxiv.org%252Fabs%252F1902.02186%2523deepmind.html
Construction of arbitrarily strong amplifiers of natural selection using evolutionary graph theory
https%253A%252F%252Fwww.nature.com%252Farticles%252Fs42003-018-0078-7.html
Homo Moralis-Preference Evolution Under Incomplete Information and Assortative Matching
%252Fdoc%252Freinforcement-learning%252Fmulti-agent%252F2013-alger.pdf.html
If multi-agent learning is the answer, what is the question?
%252Fdoc%252Freinforcement-learning%252Fmulti-agent%252F2007-shoham.pdf.html
Wikipedia Bibliography: