- See Also
-
Links
- “Learning Few-shot Imitation As Cultural Transmission”, Bhoopchand et al 2023
- “JaxMARL: Multi-Agent RL Environments in JAX”, Rutherford et al 2023
- “Large Language Models Can Strategically Deceive Their Users When Put Under Pressure”, Scheurer et al 2023
- “Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning”, Suárez et al 2023
- “Diversifying AI: Towards Creative Chess With AlphaZero (AZdb)”, Zahavy et al 2023
- “Combining Human Expertise With Artificial Intelligence: Experimental Evidence from Radiology”, Agarwal et al 2023
- “Reinforcement Learning in Newcomb-like Environments”, Bell et al 2023
- “Posterior Sampling for Multi-agent Reinforcement Learning: Solving Extensive Games With Imperfect Information”, Zhou et al 2023
- “Multi-Party Chat (MultiLIGHT): Conversational Agents in Group Settings With Humans and Models”, Wei et al 2023
- “Learning Agile Soccer Skills for a Bipedal Robot With Deep Reinforcement Learning”, Haarnoja et al 2023
- “Melting Pot 2.0”, Agapiou et al 2022
- “CICERO: Human-level Play in the Game of Diplomacy by Combining Language Models With Strategic Reasoning”, Team et al 2022
- “Over-communicate No More: Situated RL Agents Learn Concise Communication Protocols”, Kalinowska et al 2022
- “Human-AI Coordination via Human-Regularized Search and Learning”, Hu et al 2022
- “Game Theoretic Rating in N-player General-sum Games With Equilibria”, Marris et al 2022
- “Modeling Bounded Rationality in Multi-Agent Simulations Using Rationally Inattentive Reinforcement Learning”, Anonymous 2022
- “Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members”, Cornelisse et al 2022
- “Social Simulacra: Creating Populated Prototypes for Social Computing Systems”, Park et al 2022
- “DeepNash: Mastering the Game of Stratego With Model-Free Multiagent Reinforcement Learning”, Perolat et al 2022
- “Fleet-DAgger: Interactive Robot Fleet Learning With Scalable Human Supervision”, Hoque et al 2022
- “Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning”, Fu et al 2022
- “MAT: Multi-Agent Reinforcement Learning Is a Sequence Modeling Problem”, Wen et al 2022
- “First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization”, Reddy et al 2022
- “Emergent Bartering Behavior in Multi-Agent Reinforcement Learning”, Johanson et al 2022
- “NeuPL: Neural Population Learning”, Liu et al 2022
- “Uncalibrated Models Can Improve Human-AI Collaboration”, Vodrahalli et al 2022
- “Human-centered Mechanism Design With Democratic AI”, Koster et al 2022
- “Hidden Agenda: a Social Deduction Game With Diverse Learned Equilibria”, Kopparapu et al 2022
- “Finding General Equilibria in Many-Agent Economic Simulations Using Deep Reinforcement Learning”, Curry et al 2022
- “Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination”, Zhao et al 2021
- “Modeling Strong and Human-Like Gameplay With KL-Regularized Search”, Jacob et al 2021
- “Player of Games”, Schmid et al 2021
- “Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks”, Meng et al 2021
- “Collective Intelligence for Deep Learning: A Survey of Recent Developments”, Ha & Tang 2021
- “Learning to Ground Multi-Agent Communication With Autoencoders”, Lin et al 2021
- “Meta-learning, Social Cognition and Consciousness in Brains and Machines”, Langdon et al 2021
- “Collaborating With Humans without Human Data”, Strouse et al 2021
- “The Neural MMO Platform for Massively Multiagent Research”, Suarez et al 2021
- “Embodied Intelligence via Learning and Evolution”, Gupta et al 2021
- “No-Press Diplomacy from Scratch”, Bakhtin et al 2021
- “Replay-Guided Adversarial Environment Design”, Jiang et al 2021
- “Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning”, Kuba et al 2021
- “WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU”, Lan et al 2021
- “The AI Economist: Optimal Economic Policy Design via Two-level Deep Reinforcement Learning”, Zheng et al 2021
- “Open-Ended Learning Leads to Generally Capable Agents”, Team et al 2021
- “Megaverse: Simulating Embodied Agents at One Million Experiences per Second”, Petrenko et al 2021
- “Scalable Evaluation of Multi-Agent Reinforcement Learning With Melting Pot”, Leibo et al 2021
- “From Motor Control to Team Play in Simulated Humanoid Football”, Liu et al 2021
- “Cooperative AI Foundation (CAIF)”, CAIF 2021
- “Baller2vec++: A Look-Ahead Multi-Entity Transformer For Modeling Coordinated Agents”, Alcorn & Nguyen 2021
- “Neural Tree Expansion for Multi-Robot Planning in Non-Cooperative Environments”, Riviere et al 2021
- “Multitasking Inhibits Semantic Drift”, Jacob et al 2021
- “Asymmetric Self-play for Automatic Goal Discovery in Robotic Manipulation”, OpenAI et al 2021
- “Reinforcement Learning for Datacenter Congestion Control”, Tessler et al 2021
- “Baller2vec: A Multi-Entity Transformer For Multi-Agent Spatiotemporal Modeling”, Alcorn & Nguyen 2021
- “UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling With Transformers”, Hu et al 2021
- “Imitating Interactive Intelligence”, Abramson et al 2020
- “TLeague: A Framework for Competitive Self-Play Based Distributed Multi-Agent Reinforcement Learning”, Sun et al 2020
- “Towards Playing Full MOBA Games With Deep Reinforcement Learning”, Ye et al 2020
- “Emergent Road Rules In Multi-Agent Driving Environments”, Pal et al 2020
- “Reinforcement Learning for Optimization of COVID-19 Mitigation Policies”, Kompella et al 2020
- “Human-Level Performance in No-Press Diplomacy via Equilibrium Search”, Gray et al 2020
- “Emergent Social Learning via Multi-agent Reinforcement Learning”, Ndousse et al 2020
- “Grounded Language Learning Fast and Slow”, Hill et al 2020
- “ReBeL: Combining Deep Reinforcement Learning and Search for Imperfect-Information Games”, Brown et al 2020
- “Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions [blog]”, Chang & Kaushik 2020
- “One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control”, Huang et al 2020
- “Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions”, Chang et al 2020
- “Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks”, Papoudakis et al 2020
- “Learning to Play No-Press Diplomacy With Best Response Policy Iteration”, Anthony et al 2020
- “Approximate Exploitability: Learning a Best Response in Large Games”, Timbers et al 2020
- “Real World Games Look Like Spinning Tops”, Czarnecki et al 2020
- “Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and Their Solutions”, Wang et al 2020
- “Social Diversity and Social Preferences in Mixed-motive Reinforcement Learning”, McKee et al 2020
- “Effective Diversity in Population Based Reinforcement Learning”, Parker-Holder et al 2020
- “Towards Learning Multi-agent Negotiations via Self-Play”, Tang 2020
- “Smooth Markets: A Basic Mechanism for Organizing Gradient-based Learners”, Balduzzi et al 2020
- “Learning by Cheating”, Chen et al 2019
- “Increasing Generality in Machine Learning through Procedural Content Generation”, Risi & Togelius 2019
- “Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms”, Zhang et al 2019
- “Grandmaster Level in StarCraft II Using Multi-agent Reinforcement Learning”, Vinyals et al 2019
- “Stabilizing Generative Adversarial Networks: A Survey”, Wiatrak et al 2019
- “Emergent Tool Use from Multi-Agent Interaction § Surprising Behavior”, Baker et al 2019
- “Emergent Tool Use From Multi-Agent Autocurricula”, Baker et al 2019
- “No Press Diplomacy: Modeling Multi-Agent Gameplay”, Paquette et al 2019
- “A Review of Cooperative Multi-Agent Deep Reinforcement Learning”, OroojlooyJadid & Hajinezhad 2019
- “Pluribus: Superhuman AI for Multiplayer Poker”, Brown & Sandholm 2019
- “Evolving the Hearthstone Meta”, Silva et al 2019
- “Evolutionary Implementation of Bayesian Computations”, Czégel et al 2019
- “Finding Friend and Foe in Multi-Agent Games”, Serrino et al 2019
- “Hierarchical Decision Making by Generating and Following Natural Language Instructions”, Hu et al 2019
- “ICML 2019 Notes”, Abel 2019
- “Human-level Performance in 3D Multiplayer Games With Population-based Reinforcement Learning”, Jaderberg et al 2019
- “AI-GAs: AI-generating Algorithms, an Alternate Paradigm for Producing General Artificial Intelligence”, Clune 2019
- “Adversarial Policies: Attacking Deep Reinforcement Learning”, Gleave et al 2019
- “LIGHT: Learning to Speak and Act in a Fantasy Text Adventure Game”, Urbanek et al 2019
- “Α-Rank: Multi-Agent Evaluation by Evolution”, Omidshafiei et al 2019
- “Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research”, Leibo et al 2019
- “Distilling Policy Distillation”, Czarnecki et al 2019
- “Open-ended Learning in Symmetric Zero-sum Games”, Balduzzi et al 2019
- “Hierarchical Reinforcement Learning for Multi-agent MOBA Game”, Zhang et al 2019
- “Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions”, Wang et al 2019
- “Hierarchical Macro Strategy Model for MOBA Game AI”, Wu et al 2018
- “Continual Match Based Training in Pommerman: Technical Report”, Peng et al 2018
- “Malthusian Reinforcement Learning”, Leibo et al 2018
- “Evolution As Backstop for Reinforcement Learning”, Gwern 2018
- “Stable Opponent Shaping in Differentiable Games”, Letcher et al 2018
- “Deep Counterfactual Regret Minimization”, Brown et al 2018
- “TarMAC: Targeted Multi-Agent Communication”, Das et al 2018
- “Graph Convolutional Reinforcement Learning”, Jiang et al 2018
- “Social Influence As Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning”, Jaques et al 2018
- “Deep Reinforcement Learning”, Li 2018
- “A Survey and Critique of Multiagent Deep Reinforcement Learning”, Hernandez-Leal et al 2018
- “Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation”, Nogueira et al 2018
- “Pommerman: A Multi-Agent Playground”, Resnick et al 2018
- “Fully Distributed Multi-Robot Collision Avoidance via Deep Reinforcement Learning for Safe and Efficient Navigation in Complex Scenarios”, Fan et al 2018
- “Human-level Performance in First-person Multiplayer Games With Population-based Deep Reinforcement Learning”, Jaderberg et al 2018
- “Construction of Arbitrarily Strong Amplifiers of Natural Selection Using Evolutionary Graph Theory”, Pavlogiannis et al 2018
- “Adaptive Mechanism Design: Learning to Promote Cooperation”, Baumann et al 2018
- “Mix&Match—Agent Curricula for Reinforcement Learning”, Czarnecki et al 2018
- “Kickstarting Deep Reinforcement Learning”, Schmitt et al 2018
- “Machine Theory of Mind”, Rabinowitz et al 2018
- “Sim-to-Real Optimization of Complex Real World Mobile Network With Imperfect Information via Deep Reinforcement Learning from Self-play”, Tan et al 2018
- “Trust-Aware Decision Making for Human-Robot Collaboration: Model Learning and Planning”, Chen et al 2018
- “Emergent Complexity via Multi-Agent Competition”, Bansal et al 2017
- “Learning With Opponent-Learning Awareness”, Foerster et al 2017
- “LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions”, Wang et al 2017
- “CAN: Creative Adversarial Networks, Generating "Art" by Learning About Styles and Deviating from Style Norms”, Elgammal et al 2017
- “Supervision via Competition: Robot Adversaries for Learning Tasks”, Pinto et al 2016
- “Policy Distillation”, Rusu et al 2015
- “Reflective Oracles: A Foundation for Classical Game Theory”, Fallenstein et al 2015
- “Language Evolution in the Laboratory”, Scott-Phillips & Kirby 2010
- “Market-Based Reinforcement Learning in Partially Observable Worlds”, Kwee et al 2001
- “Properties of the Bucket Brigade Algorithm”, Holland 1985
- “Computer-Aided Gas Pipeline Operation Using Genetic Algorithms And Rule Learning”, Goldberg 1983
- “One Writer Enters International Competition to Play the World-conquering Game That Redefines What It Means to Be a Geek (and a Person)”
- Wikipedia
- Miscellaneous
- Link Bibliography
See Also
Links
“Learning Few-shot Imitation As Cultural Transmission”, Bhoopchand et al 2023
“JaxMARL: Multi-Agent RL Environments in JAX”, Rutherford et al 2023
“Large Language Models Can Strategically Deceive Their Users When Put Under Pressure”, Scheurer et al 2023
“Large Language Models can Strategically Deceive their Users when Put Under Pressure”
“Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning”, Suárez et al 2023
“Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning”
“Diversifying AI: Towards Creative Chess With AlphaZero (AZdb)”, Zahavy et al 2023
“Diversifying AI: Towards Creative Chess with AlphaZero (AZdb)”
“Combining Human Expertise With Artificial Intelligence: Experimental Evidence from Radiology”, Agarwal et al 2023
“Combining Human Expertise with Artificial Intelligence: Experimental Evidence from Radiology”
“Reinforcement Learning in Newcomb-like Environments”, Bell et al 2023
“Posterior Sampling for Multi-agent Reinforcement Learning: Solving Extensive Games With Imperfect Information”, Zhou et al 2023
“Multi-Party Chat (MultiLIGHT): Conversational Agents in Group Settings With Humans and Models”, Wei et al 2023
“Multi-Party Chat (MultiLIGHT): Conversational Agents in Group Settings with Humans and Models”
“Learning Agile Soccer Skills for a Bipedal Robot With Deep Reinforcement Learning”, Haarnoja et al 2023
“Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning”
“Melting Pot 2.0”, Agapiou et al 2022
“CICERO: Human-level Play in the Game of Diplomacy by Combining Language Models With Strategic Reasoning”, Team et al 2022
“Over-communicate No More: Situated RL Agents Learn Concise Communication Protocols”, Kalinowska et al 2022
“Over-communicate no more: Situated RL agents learn concise communication protocols”
“Human-AI Coordination via Human-Regularized Search and Learning”, Hu et al 2022
“Human-AI Coordination via Human-Regularized Search and Learning”
“Game Theoretic Rating in N-player General-sum Games With Equilibria”, Marris et al 2022
“Game Theoretic Rating in N-player general-sum games with Equilibria”
“Modeling Bounded Rationality in Multi-Agent Simulations Using Rationally Inattentive Reinforcement Learning”, Anonymous 2022
“Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members”, Cornelisse et al 2022
“Neural Payoff Machines: Predicting Fair and Stable Payoff Allocations Among Team Members”
“Social Simulacra: Creating Populated Prototypes for Social Computing Systems”, Park et al 2022
“Social Simulacra: Creating Populated Prototypes for Social Computing Systems”
“DeepNash: Mastering the Game of Stratego With Model-Free Multiagent Reinforcement Learning”, Perolat et al 2022
“DeepNash: Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning”
“Fleet-DAgger: Interactive Robot Fleet Learning With Scalable Human Supervision”, Hoque et al 2022
“Fleet-DAgger: Interactive Robot Fleet Learning with Scalable Human Supervision”
“Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning”, Fu et al 2022
“Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning”
“MAT: Multi-Agent Reinforcement Learning Is a Sequence Modeling Problem”, Wen et al 2022
“MAT: Multi-Agent Reinforcement Learning is a Sequence Modeling Problem”
“First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization”, Reddy et al 2022
“First Contact: Unsupervised Human-Machine Co-Adaptation via Mutual Information Maximization”
“Emergent Bartering Behavior in Multi-Agent Reinforcement Learning”, Johanson et al 2022
“Emergent Bartering Behavior in Multi-Agent Reinforcement Learning”
“NeuPL: Neural Population Learning”, Liu et al 2022
“Uncalibrated Models Can Improve Human-AI Collaboration”, Vodrahalli et al 2022
“Human-centered Mechanism Design With Democratic AI”, Koster et al 2022
“Hidden Agenda: a Social Deduction Game With Diverse Learned Equilibria”, Kopparapu et al 2022
“Hidden Agenda: a Social Deduction Game with Diverse Learned Equilibria”
“Finding General Equilibria in Many-Agent Economic Simulations Using Deep Reinforcement Learning”, Curry et al 2022
“Finding General Equilibria in Many-Agent Economic Simulations Using Deep Reinforcement Learning”
“Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination”, Zhao et al 2021
“Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination”
“Modeling Strong and Human-Like Gameplay With KL-Regularized Search”, Jacob et al 2021
“Modeling Strong and Human-Like Gameplay with KL-Regularized Search”
“Player of Games”, Schmid et al 2021
“Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks”, Meng et al 2021
“Collective Intelligence for Deep Learning: A Survey of Recent Developments”, Ha & Tang 2021
“Collective Intelligence for Deep Learning: A Survey of Recent Developments”
“Learning to Ground Multi-Agent Communication With Autoencoders”, Lin et al 2021
“Learning to Ground Multi-Agent Communication with Autoencoders”
“Meta-learning, Social Cognition and Consciousness in Brains and Machines”, Langdon et al 2021
“Meta-learning, social cognition and consciousness in brains and machines”
“Collaborating With Humans without Human Data”, Strouse et al 2021
“The Neural MMO Platform for Massively Multiagent Research”, Suarez et al 2021
“Embodied Intelligence via Learning and Evolution”, Gupta et al 2021
“No-Press Diplomacy from Scratch”, Bakhtin et al 2021
“Replay-Guided Adversarial Environment Design”, Jiang et al 2021
“Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning”, Kuba et al 2021
“Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning”
“WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU”, Lan et al 2021
“WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU”
“The AI Economist: Optimal Economic Policy Design via Two-level Deep Reinforcement Learning”, Zheng et al 2021
“The AI Economist: Optimal Economic Policy Design via Two-level Deep Reinforcement Learning”
“Open-Ended Learning Leads to Generally Capable Agents”, Team et al 2021
“Megaverse: Simulating Embodied Agents at One Million Experiences per Second”, Petrenko et al 2021
“Megaverse: Simulating Embodied Agents at One Million Experiences per Second”
“Scalable Evaluation of Multi-Agent Reinforcement Learning With Melting Pot”, Leibo et al 2021
“Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot”
“From Motor Control to Team Play in Simulated Humanoid Football”, Liu et al 2021
“From Motor Control to Team Play in Simulated Humanoid Football”
“Cooperative AI Foundation (CAIF)”, CAIF 2021
“Baller2vec++: A Look-Ahead Multi-Entity Transformer For Modeling Coordinated Agents”, Alcorn & Nguyen 2021
“baller2vec++: A Look-Ahead Multi-Entity Transformer For Modeling Coordinated Agents”
“Neural Tree Expansion for Multi-Robot Planning in Non-Cooperative Environments”, Riviere et al 2021
“Neural Tree Expansion for Multi-Robot Planning in Non-Cooperative Environments”
“Multitasking Inhibits Semantic Drift”, Jacob et al 2021
“Asymmetric Self-play for Automatic Goal Discovery in Robotic Manipulation”, OpenAI et al 2021
“Asymmetric self-play for automatic goal discovery in robotic manipulation”
“Reinforcement Learning for Datacenter Congestion Control”, Tessler et al 2021
“Baller2vec: A Multi-Entity Transformer For Multi-Agent Spatiotemporal Modeling”, Alcorn & Nguyen 2021
“baller2vec: A Multi-Entity Transformer For Multi-Agent Spatiotemporal Modeling”
“UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling With Transformers”, Hu et al 2021
“UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers”
“Imitating Interactive Intelligence”, Abramson et al 2020
“TLeague: A Framework for Competitive Self-Play Based Distributed Multi-Agent Reinforcement Learning”, Sun et al 2020
“Towards Playing Full MOBA Games With Deep Reinforcement Learning”, Ye et al 2020
“Towards Playing Full MOBA Games with Deep Reinforcement Learning”
“Emergent Road Rules In Multi-Agent Driving Environments”, Pal et al 2020
“Reinforcement Learning for Optimization of COVID-19 Mitigation Policies”, Kompella et al 2020
“Reinforcement Learning for Optimization of COVID-19 Mitigation policies”
“Human-Level Performance in No-Press Diplomacy via Equilibrium Search”, Gray et al 2020
“Human-Level Performance in No-Press Diplomacy via Equilibrium Search”
“Emergent Social Learning via Multi-agent Reinforcement Learning”, Ndousse et al 2020
“Emergent Social Learning via Multi-agent Reinforcement Learning”
“Grounded Language Learning Fast and Slow”, Hill et al 2020
“ReBeL: Combining Deep Reinforcement Learning and Search for Imperfect-Information Games”, Brown et al 2020
“ReBeL: Combining Deep Reinforcement Learning and Search for Imperfect-Information Games”
“Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions [blog]”, Chang & Kaushik 2020
“One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control”, Huang et al 2020
“One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control”
“Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions”, Chang et al 2020
“Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions”
“Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks”, Papoudakis et al 2020
“Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks”
“Learning to Play No-Press Diplomacy With Best Response Policy Iteration”, Anthony et al 2020
“Learning to Play No-Press Diplomacy with Best Response Policy Iteration”
“Approximate Exploitability: Learning a Best Response in Large Games”, Timbers et al 2020
“Approximate exploitability: Learning a best response in large games”
“Real World Games Look Like Spinning Tops”, Czarnecki et al 2020
“Enhanced POET: Open-Ended Reinforcement Learning through Unbounded Invention of Learning Challenges and Their Solutions”, Wang et al 2020
“Social Diversity and Social Preferences in Mixed-motive Reinforcement Learning”, McKee et al 2020
“Social diversity and social preferences in mixed-motive reinforcement learning”
“Effective Diversity in Population Based Reinforcement Learning”, Parker-Holder et al 2020
“Effective Diversity in Population Based Reinforcement Learning”
“Towards Learning Multi-agent Negotiations via Self-Play”, Tang 2020
“Smooth Markets: A Basic Mechanism for Organizing Gradient-based Learners”, Balduzzi et al 2020
“Smooth markets: A basic mechanism for organizing gradient-based learners”
“Learning by Cheating”, Chen et al 2019
“Increasing Generality in Machine Learning through Procedural Content Generation”, Risi & Togelius 2019
“Increasing Generality in Machine Learning through Procedural Content Generation”
“Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms”, Zhang et al 2019
“Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms”
“Grandmaster Level in StarCraft II Using Multi-agent Reinforcement Learning”, Vinyals et al 2019
“Grandmaster level in StarCraft II using multi-agent reinforcement learning”
“Stabilizing Generative Adversarial Networks: A Survey”, Wiatrak et al 2019
“Emergent Tool Use from Multi-Agent Interaction § Surprising Behavior”, Baker et al 2019
“Emergent Tool Use from Multi-Agent Interaction § Surprising behavior”
“Emergent Tool Use From Multi-Agent Autocurricula”, Baker et al 2019
“No Press Diplomacy: Modeling Multi-Agent Gameplay”, Paquette et al 2019
“A Review of Cooperative Multi-Agent Deep Reinforcement Learning”, OroojlooyJadid & Hajinezhad 2019
“A Review of Cooperative Multi-Agent Deep Reinforcement Learning”
“Pluribus: Superhuman AI for Multiplayer Poker”, Brown & Sandholm 2019
“Evolving the Hearthstone Meta”, Silva et al 2019
“Evolutionary Implementation of Bayesian Computations”, Czégel et al 2019
“Finding Friend and Foe in Multi-Agent Games”, Serrino et al 2019
“Hierarchical Decision Making by Generating and Following Natural Language Instructions”, Hu et al 2019
“Hierarchical Decision Making by Generating and Following Natural Language Instructions”
“ICML 2019 Notes”, Abel 2019
“Human-level Performance in 3D Multiplayer Games With Population-based Reinforcement Learning”, Jaderberg et al 2019
“Human-level performance in 3D multiplayer games with population-based reinforcement learning”
“AI-GAs: AI-generating Algorithms, an Alternate Paradigm for Producing General Artificial Intelligence”, Clune 2019
“Adversarial Policies: Attacking Deep Reinforcement Learning”, Gleave et al 2019
“Adversarial Policies: Attacking Deep Reinforcement Learning”
“LIGHT: Learning to Speak and Act in a Fantasy Text Adventure Game”, Urbanek et al 2019
“LIGHT: Learning to Speak and Act in a Fantasy Text Adventure Game”
“Α-Rank: Multi-Agent Evaluation by Evolution”, Omidshafiei et al 2019
“Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research”, Leibo et al 2019
“Distilling Policy Distillation”, Czarnecki et al 2019
“Open-ended Learning in Symmetric Zero-sum Games”, Balduzzi et al 2019
“Hierarchical Reinforcement Learning for Multi-agent MOBA Game”, Zhang et al 2019
“Hierarchical Reinforcement Learning for Multi-agent MOBA Game”
“Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions”, Wang et al 2019
“Hierarchical Macro Strategy Model for MOBA Game AI”, Wu et al 2018
“Continual Match Based Training in Pommerman: Technical Report”, Peng et al 2018
“Continual Match Based Training in Pommerman: Technical Report”
“Malthusian Reinforcement Learning”, Leibo et al 2018
“Evolution As Backstop for Reinforcement Learning”, Gwern 2018
“Stable Opponent Shaping in Differentiable Games”, Letcher et al 2018
“Deep Counterfactual Regret Minimization”, Brown et al 2018
“TarMAC: Targeted Multi-Agent Communication”, Das et al 2018
“Graph Convolutional Reinforcement Learning”, Jiang et al 2018
“Social Influence As Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning”, Jaques et al 2018
“Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning”
“Deep Reinforcement Learning”, Li 2018
“A Survey and Critique of Multiagent Deep Reinforcement Learning”, Hernandez-Leal et al 2018
“A Survey and Critique of Multiagent Deep Reinforcement Learning”
“Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation”, Nogueira et al 2018
“Learning to Coordinate Multiple Reinforcement Learning Agents for Diverse Query Reformulation”
“Pommerman: A Multi-Agent Playground”, Resnick et al 2018
“Fully Distributed Multi-Robot Collision Avoidance via Deep Reinforcement Learning for Safe and Efficient Navigation in Complex Scenarios”, Fan et al 2018
“Human-level Performance in First-person Multiplayer Games With Population-based Deep Reinforcement Learning”, Jaderberg et al 2018
“Construction of Arbitrarily Strong Amplifiers of Natural Selection Using Evolutionary Graph Theory”, Pavlogiannis et al 2018
“Construction of arbitrarily strong amplifiers of natural selection using evolutionary graph theory”
“Adaptive Mechanism Design: Learning to Promote Cooperation”, Baumann et al 2018
“Adaptive Mechanism Design: Learning to Promote Cooperation”
“Mix&Match—Agent Curricula for Reinforcement Learning”, Czarnecki et al 2018
“Kickstarting Deep Reinforcement Learning”, Schmitt et al 2018
“Machine Theory of Mind”, Rabinowitz et al 2018
“Sim-to-Real Optimization of Complex Real World Mobile Network With Imperfect Information via Deep Reinforcement Learning from Self-play”, Tan et al 2018
“Trust-Aware Decision Making for Human-Robot Collaboration: Model Learning and Planning”, Chen et al 2018
“Trust-Aware Decision Making for Human-Robot Collaboration: Model Learning and Planning”
“Emergent Complexity via Multi-Agent Competition”, Bansal et al 2017
“Learning With Opponent-Learning Awareness”, Foerster et al 2017
“LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions”, Wang et al 2017
“LADDER: A Human-Level Bidding Agent for Large-Scale Real-Time Online Auctions”
“CAN: Creative Adversarial Networks, Generating "Art" by Learning About Styles and Deviating from Style Norms”, Elgammal et al 2017
“Supervision via Competition: Robot Adversaries for Learning Tasks”, Pinto et al 2016
“Supervision via Competition: Robot Adversaries for Learning Tasks”
“Policy Distillation”, Rusu et al 2015
“Reflective Oracles: A Foundation for Classical Game Theory”, Fallenstein et al 2015
“Reflective Oracles: A Foundation for Classical Game Theory”
“Language Evolution in the Laboratory”, Scott-Phillips & Kirby 2010
“Market-Based Reinforcement Learning in Partially Observable Worlds”, Kwee et al 2001
“Market-Based Reinforcement Learning in Partially Observable Worlds”
“Properties of the Bucket Brigade Algorithm”, Holland 1985
“Computer-Aided Gas Pipeline Operation Using Genetic Algorithms And Rule Learning”, Goldberg 1983
“Computer-Aided Gas Pipeline Operation Using Genetic Algorithms And Rule Learning”
“One Writer Enters International Competition to Play the World-conquering Game That Redefines What It Means to Be a Geek (and a Person)”
Wikipedia
Miscellaneous
-
/doc/reinforcement-learning/multi-agent/2019-jaderberg-supplement-movie-4-aau6249s4.mp4
-
/doc/reinforcement-learning/multi-agent/2019-jaderberg-supplement-movie-3-aau6249s3.mp4
-
/doc/reinforcement-learning/multi-agent/2019-jaderberg-supplement-movie-2-aau6249s2.mp4
-
/doc/reinforcement-learning/multi-agent/2019-jaderberg-supplement-movie-1-aau6249s1.mp4
-
https://blog.research.google/2019/06/introducing-google-research-football.html
-
https://blog.research.google/2021/03/leveraging-machine-learning-for-game.html
-
https://joao-abrantes.com/posts/mimicking-evolution-with-reinforcement-learning/
-
https://twitter.com/evanthebouncy/status/1642918859866009600
-
https://www.deepmind.com/blog/generally-capable-agents-emerge-from-open-ended-play
-
https://www.deepmind.com/publications/open-ended-learning-leads-to-generally-capable-agents
-
https://www.lesswrong.com/posts/65qmEJHDw3vw69tKm/proposal-scaling-laws-for-rl-generalization
-
https://www.lesswrong.com/posts/yHxmJch8dJoH6dwwz/so-has-ai-conquered-bridge
-
https://www.nytimes.com/2013/09/08/magazine/poker-computer.html
-
https://www.quantamagazine.org/computers-evolve-a-new-path-toward-human-intelligence-20191106/
Link Bibliography
-
https://www.nature.com/articles/s41467-023-42875-2#deepmind
: “Learning Few-shot Imitation As Cultural Transmission”, -
https://arxiv.org/abs/2311.10090
: “JaxMARL: Multi-Agent RL Environments in JAX”, -
https://arxiv.org/abs/2311.03736
: “Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning”, -
https://www.nber.org/papers/w31422
: “Combining Human Expertise With Artificial Intelligence: Experimental Evidence from Radiology”, Nikhil Agarwal, Alex Moehring, Pranav Rajpurkar, Tobias Salz -
https://arxiv.org/abs/2304.13653#deepmind
: “Learning Agile Soccer Skills for a Bipedal Robot With Deep Reinforcement Learning”, -
https://www.science.org/doi/10.1126/science.ade9097#facebook
: “CICERO: Human-level Play in the Game of Diplomacy by Combining Language Models With Strategic Reasoning”, -
https://openreview.net/forum?id=DY1pMrmDkm
: “Modeling Bounded Rationality in Multi-Agent Simulations Using Rationally Inattentive Reinforcement Learning”, Anonymous -
https://arxiv.org/abs/2208.04024
: “Social Simulacra: Creating Populated Prototypes for Social Computing Systems”, Joon Sung Park, Lindsay Popowski, Carrie J. Cai, Meredith Ringel Morris, Percy Liang, Michael S. Bernstein -
https://arxiv.org/abs/2206.15378#deepmind
: “DeepNash: Mastering the Game of Stratego With Model-Free Multiagent Reinforcement Learning”, -
https://arxiv.org/abs/2206.14349
: “Fleet-DAgger: Interactive Robot Fleet Learning With Scalable Human Supervision”, -
https://arxiv.org/abs/2206.07505
: “Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning”, Wei Fu, Chao Yu, Zelai Xu, Jiaqi Yang, Yi Wu -
https://arxiv.org/abs/2205.14953
: “MAT: Multi-Agent Reinforcement Learning Is a Sequence Modeling Problem”, Muning Wen, Jakub Grudzien Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang -
https://arxiv.org/abs/2202.07415#deepmind
: “NeuPL: Neural Population Learning”, Siqi Liu, Luke Marris, Daniel Hennes, Josh Merel, Nicolas Heess, Thore Graepel -
https://arxiv.org/abs/2112.11701#tencent
: “Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination”, Rui Zhao, Jinming Song, Hu Haifeng, Yang Gao, Yi Wu, Zhongqian Sun, Yang Wei -
https://arxiv.org/abs/2112.03178#deepmind
: “Player of Games”, -
https://arxiv.org/abs/2105.12196#deepmind
: “From Motor Control to Team Play in Simulated Humanoid Football”, -
https://arxiv.org/abs/2104.11980
: “Baller2vec++: A Look-Ahead Multi-Entity Transformer For Modeling Coordinated Agents”, Michael A. Alcorn, Anh Nguyen -
https://arxiv.org/abs/2012.05672#deepmind
: “Imitating Interactive Intelligence”, -
https://arxiv.org/abs/2011.12895#tencent
: “TLeague: A Framework for Competitive Self-Play Based Distributed Multi-Agent Reinforcement Learning”, Peng Sun, Jiechao Xiong, Lei Han, Xinghai Sun, Shuxing Li, Jiawei Xu, Meng Fang, Zhengyou Zhang -
https://arxiv.org/abs/2011.12692#tencent
: “Towards Playing Full MOBA Games With Deep Reinforcement Learning”, -
https://bair.berkeley.edu/blog/2020/07/11/auction/
: “Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions [blog]”, Michael Chang, Sidhant Kaushik -
2019-vinyals.pdf#deepmind
: “Grandmaster Level in StarCraft II Using Multi-agent Reinforcement Learning”, -
https://openai.com/research/emergent-tool-use#surprisingbehaviors
: “Emergent Tool Use from Multi-Agent Interaction § Surprising Behavior”, Bowen Baker, Ingmar Kanitscheider, Todor Markov, Yi Wu, Glenn Powell, Bob McGrew, Igor Mordatch -
https://david-abel.github.io/notes/icml_2019.pdf
: “ICML 2019 Notes”, David Abel -
2019-jaderberg.pdf#deepmind
: “Human-level Performance in 3D Multiplayer Games With Population-based Reinforcement Learning”, -
https://arxiv.org/abs/1902.02186#deepmind
: “Distilling Policy Distillation”, Wojciech Marian Czarnecki, Razvan Pascanu, Simon Osindero, Siddhant M. Jayakumar, Grzegorz Swirszcz, Max Jaderberg -
backstop
: “Evolution As Backstop for Reinforcement Learning”, Gwern -
https://www.nature.com/articles/s42003-018-0078-7
: “Construction of Arbitrarily Strong Amplifiers of Natural Selection Using Evolutionary Graph Theory”, Andreas Pavlogiannis, Josef Tkadlec, Krishnendu Chatterjee, Martin A. Nowak