Breaking the Curse of Multiagency in Robust Multi-Agent Reinforcement   Learning

Laixi Shi; Jingchu Gai; Eric Mazumdar; Yuejie Chi; Adam Wierman

arXiv:2409.20067·cs.LG·February 3, 2025

Breaking the Curse of Multiagency in Robust Multi-Agent Reinforcement Learning

Laixi Shi, Jingchu Gai, Eric Mazumdar, Yuejie Chi, Adam Wierman

PDF

Open Access

TL;DR

This paper introduces a novel class of distributionally robust Markov games inspired by behavioral economics, providing a sample-efficient algorithm that overcomes the exponential complexity typically associated with multiple agents in robust multi-agent reinforcement learning.

Contribution

It formulates a new class of RMGs considering agents' behaviors, proves the existence of solutions, and develops the first polynomial-sample complexity algorithm to break the curse of multiagency.

Findings

01

Established well-posedness of the new RMG class.

02

Proposed a polynomial-sample complexity algorithm for CCE.

03

First to overcome the curse of multiagency in RMGs.

Abstract

Standard multi-agent reinforcement learning (MARL) algorithms are vulnerable to sim-to-real gaps. To address this, distributionally robust Markov games (RMGs) have been proposed to enhance robustness in MARL by optimizing the worst-case performance when game dynamics shift within a prescribed uncertainty set. RMGs remains under-explored, from reasonable problem formulation to the development of sample-efficient algorithms. Two notorious and open challenges are the formulation of the uncertainty set and whether the corresponding RMGs can overcome the curse of multiagency, where the sample complexity scales exponentially with the number of agents. In this work, we propose a natural class of RMGs inspired by behavioral economics, where each agent's uncertainty set is shaped by both the environment and the integrated behavior of other agents. We first establish the well-posedness of this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsSparse Evolutionary Training