Taming the Curses of Multiagency in Robust Markov Games with Large State Space through Linear Function Approximation
Jingchu Gai, Laixi Shi

TL;DR
This paper introduces data-efficient algorithms for large-scale distributionally robust Markov games using linear function approximation, overcoming the curse of multiagency in complex environments.
Contribution
It develops the first provably data-efficient algorithms for large or infinite state space RMGs with LFA, addressing robustness and sample complexity challenges.
Findings
Algorithms work in both generative and online settings.
First to break the curse of multiagency for large-scale RMGs.
Applicable to uncertainty sets defined by total variation distance.
Abstract
Multi-agent reinforcement learning (MARL) holds great potential but faces robustness challenges due to environmental uncertainty. To address this, distributionally robust Markov games (RMGs) optimize worst-case performance when the environment deviates from the nominal model within a uncertainty set. Beyond robustness, an equally urgent goal for MARL is data efficiency -- sampling from vast state and action spaces that grow exponentially with the number of agents potentially leads to the curse of multiagency. However, current provably data-efficient algorithms for RMGs are limited to tabular settings with finite state and action spaces, which are only computationally manageable for small-scale problems, leaving RMGs with large-scale (or infinite) state spaces largely unexplored. The only existing work beyond tabular settings focuses on linear function approximation (LFA) for a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
