Provably Learning Nash Policies in Constrained Markov Potential Games

Pragnya Alatur; Giorgia Ramponi; Niao He; Andreas Krause

arXiv:2306.07749·cs.LG·June 14, 2023·2 cites

Provably Learning Nash Policies in Constrained Markov Potential Games

Pragnya Alatur, Giorgia Ramponi, Niao He, Andreas Krause

PDF

Open Access

TL;DR

This paper introduces a new class of safe multi-agent reinforcement learning problems called Constrained Markov Potential Games, and proposes a provably convergent algorithm with sample complexity bounds for learning Nash policies in these settings.

Contribution

It defines CMPGs, shows they lack strong duality, and develops CA-CMPG, the first provably convergent algorithm with sample complexity for unknown CMPGs.

Findings

01

CA-CMPG converges to Nash policies in finite-horizon CMPGs.

02

Provides the first sample complexity bounds for learning Nash policies in unknown CMPGs.

03

Guarantees safe exploration under certain assumptions.

Abstract

Multi-agent reinforcement learning (MARL) addresses sequential decision-making problems with multiple agents, where each agent optimizes its own objective. In many real-world instances, the agents may not only want to optimize their objectives, but also ensure safe behavior. For example, in traffic routing, each car (agent) aims to reach its destination quickly (objective) while avoiding collisions (safety). Constrained Markov Games (CMGs) are a natural formalism for safe MARL problems, though generally intractable. In this work, we introduce and study Constrained Markov Potential Games (CMPGs), an important class of CMGs. We first show that a Nash policy for CMPGs can be found via constrained optimization. One tempting approach is to solve it by Lagrangian-based primal-dual methods. As we show, in contrast to the single-agent setting, however, CMPGs do not satisfy strong duality,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety · Formal Methods in Verification