Markov Games with Decoupled Dynamics: Price of Anarchy and Sample   Complexity

Runyu Zhang; Yuyang Zhang; Rohit Konda; Bryce Ferguson; Jason Marden,; Na Li

arXiv:2304.03840·cs.GT·April 11, 2023·1 cites

Markov Games with Decoupled Dynamics: Price of Anarchy and Sample Complexity

Runyu Zhang, Yuyang Zhang, Rohit Konda, Bryce Ferguson, Jason Marden,, Na Li

PDF

Open Access

TL;DR

This paper analyzes Markov games with decoupled dynamics, establishing bounds on the price of anarchy, introducing a distributed learning algorithm for potential games, and validating results through a dynamic covering game.

Contribution

It extends smoothness concepts to Markov games with decoupled dynamics, introduces the MA-SPI algorithm, and provides sample complexity analysis.

Findings

01

Bounded the price of anarchy using smoothness in Markov games.

02

Developed the MA-SPI algorithm with convergence guarantees.

03

Validated theoretical results with a dynamic covering game.

Abstract

This paper studies the finite-time horizon Markov games where the agents' dynamics are decoupled but the rewards can possibly be coupled across agents. The policy class is restricted to local policies where agents make decisions using their local state. We first introduce the notion of smooth Markov games which extends the smoothness argument for normal form games to our setting, and leverage the smoothness property to bound the price of anarchy of the Markov game. For a specific type of Markov game called the Markov potential game, we also develop a distributed learning algorithm, multi-agent soft policy iteration (MA-SPI), which provably converges to a Nash equilibrium. Sample complexity of the algorithm is also provided. Lastly, our results are validated using a dynamic covering game.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Game Theory and Applications · Optimization and Search Problems