Distributionally Robust Markov Games with Average Reward
Zachary Roch, Yue Wang

TL;DR
This paper develops a theoretical and algorithmic framework for distributionally robust Markov games with average reward, addressing decision-making under uncertainty for multi-agent long-term interactions.
Contribution
It establishes the existence of stationary Nash Equilibria, proposes algorithms with convergence guarantees, and connects average-reward robust NE to discounted NE.
Findings
Existence of stationary Nash Equilibrium under standard assumptions.
Proposed algorithms with proven convergence guarantees.
Average-reward robust NE can be approximated by discounted NE.
Abstract
We study distributionally robust Markov games (DR-MGs) with the average-reward criterion, a framework for multi-agent decision-making under uncertainty over extended horizons. In average reward DR-MGs, agents aim to maximize their worst-case infinite-horizon average reward, to ensure satisfactory performance under environment uncertainties and opponent actions. We first establish a connection between the best-response policies and the optimal policies for the induced single-agent problems. Under a standard irreducible assumption, we derive a correspondence between the optimal policies and the solutions of the robust Bellman equation, and derive the existence of stationary Nash Equilibrium (NE) based on these results. We further study DR-MGs under the weakly communicating setting, where we construct a set-valued map and show its value is a subset of the best-response policies, convex and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Game Theory and Applications · Advanced Bandit Algorithms Research
