Distributionally Robust Markov Games with Average Reward

Zachary Roch; Yue Wang

arXiv:2508.03136·cs.MA·December 12, 2025

Distributionally Robust Markov Games with Average Reward

Zachary Roch, Yue Wang

PDF

Open Access

TL;DR

This paper develops a theoretical and algorithmic framework for distributionally robust Markov games with average reward, addressing decision-making under uncertainty for multi-agent long-term interactions.

Contribution

It establishes the existence of stationary Nash Equilibria, proposes algorithms with convergence guarantees, and connects average-reward robust NE to discounted NE.

Findings

01

Existence of stationary Nash Equilibrium under standard assumptions.

02

Proposed algorithms with proven convergence guarantees.

03

Average-reward robust NE can be approximated by discounted NE.

Abstract

We study distributionally robust Markov games (DR-MGs) with the average-reward criterion, a framework for multi-agent decision-making under uncertainty over extended horizons. In average reward DR-MGs, agents aim to maximize their worst-case infinite-horizon average reward, to ensure satisfactory performance under environment uncertainties and opponent actions. We first establish a connection between the best-response policies and the optimal policies for the induced single-agent problems. Under a standard irreducible assumption, we derive a correspondence between the optimal policies and the solutions of the robust Bellman equation, and derive the existence of stationary Nash Equilibrium (NE) based on these results. We further study DR-MGs under the weakly communicating setting, where we construct a set-valued map and show its value is a subset of the best-response policies, convex and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Game Theory and Applications · Advanced Bandit Algorithms Research