Anytime-Constrained Equilibria in Polynomial Time

Jeremy McMahan

arXiv:2410.23637·cs.LG·March 5, 2025

Anytime-Constrained Equilibria in Polynomial Time

Jeremy McMahan

PDF

Open Access 1 Video

TL;DR

This paper introduces anytime-constrained equilibria in Markov games, providing computational methods including fixed-parameter tractable and polynomial-time algorithms, with optimal approximation guarantees under complexity assumptions.

Contribution

It extends anytime constraints to Markov games, develops a comprehensive theory, and offers new algorithms for computing equilibria with optimal approximation guarantees.

Findings

01

Fixed-parameter tractable algorithm for ACE

02

Polynomial-time approximate algorithm for ACE

03

Optimal approximation guarantees assuming P ≠ NP

Abstract

We extend anytime constraints to the Markov game setting and the corresponding solution concept of an anytime-constrained equilibrium (ACE). Then, we present a comprehensive theory of anytime-constrained equilibria that includes (1) a computational characterization of feasible policies, (2) a fixed-parameter tractable algorithm for computing ACE, and (3) a polynomial-time algorithm for approximately computing ACE. Since computing a feasible policy is NP-hard even for two-player zero-sum games, our approximation guarantees are optimal so long as $P \neq = N P$ . We also develop the first theory of efficient computation for action-constrained Markov games, which may be of independent interest.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Anytime-Constrained Equilibria in Polynomial Time· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics