A Polynomial-time Nash Equilibrium Algorithm for Repeated Stochastic   Games

Enrique Munoz de Cote; Michael L. Littman

arXiv:1206.3277·cs.GT·June 18, 2012·28 cites

A Polynomial-time Nash Equilibrium Algorithm for Repeated Stochastic Games

Enrique Munoz de Cote, Michael L. Littman

PDF

Open Access

TL;DR

This paper introduces a polynomial-time algorithm for finding approximate Nash equilibria in repeated two-player stochastic games, leveraging the folk theorem and efficiently approximating the egalitarian point to improve social welfare.

Contribution

The paper presents the first polynomial-time algorithm for approximate Nash equilibria in repeated stochastic games, combining equilibrium construction with egalitarian point approximation.

Findings

01

Algorithm finds equilibria efficiently in grid games

02

Solutions have higher social welfare than competing methods

03

Algorithm guarantees computational efficiency

Abstract

We present a polynomial-time algorithm that always finds an (approximate) Nash equilibrium for repeated two-player stochastic games. The algorithm exploits the folk theorem to derive a strategy profile that forms an equilibrium by buttressing mutually beneficial behavior with threats, where possible. One component of our algorithm efficiently searches for an approximation of the egalitarian point, the fairest pareto-efficient solution. The paper concludes by applying the algorithm to a set of grid games to illustrate typical solutions the algorithm finds. These solutions compare very favorably to those found by competing algorithms, resulting in strategies with higher social welfare, as well as guaranteed computational efficiency.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGame Theory and Applications · Economic theories and models · Game Theory and Voting Systems