Evolution toward a Nash equilibrium

Ioannis Avramopoulos

arXiv:2007.10331·cs.GT·July 22, 2020

Evolution toward a Nash equilibrium

Ioannis Avramopoulos

PDF

Open Access

TL;DR

This paper proves that the Hedge algorithm's average iterates converge to Nash equilibria in symmetric bimatrix games, providing a polynomial-time approximation scheme that impacts computational complexity theory.

Contribution

It generalizes convergence results of Hedge from zero-sum to symmetric bimatrix games and introduces a fully polynomial-time approximation scheme for symmetric equilibria.

Findings

01

Empirical averages of Hedge converge to Nash equilibria in symmetric games.

02

The analysis yields a symmetric equilibrium fully polynomial-time approximation scheme.

03

Implication that P equals PPAD under the proposed scheme.

Abstract

In this paper, we study the dynamic behavior of Hedge, a well-known algorithm in theoretical machine learning and algorithmic game theory. The empirical average (arithmetic mean) of the iterates Hedge generates is known to converge to a minimax equilibrium in zero-sum games. We generalize that result to show convergence of the empirical average to Nash equilibrium in symmetric bimatrix games (that is bimatrix games where the payoff matrix of each player is the transpose of that of the other) in the sense that every limit point of the sequence of averages is an $ϵ$ -approximate symmetric equilibrium strategy for any desirable $ϵ$ . Our analysis gives rise to a symmetric equilibrium fully polynomial-time approximation scheme, implying P = PPAD.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Optimization and Search Problems