Evolution toward a Nash equilibrium
Ioannis Avramopoulos

TL;DR
This paper proves that the Hedge algorithm's average iterates converge to Nash equilibria in symmetric bimatrix games, providing a polynomial-time approximation scheme that impacts computational complexity theory.
Contribution
It generalizes convergence results of Hedge from zero-sum to symmetric bimatrix games and introduces a fully polynomial-time approximation scheme for symmetric equilibria.
Findings
Empirical averages of Hedge converge to Nash equilibria in symmetric games.
The analysis yields a symmetric equilibrium fully polynomial-time approximation scheme.
Implication that P equals PPAD under the proposed scheme.
Abstract
In this paper, we study the dynamic behavior of Hedge, a well-known algorithm in theoretical machine learning and algorithmic game theory. The empirical average (arithmetic mean) of the iterates Hedge generates is known to converge to a minimax equilibrium in zero-sum games. We generalize that result to show convergence of the empirical average to Nash equilibrium in symmetric bimatrix games (that is bimatrix games where the payoff matrix of each player is the transpose of that of the other) in the sense that every limit point of the sequence of averages is an -approximate symmetric equilibrium strategy for any desirable . Our analysis gives rise to a symmetric equilibrium fully polynomial-time approximation scheme, implying P = PPAD.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Optimization and Search Problems
