A unified stochastic approximation framework for learning in games
Panayotis Mertikopoulos, Ya-Ping Hsieh, Volkan Cevher

TL;DR
This paper introduces a comprehensive stochastic approximation framework that unifies and extends the analysis of various learning algorithms in games, providing new convergence insights for both continuous and finite settings.
Contribution
It offers a unified analysis template for multiple learning algorithms in games, including new convergence results and the concept of coherence for finite-time convergence.
Findings
Established criteria for convergence to Nash equilibria.
Introduced the notion of coherence for finite-time convergence.
Applicable to payoff-based and bandit learning methods.
Abstract
We develop a flexible stochastic approximation framework for analyzing the long-run behavior of learning in games (both continuous and finite). The proposed analysis template incorporates a wide array of popular learning algorithms, including gradient-based methods, the exponential/multiplicative weights algorithm for learning in finite games, optimistic and bandit variants of the above, etc. In addition to providing an integrated view of these algorithms, our framework further allows us to obtain several new convergence results, both asymptotic and in finite time, in both continuous and finite games. Specifically, we provide a range of criteria for identifying classes of Nash equilibria and sets of action profiles that are attracting with high probability, and we also introduce the notion of coherence, a game-theoretic property that includes strict and sharp equilibria, and which leads…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExperimental Behavioral Economics Studies · Game Theory and Applications · Advanced Bandit Algorithms Research
