Exponentially fast convergence to (strict) equilibrium via hedging
Johanne Cohen, Am\'elie H\'eliou, Panayotis Mertikopoulos

TL;DR
This paper demonstrates that the hedge variant of exponential weights learning converges exponentially fast to strict equilibria in N-player games, even under imperfect information, with convergence speed depending on step-size policies.
Contribution
It establishes exponential convergence rates for the hedge algorithm in game learning, including under uncertainty, and characterizes conditions for local convergence.
Findings
Exponential convergence occurs when players have perfect payoff information.
Under uncertainty, convergence remains with high probability using conservative step-sizes.
The convergence rate is exponential, proportional to the sum of step-sizes over time.
Abstract
Motivated by applications to data networks where fast convergence is essential, we analyze the problem of learning in generic N-person games that admit a Nash equilibrium in pure strategies. Specifically, we consider a scenario where players interact repeatedly and try to learn from past experience by small adjustments based on local - and possibly imperfect - payoff information. For concreteness, we focus on the so-called "hedge" variant of the exponential weights algorithm where players select an action with probability proportional to the exponential of the action's cumulative payoff over time. When players have perfect information on their mixed payoffs, the algorithm converges locally to a strict equilibrium and the rate of convergence is exponentially fast - of the order of where is a constant and is the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Game Theory and Applications
