Second-Order Mirror Descent: Convergence in Games Beyond Averaging and   Discounting

Bolin Gao; Lacra Pavel

arXiv:2111.09982·math.OC·October 28, 2024

Second-Order Mirror Descent: Convergence in Games Beyond Averaging and Discounting

Bolin Gao, Lacra Pavel

PDF

Open Access

TL;DR

This paper introduces a second-order mirror descent (MD2) dynamics for continuous-time game convergence, achieving stability without averaging or discounting, and extends to stochastic discrete-time settings with noisy data.

Contribution

The paper proposes MD2, a novel second-order mirror descent method that ensures convergence to variationally stable states without traditional averaging or discounting techniques.

Findings

01

MD2 converges to variationally stable states in continuous time.

02

MD2 achieves exponential convergence to strong VSS with slight modifications.

03

Discrete-time MD2 converges under noisy observations using stochastic approximation.

Abstract

In this paper, we propose a second-order extension of the continuous-time game-theoretic mirror descent (MD) dynamics, referred to as MD2, which provably converges to mere (but not necessarily strict) variationally stable states (VSS) without using common auxiliary techniques such as time-averaging or discounting. We show that MD2 enjoys no-regret as well as an exponential rate of convergence towards strong VSS upon a slight modification. MD2 can also be used to derive many novel continuous-time primal-space dynamics. We then use stochastic approximation techniques to provide a convergence guarantee of discrete-time MD2 with noisy observations towards interior mere VSS. Selected simulations are provided to illustrate our results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Time Series Analysis · Opinion Dynamics and Social Influence · Game Theory and Applications