Large Deviations Analysis For Regret Minimizing Stochastic Approximation   Algorithms

Hongjiang Qian; Vikram Krishnamurthy

arXiv:2406.00414·math.OC·June 4, 2024

Large Deviations Analysis For Regret Minimizing Stochastic Approximation Algorithms

Hongjiang Qian, Vikram Krishnamurthy

PDF

Open Access

TL;DR

This paper analyzes the probability of rare deviations in a multi-agent regret minimization algorithm using large deviations theory, providing insights into its convergence behavior.

Contribution

It introduces a large deviations framework for analyzing regret minimizing stochastic approximation algorithms with multi-agent communication.

Findings

01

Exponential decay rate towards stable point derived

02

Large deviations principles applied to multi-agent regret algorithms

03

Characterization of rare event probabilities in convergence analysis

Abstract

Motivated by learning of correlated equilibria in non-cooperative games, we perform a large deviations analysis of a regret minimizing stochastic approximation algorithm. The regret minimization algorithm we consider comprises multiple agents that communicate over a graph to coordinate their decisions. We derive an exponential decay rate towards the algorithm's stable point using large deviations theory. Our analysis leverages the variational representation of the Laplace functionals and weak convergence methods to characterize the exponential decay rate.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications