Stochastic Stability of Perturbed Learning Automata in Positive-Utility   Games

Georgios C. Chasparis

arXiv:1709.05859·cs.GT·January 29, 2019

Stochastic Stability of Perturbed Learning Automata in Positive-Utility Games

Georgios C. Chasparis

PDF

TL;DR

This paper analyzes the long-term stability of perturbed learning automata in positive-utility games by characterizing invariant measures of the induced Markov chain, offering a new approach beyond traditional Lyapunov-based methods.

Contribution

It introduces a novel framework for stochastic stability analysis that does not rely on potential functions, applicable to a broader class of positive-utility games.

Findings

01

Provides a method to compute invariant measures in positive-utility games

02

Demonstrates the approach with an example in coordination games

03

Offers insights into the global convergence behavior of learning automata

Abstract

This paper considers a class of reinforcement-based learning (namely, perturbed learning automata) and provides a stochastic-stability analysis in repeatedly-played, positive-utility, finite strategic-form games. Prior work in this class of learning dynamics primarily analyzes asymptotic convergence through stochastic approximations, where convergence can be associated with the limit points of an ordinary-differential equation (ODE). However, analyzing global convergence through an ODE-approximation requires the existence of a Lyapunov or a potential function, which naturally restricts the analysis to a fine class of games. To overcome these limitations, this paper introduces an alternative framework for analyzing asymptotic convergence that is based upon an explicit characterization of the invariant probability measure of the induced Markov chain. We further provide a methodology for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.