Aspiration-based Perturbed Learning Automata in Games with Noisy Utility Measurements. Part A: Stochastic Stability in Non-zero-Sum Games
Georgios C. Chasparis

TL;DR
This paper introduces aspiration-based perturbed learning automata (APLA), a novel reinforcement learning scheme that ensures stochastic stability in multi-player non-zero-sum games with noisy utility measurements, extending prior work beyond potential games.
Contribution
The paper proposes APLA, a new payoff-based learning scheme with aspiration factors, and provides the first stochastic stability analysis for such dynamics in generic non-zero-sum games.
Findings
APLA guarantees stochastic stability in positive-utility games with noise.
The analysis establishes equivalence between infinite-dimensional and finite-dimensional Markov chains.
Results extend stability guarantees to weakly acyclic games.
Abstract
Reinforcement-based learning has attracted considerable attention both in modeling human behavior as well as in engineering, for designing measurement- or payoff-based optimization schemes. Such learning schemes exhibit several advantages, especially in relation to filtering out noisy observations. However, they may exhibit several limitations when applied in a distributed setup. In multi-player weakly-acyclic games, and when each player applies an independent copy of the learning dynamics, convergence to (usually desirable) pure Nash equilibria cannot be guaranteed. Prior work has only focused on a small class of games, namely potential and coordination games. To address this main limitation, this paper introduces a novel payoff-based learning scheme for distributed optimization, namely aspiration-based perturbed learning automata (APLA). In this class of dynamics, and contrary to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · Advanced Bandit Algorithms Research · Game Theory and Applications
