Aspiration-based Perturbed Learning Automata in Games with Noisy Utility Measurements. Part A: Stochastic Stability in Non-zero-Sum Games

Georgios C. Chasparis

arXiv:2511.11602·cs.LG·November 26, 2025

Aspiration-based Perturbed Learning Automata in Games with Noisy Utility Measurements. Part A: Stochastic Stability in Non-zero-Sum Games

Georgios C. Chasparis

PDF

Open Access

TL;DR

This paper introduces aspiration-based perturbed learning automata (APLA), a novel reinforcement learning scheme that ensures stochastic stability in multi-player non-zero-sum games with noisy utility measurements, extending prior work beyond potential games.

Contribution

The paper proposes APLA, a new payoff-based learning scheme with aspiration factors, and provides the first stochastic stability analysis for such dynamics in generic non-zero-sum games.

Findings

01

APLA guarantees stochastic stability in positive-utility games with noise.

02

The analysis establishes equivalence between infinite-dimensional and finite-dimensional Markov chains.

03

Results extend stability guarantees to weakly acyclic games.

Abstract

Reinforcement-based learning has attracted considerable attention both in modeling human behavior as well as in engineering, for designing measurement- or payoff-based optimization schemes. Such learning schemes exhibit several advantages, especially in relation to filtering out noisy observations. However, they may exhibit several limitations when applied in a distributed setup. In multi-player weakly-acyclic games, and when each player applies an independent copy of the learning dynamics, convergence to (usually desirable) pure Nash equilibria cannot be guaranteed. Prior work has only focused on a small class of games, namely potential and coordination games. To address this main limitation, this paper introduces a novel payoff-based learning scheme for distributed optimization, namely aspiration-based perturbed learning automata (APLA). In this class of dynamics, and contrary to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · Advanced Bandit Algorithms Research · Game Theory and Applications