Aspiration-based Perturbed Learning Automata in Games with Noisy Utility Measurements. Part B: Stochastic Stability in Weakly Acyclic Games

Georgios C. Chasparis

arXiv:2511.18418·cs.GT·November 25, 2025

Aspiration-based Perturbed Learning Automata in Games with Noisy Utility Measurements. Part B: Stochastic Stability in Weakly Acyclic Games

Georgios C. Chasparis

PDF

Open Access

TL;DR

This paper introduces aspiration-based perturbed learning automata (APLA), a novel reinforcement learning scheme that guarantees convergence to pure Nash equilibria in weakly-acyclic games with noisy observations, extending prior work to more general game classes.

Contribution

The paper proposes APLA, a new payoff-based learning scheme with stochastic stability analysis in noisy, multi-player weakly-acyclic games, addressing convergence limitations of previous methods.

Findings

01

Convergence to pure Nash equilibria is achieved under certain conditions.

02

APLA extends reinforcement learning to weakly-acyclic games with noisy data.

03

Simulation supports theoretical convergence results.

Abstract

Reinforcement-based learning dynamics may exhibit several limitations when applied in a distributed setup. In (repeatedly-played) multi-player/action strategic-form games, and when each player applies an independent copy of the learning dynamics, convergence to (usually desirable) pure Nash equilibria cannot be guaranteed. Prior work has only focused on a small class of games, namely potential and coordination games. Furthermore, strong convergence guarantees (i.e., almost sure convergence or weak convergence) are mostly restricted to two-player games. To address this main limitation of reinforcement-based learning in repeatedly-played strategic-form games, this paper introduces a novel payoff-based learning scheme for distributed optimization in multi-player/action strategic-form games. We present an extension of perturbed learning automata (PLA), namely aspiration-based perturbed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Search Problems · Advanced Bandit Algorithms Research · Game Theory and Applications