Priority Based Synchronization for Faster Learning in Games
Abbasali Koochakzadeh, Yasin Yaz{\i}c{\i}o\u{g}lu

TL;DR
This paper introduces a decentralized synchronization strategy that allows multiple agents to update their actions simultaneously in game learning, significantly speeding up convergence without losing stability.
Contribution
It proposes a novel decentralized prioritization method enabling faster learning in potential games while preserving stochastic stability.
Findings
The synchronization strategy improves convergence speed in simulations.
The approach maintains the stochastic stability of optimal configurations.
Simulations demonstrate faster learning in coverage control scenarios.
Abstract
Learning in games has been widely used to solve many cooperative multi-agent problems such as coverage control, consensus, self-reconfiguration or vehicle-target assignment. One standard approach in this domain is to formulate the problem as a potential game and to use an algorithm such as log-linear learning to achieve the stochastic stability of globally optimal configurations. Standard versions of such learning algorithms are asynchronous, i.e., only one agent updates its action at each round of the learning process. To enable faster learning, we propose a synchronization strategy based on decentralized random prioritization of agents, which allows multiple agents to change their actions simultaneously when they do not affect each other's utility or feasible actions. We show that the proposed approach can be integrated into any standard asynchronous learning algorithm to improve the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Control Multi-Agent Systems · Distributed Sensor Networks and Detection Algorithms · Gene Regulatory Network Analysis
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
