Synchronization in Learning in Periodic Zero-Sum Games Triggers Divergence from Nash Equilibrium
Yuma Fujimoto, Kaito Ariu, Kenshi Abe

TL;DR
This paper investigates how synchronization between game variation speed and learning speed causes divergence from Nash equilibrium in periodic zero-sum games, revealing a novel phenomenon with theoretical and experimental support.
Contribution
It introduces the concept of synchronization-induced divergence in learning dynamics within periodic zero-sum games, supported by theoretical proofs and experiments.
Findings
Synchronization leads to divergence from Nash equilibrium.
Asynchronous speeds result in converging cycles.
Experimental results confirm theoretical predictions.
Abstract
Learning in zero-sum games studies a situation where multiple agents competitively learn their strategy. In such multi-agent learning, we often see that the strategies cycle around their optimum, i.e., Nash equilibrium. When a game periodically varies (called a ``periodic'' game), however, the Nash equilibrium moves generically. How learning dynamics behave in such periodic games is of interest but still unclear. Interestingly, we discover that the behavior is highly dependent on the relationship between the two speeds at which the game changes and at which players learn. We observe that when these two speeds synchronize, the learning dynamics diverge, and their time-average does not converge. Otherwise, the learning dynamics draw complicated cycles, but their time-average converges. Under some assumptions introduced for the dynamical systems analysis, we prove that this behavior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Advanced Thermodynamics and Statistical Mechanics
