Global Behavior of Learning Dynamics in Zero-Sum Games with Memory   Asymmetry

Yuma Fujimoto; Kaito Ariu; Kenshi Abe

arXiv:2405.14546·cs.GT·March 6, 2025

Global Behavior of Learning Dynamics in Zero-Sum Games with Memory Asymmetry

Yuma Fujimoto, Kaito Ariu, Kenshi Abe

PDF

Open Access

TL;DR

This paper analyzes the global behavior of learning dynamics in zero-sum games with asymmetric memory, revealing conditions for convergence to Nash equilibrium using novel conserved and Lyapunov quantities.

Contribution

It introduces new analytical tools, including an extended Kullback-Leibler divergence and Lyapunov functions, to characterize complex learning dynamics with asymmetric memory.

Findings

01

Strategies converge to Nash when X exploits Y

02

Y's deviation from equilibrium increases X's exploitability

03

Global convergence to Nash is supported by theoretical and experimental evidence

Abstract

This study examines the global behavior of dynamics in learning in games between two players, X and Y. We consider the simplest situation for memory asymmetry between two players: X memorizes the other Y's previous action and uses reactive strategies, while Y has no memory. Although this memory complicates their learning dynamics, we characterize the global behavior of such complex dynamics by discovering and analyzing two novel quantities. One is an extended Kullback-Leibler divergence from the Nash equilibrium, a well-known conserved quantity from previous studies. The other is a family of Lyapunov functions of X's reactive strategy. One of the global behaviors we capture is that if X exploits Y, then their strategies converge to the Nash equilibrium. Another is that if Y's strategy is out of equilibrium, then X becomes more exploitative with time. Consequently, we suggest global…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGuidance and Control Systems