Learning in Multi-Memory Games Triggers Complex Dynamics Diverging from Nash Equilibrium
Yuma Fujimoto, Kaito Ariu, and Kenshi Abe

TL;DR
This paper extends key learning algorithms to multi-memory zero-sum games, revealing that their dynamics diverge from Nash equilibrium and tend to form heteroclinic cycles, highlighting complex behaviors in strategic learning.
Contribution
It introduces a novel extension of replicator dynamics and gradient ascent to multi-memory games and proves their equivalence, showing divergence from Nash equilibrium.
Findings
Dynamics diverge from Nash equilibrium in multi-memory zero-sum games.
Learning trajectories tend to form heteroclinic cycles.
Theoretical and experimental validation of divergence and cycle formation.
Abstract
Repeated games consider a situation where multiple agents are motivated by their independent rewards throughout learning. In general, the dynamics of their learning become complex. Especially when their rewards compete with each other like zero-sum games, the dynamics often do not converge to their optimum, i.e., the Nash equilibrium. To tackle such complexity, many studies have understood various learning algorithms as dynamical systems and discovered qualitative insights among the algorithms. However, such studies have yet to handle multi-memory games (where agents can memorize actions they played in the past and choose their actions based on their memories), even though memorization plays a pivotal role in artificial intelligence and interpersonal relationship. This study extends two major learning algorithms in games, i.e., replicator dynamics and gradient ascent, into multi-memory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Evolutionary Game Theory and Cooperation
