Three-Player Game Training Dynamics
Kenneth Christofferson, Fernando J. Yanez

TL;DR
This paper investigates the dynamics of three-player game training, revealing conditions for convergence, the impact of update order, and demonstrating that a novel maximizer-first update order can lead to convergence on Nash equilibria.
Contribution
It introduces a new update order for three-player games, maximizer-first, which can achieve convergence on Nash equilibria, unlike traditional update schemes.
Findings
Three-player games often do not converge to Nash equilibrium.
Maximizer-first update order enables convergence to Nash equilibrium.
Maximizer-first updates outperform other orders with various momentum settings.
Abstract
This work explores three-player game training dynamics, under what conditions three-player games converge and the equilibria the converge on. In contrast to prior work, we examine a three-player game architecture in which all players explicitly interact with each other. Prior work analyzes games in which two of three agents interact with only one other player, constituting dual two-player games. We explore three-player game training dynamics using an extended version of a simplified bilinear smooth game, called a simplified trilinear smooth game. We find that trilinear games do not converge on the Nash equilibrium in most cases, rather converging on a fixed point which is optimal for two players, but not for the third. Further, we explore how the order of the updates influences convergence. In addition to alternating and simultaneous updates, we explore a new update…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum many-body systems · Advanced Bandit Algorithms Research · Cold Atom Physics and Bose-Einstein Condensates
