Convergence of Learning Dynamics in Stackelberg Games
Tanner Fiez, Benjamin Chasnov, Lillian J. Ratliff

TL;DR
This paper analyzes the convergence properties of learning dynamics in Stackelberg games, establishing conditions for equilibrium convergence and proposing algorithms with proven convergence guarantees, including applications to training GANs.
Contribution
It introduces new gradient-based algorithms for Stackelberg games with convergence guarantees and applies these methods to improve training of generative adversarial networks.
Findings
Stable critical points correspond to Stackelberg equilibria in zero-sum games.
Proposed algorithms converge to Stackelberg equilibria under certain conditions.
Numerical experiments validate theoretical results and improve GAN training.
Abstract
This paper investigates the convergence of learning dynamics in Stackelberg games. In the class of games we consider, there is a hierarchical game being played between a leader and a follower with continuous action spaces. We establish a number of connections between the Nash and Stackelberg equilibrium concepts and characterize conditions under which attracting critical points of simultaneous gradient descent are Stackelberg equilibria in zero-sum games. Moreover, we show that the only stable critical points of the Stackelberg gradient dynamics are Stackelberg equilibria in zero-sum games. Using this insight, we develop a gradient-based update for the leader while the follower employs a best response strategy for which each stable critical point is guaranteed to be a Stackelberg equilibrium in zero-sum games. As a result, the learning rule provably converges to a Stackelberg equilibria…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarkov Chains and Monte Carlo Methods · Generative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics
