Convergence of Learning Dynamics in Stackelberg Games

Tanner Fiez; Benjamin Chasnov; Lillian J. Ratliff

arXiv:1906.01217·cs.GT·December 7, 2024·45 cites

Convergence of Learning Dynamics in Stackelberg Games

Tanner Fiez, Benjamin Chasnov, Lillian J. Ratliff

PDF

Open Access 1 Repo

TL;DR

This paper analyzes the convergence properties of learning dynamics in Stackelberg games, establishing conditions for equilibrium convergence and proposing algorithms with proven convergence guarantees, including applications to training GANs.

Contribution

It introduces new gradient-based algorithms for Stackelberg games with convergence guarantees and applies these methods to improve training of generative adversarial networks.

Findings

01

Stable critical points correspond to Stackelberg equilibria in zero-sum games.

02

Proposed algorithms converge to Stackelberg equilibria under certain conditions.

03

Numerical experiments validate theoretical results and improve GAN training.

Abstract

This paper investigates the convergence of learning dynamics in Stackelberg games. In the class of games we consider, there is a hierarchical game being played between a leader and a follower with continuous action spaces. We establish a number of connections between the Nash and Stackelberg equilibrium concepts and characterize conditions under which attracting critical points of simultaneous gradient descent are Stackelberg equilibria in zero-sum games. Moreover, we show that the only stable critical points of the Stackelberg gradient dynamics are Stackelberg equilibria in zero-sum games. Using this insight, we develop a gradient-based update for the leader while the follower employs a best response strategy for which each stable critical point is guaranteed to be a Stackelberg equilibrium in zero-sum games. As a result, the learning rule provably converges to a Stackelberg equilibria…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fiezt/Stackelberg-Code
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods · Generative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics