Recursive Reasoning in Minimax Games: A Level $k$ Gradient Play Method
Zichu Liu, Lacra Pavel

TL;DR
This paper introduces the Level $k$ Gradient Play algorithm for stabilizing minimax game training, demonstrating convergence properties and practical advantages in GAN training with reduced computational resources.
Contribution
The paper proposes a novel recursive reasoning algorithm for minimax games that converges asymptotically and improves GAN training efficiency without complex heuristics.
Findings
Lv.$k$ GP converges to accurate strategy estimation as $k$ increases
Lv.$ ext{infinity}$ GP generalizes provably convergent dynamics
Achieves state-of-the-art GAN performance with fewer resources
Abstract
Despite the success of generative adversarial networks (GANs) in generating visually appealing images, they are notoriously challenging to train. In order to stabilize the learning dynamics in minimax games, we propose a novel recursive reasoning algorithm: Level Gradient Play (Lv. GP) algorithm. In contrast to many existing algorithms, our algorithm does not require sophisticated heuristics or curvature information. We show that as increases, Lv. GP converges asymptotically towards an accurate estimation of players' future strategy. Moreover, we justify that Lv. GP naturally generalizes a line of provably convergent game dynamics which rely on predictive updates. Furthermore, we provide its local convergence property in nonconvex-nonconcave zero-sum games and global convergence in bilinear and quadratic games. By combining Lv. GP with Adam optimizer, our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Artificial Intelligence in Games · Model Reduction and Neural Networks
Methods((Reservation@Faqs))How do I cancel a reservation on Expedia? · Six Ways To Communicate To Someone At Expedia Via Phone And Email's. · BigGAN · *Communicated@Fast*How Do I Communicate to Expedia? · Dense Connections · Feedforward Network · Softmax · 1x1 Convolution · Convolution · Batch Normalization
