Recursive Reasoning in Minimax Games: A Level $k$ Gradient Play Method

Zichu Liu; Lacra Pavel

arXiv:2210.16482·cs.LG·November 1, 2022·1 cites

Recursive Reasoning in Minimax Games: A Level $k$ Gradient Play Method

Zichu Liu, Lacra Pavel

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces the Level $k$ Gradient Play algorithm for stabilizing minimax game training, demonstrating convergence properties and practical advantages in GAN training with reduced computational resources.

Contribution

The paper proposes a novel recursive reasoning algorithm for minimax games that converges asymptotically and improves GAN training efficiency without complex heuristics.

Findings

01

Lv.$k$ GP converges to accurate strategy estimation as $k$ increases

02

Lv.$ ext{infinity}$ GP generalizes provably convergent dynamics

03

Achieves state-of-the-art GAN performance with fewer resources

Abstract

Despite the success of generative adversarial networks (GANs) in generating visually appealing images, they are notoriously challenging to train. In order to stabilize the learning dynamics in minimax games, we propose a novel recursive reasoning algorithm: Level $k$ Gradient Play (Lv. $k$ GP) algorithm. In contrast to many existing algorithms, our algorithm does not require sophisticated heuristics or curvature information. We show that as $k$ increases, Lv. $k$ GP converges asymptotically towards an accurate estimation of players' future strategy. Moreover, we justify that Lv. $\infty$ GP naturally generalizes a line of provably convergent game dynamics which rely on predictive updates. Furthermore, we provide its local convergence property in nonconvex-nonconcave zero-sum games and global convergence in bilinear and quadratic games. By combining Lv. $k$ GP with Adam optimizer, our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zichuliu/submission
pytorchOfficial

Videos

Recursive Reasoning in Minimax Games: A Level $k$ Gradient Play Method· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Artificial Intelligence in Games · Model Reduction and Neural Networks

Methods((Reservation@Faqs))How do I cancel a reservation on Expedia? · Six Ways To Communicate To Someone At Expedia Via Phone And Email's. · BigGAN · *Communicated@Fast*How Do I Communicate to Expedia? · Dense Connections · Feedforward Network · Softmax · 1x1 Convolution · Convolution · Batch Normalization