Towards Better Understanding of Adaptive Gradient Algorithms in   Generative Adversarial Nets

Mingrui Liu; Youssef Mroueh; Jerret Ross; Wei Zhang; Xiaodong Cui,; Payel Das; Tianbao Yang

arXiv:1912.11940·math.OC·December 29, 2020·21 cites

Towards Better Understanding of Adaptive Gradient Algorithms in Generative Adversarial Nets

Mingrui Liu, Youssef Mroueh, Jerret Ross, Wei Zhang, Xiaodong Cui,, Payel Das, Tianbao Yang

PDF

Open Access

TL;DR

This paper analyzes adaptive gradient algorithms in min-max problems like GANs, establishing new theoretical complexity bounds and demonstrating their empirical advantages over non-adaptive methods.

Contribution

It introduces an adaptive variant of Optimistic Stochastic Gradient with improved complexity bounds for non-convex non-concave min-max optimization, a novel theoretical contribution.

Findings

01

Adaptive algorithms outperform non-adaptive ones in GAN training.

02

Empirical evidence shows slow growth rate of cumulative stochastic gradient.

03

Theoretical analysis provides new complexity bounds for adaptive methods.

Abstract

Adaptive gradient algorithms perform gradient-based updates using the history of gradients and are ubiquitous in training deep neural networks. While adaptive gradient methods theory is well understood for minimization problems, the underlying factors driving their empirical success in min-max problems such as GANs remain unclear. In this paper, we aim at bridging this gap from both theoretical and empirical perspectives. First, we analyze a variant of Optimistic Stochastic Gradient (OSG) proposed in~\citep{daskalakis2017training} for solving a class of non-convex non-concave min-max problem and establish $O (ϵ^{- 4})$ complexity for finding $ϵ$ -first-order stationary point, in which the algorithm only requires invoking one stochastic first-order oracle while enjoying state-of-the-art iteration complexity achieved by stochastic extragradient method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Neural Network Applications

MethodsAdaGrad · Convolution · Dogecoin Customer Service Number +1-833-534-1729