Conjugate Gradient Method for Generative Adversarial Networks

Hiroki Naganuma; Hideaki Iiduka

arXiv:2203.14495·cs.LG·February 22, 2023

Conjugate Gradient Method for Generative Adversarial Networks

Hiroki Naganuma, Hideaki Iiduka

PDF

Open Access 1 Repo

TL;DR

This paper introduces a conjugate gradient method for training GANs, providing convergence guarantees and demonstrating improved performance over traditional optimizers like SGD and Adam in generating high-quality images.

Contribution

It proposes a novel conjugate gradient approach for solving the local Nash equilibrium in GAN training, with theoretical convergence analysis and empirical performance improvements.

Findings

01

Converges to local Nash equilibrium under mild assumptions.

02

Outperforms SGD and momentum SGD in FID scores.

03

Performs better than Adam on average in experiments.

Abstract

One of the training strategies of generative models is to minimize the Jensen--Shannon divergence between the model distribution and the data distribution. Since data distribution is unknown, generative adversarial networks (GANs) formulate this problem as a game between two models, a generator and a discriminator. The training can be formulated in the context of game theory and the local Nash equilibrium (LNE). It does not seem feasible to derive guarantees of stability or optimality for the existing methods. This optimization problem is far more challenging than the single objective setting. Here, we use the conjugate gradient method to reliably and efficiently solve the LNE problem in GANs. We give a proof and convergence analysis under mild assumptions showing that the proposed method converges to a LNE with three different learning rate update rules, including a constant learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hiroki11x/conjugategradient_gan
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Gaussian Processes and Bayesian Inference

MethodsAdam · Stochastic Gradient Descent