$\alpha$-GAN: Convergence and Estimation Guarantees
Gowtham R. Kurri, Monica Welfert, Tyler Sypherd, Lalitha Sankar

TL;DR
This paper establishes theoretical convergence guarantees for $oldsymbol{ extalpha}$-GANs, a family of generative adversarial networks interpolating various divergence measures, and demonstrates practical benefits of tuning the $oldsymbol{ extalpha}$ hyperparameter.
Contribution
It provides a theoretical link between min-max GAN optimization and $f$-divergences, introduces $oldsymbol{ extalpha}$-GANs interpolating multiple divergences, and offers empirical insights on hyperparameter tuning.
Findings
$oldsymbol{ extalpha}$-GANs converge for all $oldsymbol{ extalpha}$ values.
Estimation bounds vary with $oldsymbol{ extalpha}$ under finite samples.
Tuning $oldsymbol{ extalpha}$ improves practical performance.
Abstract
We prove a two-way correspondence between the min-max optimization of general CPE loss function GANs and the minimization of associated -divergences. We then focus on -GAN, defined via the -loss, which interpolates several GANs (Hellinger, vanilla, Total Variation) and corresponds to the minimization of the Arimoto divergence. We show that the Arimoto divergences induced by -GAN equivalently converge, for all . However, under restricted learning models and finite samples, we provide estimation bounds which indicate diverse GAN behavior as a function of . Finally, we present empirical results on a toy dataset that highlight the practical utility of tuning the hyperparameter.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Statistical Methods and Inference · Statistical Mechanics and Entropy
MethodsCollaborative Preference Embedding
