Learning Distributions by Generative Adversarial Networks: Approximation and Generalization
Yunfei Yang

TL;DR
This paper analyzes how well GANs learn probability distributions from finite samples, providing convergence rates and error bounds based on neural network approximation capabilities and statistical learning theory, especially for low-dimensional data.
Contribution
The paper introduces a new oracle inequality for GAN error decomposition and derives explicit bounds on discriminator and generator approximation errors, advancing understanding of GAN convergence.
Findings
Convergence rates depend on intrinsic data dimension, not ambient dimension.
Error bounds are established for approximating Hölder functions with ReLU networks.
GANs achieve favorable rates when data concentrates on low-dimensional sets.
Abstract
We study how well generative adversarial networks (GAN) learn probability distributions from finite samples by analyzing the convergence rates of these models. Our analysis is based on a new oracle inequality that decomposes the estimation error of GAN into the discriminator and generator approximation errors, generalization error and optimization error. To estimate the discriminator approximation error, we establish error bounds on approximating H\"older functions by ReLU neural networks, with explicit upper bounds on the Lipschitz constant of the network or norm constraint on the weights. For generator approximation error, we show that neural network can approximately transform a low-dimensional source distribution to a high-dimensional target distribution and bound such approximation error by the width and depth of neural network. Combining the approximation results with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Adversarial Robustness in Machine Learning
