On the ability of neural nets to express distributions
Holden Lee, Rong Ge, Tengyu Ma, Andrej Risteski, Sanjeev Arora

TL;DR
This paper explores the expressive power of deep neural networks in modeling complex data distributions, providing theoretical criteria for their approximation capabilities using Fourier analysis and composition of functions.
Contribution
It introduces a Fourier-based criterion for neural network approximability of functions and distributions, extending Barron's Theorem to compositions of functions for deep networks.
Findings
Neural networks can approximate compositions of Barron functions with more layers.
A Fourier criterion determines when a distribution can be approximated in Wasserstein distance.
Composition of Barron functions exceeds the expressivity of single Barron functions.
Abstract
Deep neural nets have caused a revolution in many classification tasks. A related ongoing revolution -- also theoretically not understood -- concerns their ability to serve as generative models for complicated types of data such as images and texts. These models are trained using ideas like variational autoencoders and Generative Adversarial Networks. We take a first cut at explaining the expressivity of multilayer nets by giving a sufficient criterion for a function to be approximable by a neural network with hidden layers. A key ingredient is Barron's Theorem \cite{Barron1993}, which gives a Fourier criterion for approximability of a function by a neural network with 1 hidden layer. We show that a composition of functions which satisfy certain Fourier conditions ("Barron functions") can be approximated by a -layer neural network. For probability distributions, this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Model Reduction and Neural Networks
