On the ability of neural nets to express distributions

Holden Lee; Rong Ge; Tengyu Ma; Andrej Risteski; Sanjeev Arora

arXiv:1702.07028·cs.LG·April 20, 2021·27 cites

On the ability of neural nets to express distributions

Holden Lee, Rong Ge, Tengyu Ma, Andrej Risteski, Sanjeev Arora

PDF

Open Access

TL;DR

This paper explores the expressive power of deep neural networks in modeling complex data distributions, providing theoretical criteria for their approximation capabilities using Fourier analysis and composition of functions.

Contribution

It introduces a Fourier-based criterion for neural network approximability of functions and distributions, extending Barron's Theorem to compositions of functions for deep networks.

Findings

01

Neural networks can approximate compositions of Barron functions with more layers.

02

A Fourier criterion determines when a distribution can be approximated in Wasserstein distance.

03

Composition of Barron functions exceeds the expressivity of single Barron functions.

Abstract

Deep neural nets have caused a revolution in many classification tasks. A related ongoing revolution -- also theoretically not understood -- concerns their ability to serve as generative models for complicated types of data such as images and texts. These models are trained using ideas like variational autoencoders and Generative Adversarial Networks. We take a first cut at explaining the expressivity of multilayer nets by giving a sufficient criterion for a function to be approximable by a neural network with $n$ hidden layers. A key ingredient is Barron's Theorem \cite{Barron1993}, which gives a Fourier criterion for approximability of a function by a neural network with 1 hidden layer. We show that a composition of $n$ functions which satisfy certain Fourier conditions ("Barron functions") can be approximated by a $n + 1$ -layer neural network. For probability distributions, this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Model Reduction and Neural Networks