Optimal approximation of piecewise smooth functions using deep ReLU neural networks

Philipp Petersen; Felix Voigtlaender

arXiv:1709.05289·math.FA·May 23, 2018

TL;DR

This paper determines how many weights and layers a ReLU neural network needs in order to approximate piecewise smooth functions in $L^{2}$, including in high dimensions, and shows that a minimum depth is necessary to reach the optimal approximation rate.

Contribution

It provides the first optimal bounds on the number of weights and depth needed for ReLU networks to approximate piecewise $C^{β}$ functions in $L^{2}$, including high-dimensional and factorized cases.

Findings

01

Constructed networks achieve optimal approximation rates with minimal weights.

02

The depth required for optimal approximation scales with $β/d$, which shows that depth matters.

03

For factorized functions, the approximation rate depends only on the dimension of the feature space.

Abstract

We study the necessary and sufficient complexity of ReLU neural networks, in terms of depth and number of weights, which is required for approximating classifier functions in $L^{2}$. As a model class, we consider the set $E^{β}(\mathbb{R}^{d})$ of possibly discontinuous piecewise $C^{β}$ functions $f : [-1/2, 1/2]^{d} \to \mathbb{R}$, where the different smooth regions of $f$ are separated by $C^{β}$ hypersurfaces. For dimension $d \geq 2$, regularity $β > 0$, and accuracy $ε > 0$, we construct artificial neural networks with ReLU activation function that approximate functions from $E^{β}(\mathbb{R}^{d})$ up to $L^{2}$ error of $ε$. The constructed networks have a fixed number of layers, depending only on $d$ and $β$, and they have $O(ε^{-2(d-1)/β})$ many nonzero weights, which we prove to be optimal. In addition to the…
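To get a feel for the weight bound quoted in the abstract, the following is a minimal sketch that evaluates the scaling $ε^{-2(d-1)/β}$ numerically. The function names (`weight_exponent`, `weights_order`, `error_order`) are hypothetical and not taken from the paper, and the constant hidden in the $O(\cdot)$ is not given in the abstract, so it is assumed to be 1 here; the numbers show orders of magnitude only, not actual network sizes.

```python
import math


def weight_exponent(d: int, beta: float) -> float:
    """Exponent 2(d-1)/beta appearing in the O(eps^(-2(d-1)/beta)) bound."""
    if d < 2:
        raise ValueError("the abstract assumes dimension d >= 2")
    if beta <= 0:
        raise ValueError("the abstract assumes regularity beta > 0")
    return 2.0 * (d - 1) / beta


def weights_order(eps: float, d: int, beta: float) -> float:
    """Order of the nonzero-weight count for L^2 error eps.

    Assumption: the constant hidden in O(.) is set to 1.
    """
    if eps <= 0:
        raise ValueError("accuracy eps must be positive")
    return eps ** (-weight_exponent(d, beta))


def error_order(n_weights: float, d: int, beta: float) -> float:
    """Invert the bound: order of the L^2 error reachable with n_weights."""
    if n_weights <= 0:
        raise ValueError("n_weights must be positive")
    return n_weights ** (-1.0 / weight_exponent(d, beta))


if __name__ == "__main__":
    # Illustrative values only; d and beta are chosen arbitrarily.
    for d, beta in [(2, 1.0), (3, 2.0), (10, 2.0)]:
        n = weights_order(0.1, d, beta)
        print(f"d={d}, beta={beta}: exponent={weight_exponent(d, beta)}, "
              f"weights ~ {n:.3g} for eps=0.1")
```

Under these assumptions, doubling the regularity $β$ halves the exponent, while each additional dimension adds $2/β$ to it, which is why the weight count grows quickly with $d$ for fixed smoothness.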
