Optimal approximation of piecewise smooth functions using deep ReLU neural networks
Philipp Petersen, Felix Voigtlaender

TL;DR
This paper establishes the optimal complexity and depth requirements for ReLU neural networks to efficiently approximate piecewise smooth functions in high-dimensional spaces, demonstrating the necessity of depth for optimal approximation.
Contribution
It provides the first optimal bounds on the number of weights and depth needed for ReLU networks to approximate piecewise $C^eta$ functions in $L^2$, including high-dimensional and factorized cases.
Findings
Constructed networks achieve optimal approximation rates with minimal weights.
Depth requirement for optimal approximation scales with $eta/d$, showing depth's importance.
Approximation rate depends only on the feature space dimension in factorized functions.
Abstract
We study the necessary and sufficient complexity of ReLU neural networks---in terms of depth and number of weights---which is required for approximating classifier functions in . As a model class, we consider the set of possibly discontinuous piecewise functions , where the different smooth regions of are separated by hypersurfaces. For dimension , regularity , and accuracy , we construct artificial neural networks with ReLU activation function that approximate functions from up to error of . The constructed networks have a fixed number of layers, depending only on and , and they have many nonzero weights, which we prove to be optimal. In addition to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods*Communicated@Fast*How Do I Communicate to Expedia?
