A general approximation lower bound in $L^p$ norm, with applications to feed-forward neural networks
El Mehdi Achour (IMT), Armand Foucault (IMT), S\'ebastien Gerchinovitz, (IMT), Fran\c{c}ois Malgouyres (IMT)

TL;DR
This paper establishes a general lower bound on the approximation capabilities of neural networks in $L^p$ norm, revealing fundamental limits and comparing approximation in $L^p$ versus sup norm.
Contribution
It introduces a novel lower bound framework for neural network approximation in $L^p$ norm, applicable to various function classes and measures, and clarifies differences between $L^p$ and sup norm approximations.
Findings
Lower bounds depend on packing number, range, and fat-shattering dimension.
Application to piecewise-polynomial neural networks and specific function classes.
Matching upper bounds up to log factors and insights into $L^p$ vs sup norm approximation.
Abstract
We study the fundamental limits to the expressive power of neural networks. Given two sets , of real-valued functions, we first prove a general lower bound on how well functions in can be approximated in norm by functions in , for any and any probability measure . The lower bound depends on the packing number of , the range of , and the fat-shattering dimension of . We then instantiate this bound to the case where corresponds to a piecewise-polynomial feed-forward neural network, and describe in details the application to two sets : H{\"o}lder balls and multivariate monotonic functions. Beside matching (known or new) upper bounds up to log factors, our lower bounds shed some light on the similarities or differences between approximation in norm or in sup norm, solving an open question by DeVore et al. (2021). Our proof…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Algorithms · Advanced Neural Network Applications · Neural Networks and Applications
