Approximation Rates of Shallow Neural Networks: Barron Spaces, Activation Functions and Optimality Analysis
Jian Lu, Xiaohuang Huang

TL;DR
This paper analyzes the approximation capabilities of shallow neural networks with exponential power activation functions, revealing their limitations and optimal rates within Barron and Sobolev spaces, and highlighting the impact of dimension and smoothness.
Contribution
It provides a detailed analysis of approximation rates for ReLU$^{k}$ activations, establishing optimal bounds and clarifying the influence of smoothness and dimensionality on neural network approximation.
Findings
Optimal approximation rates are established for functions in Barron and Sobolev spaces.
ReLU$^{k}$ activations cannot achieve optimal rates under certain coefficient bounds.
The curse of dimensionality is confirmed in the approximation capabilities of shallow neural networks.
Abstract
This paper investigates the approximation properties of shallow neural networks with activation functions that are powers of exponential functions. It focuses on the dependence of the approximation rate on the dimension and the smoothness of the function being approximated within the Barron function space. We examine the approximation rates of ReLU activation functions, proving that the optimal rate cannot be achieved under -bounded coefficients or insufficient smoothness conditions. We also establish optimal approximation rates in various norms for functions in Barron spaces and Sobolev spaces, confirming the curse of dimensionality. Our results clarify the limits of shallow neural networks' approximation capabilities and offer insights into the selection of activation functions and network structures.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Neural Networks and Applications · Model Reduction and Neural Networks
