Risk Bounds for High-dimensional Ridge Function Combinations Including Neural Networks
Jason M. Klusowski, Andrew R. Barron

TL;DR
This paper derives risk bounds for high-dimensional ridge function combinations, including neural networks, showing small estimation error even when input dimension exceeds sample size, by analyzing penalized estimators with various smooth ridge functions.
Contribution
It provides new risk bounds for ridge function estimators in high-dimensional settings, including neural networks, with bounds that hold even when the input dimension is very large.
Findings
Risk bounds depend on spectral norm and sample size.
Estimates remain accurate even when input dimension exceeds sample size.
Bounds improve over traditional rates when dimension is large.
Abstract
Let be a function on with an assumption of a spectral norm . For various noise settings, we show that , where is the sample size and is either a penalized least squares estimator or a greedily obtained version of such using linear combinations of sinusoidal, sigmoidal, ramp, ramp-squared or other smooth ridge functions. The candidate fits may be chosen from a continuum of functions, thus avoiding the rigidity of discretizations of the parameter space. On the other hand, if the candidate fits are chosen from a discretization, we show that . This work bridges non-linear and non-parametric function estimation and includes single-hidden…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Probabilistic and Robust Engineering Design · Image and Signal Denoising Methods
