On optimality of empirical risk minimization in linear aggregation
Adrien Saumard

TL;DR
This paper investigates the limitations of the small-ball condition for certain function classes in empirical risk minimization and proposes refined methods to achieve optimal convergence rates in linear aggregation, especially for Fourier dictionaries.
Contribution
It identifies the poor behavior of the small-ball condition for localized functions and develops refined techniques to attain optimal convergence rates for ERM with Fourier dictionaries.
Findings
Small-ball condition can be suboptimal for wavelets and trigonometric polynomials.
Optimal convergence rates are recovered for ERM with trigonometric function dictionaries.
Refined small-ball arguments lead to precise concentration results in the bounded case.
Abstract
In the first part of this paper, we show that the small-ball condition, recently introduced by Mendelson (2015), may behave poorly for important classes of localized functions such as wavelets, piecewise polynomials or trigonometric polynomials, in particular leading to suboptimal estimates of the rate of convergence of ERM for the linear aggregation problem. In a second part, we recover optimal rates of covergence for the excess risk of ERM when the dictionary is made of trigonometric functions. Considering the bounded case, we derive the concentration of the excess risk around a single point, which is an information far more precise than the rate of convergence. In the general setting of a L2 noise, we finally refine the small ball argument by rightly selecting the directions we are looking at, in such a way that we obtain optimal rates of aggregation for the Fourier dictionary.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Portfolio Optimization · Sparse and Compressive Sensing Techniques · Statistical Methods and Inference
