Fast rates for empirical risk minimization over c\`adl\`ag functions with bounded sectional variation norm
Aur\'elien F. Bibaut, Mark J. van der Laan

TL;DR
This paper establishes fast convergence rates for empirical risk minimization over cdlg functions with bounded sectional variation norm, highlighting its suitability for high-dimensional nonparametric regression.
Contribution
It introduces a novel approach to bound bracketing entropy for cdlg functions, deriving explicit convergence rates that depend only logarithmically on dimension.
Findings
Empirical risk minimizers achieve convergence rate of O_P(n^{-1/3} (\,log n)^{2(d-1)/3} a_n).
The method is effective in high-dimensional settings due to minimal dimension dependence.
Results apply to nonparametric regression with sub-exponential errors under random design.
Abstract
Empirical risk minimization over classes functions that are bounded for some version of the variation norm has a long history, starting with Total Variation Denoising (Rudin et al., 1992), and has been considered by several recent articles, in particular Fang et al., 2019 and van der Laan, 2015. In this article, we consider empirical risk minimization over the class of c\`adl\`ag functions over with bounded sectional variation norm (also called Hardy-Krause variation). We show how a certain representation of functions in allows to bound the bracketing entropy of sieves of , and therefore derive rates of convergence in nonparametric function estimation. Specifically, for sieves whose growth is controlled by some rate , we show that the empirical risk minimizer has rate of convergence .…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Mathematical Approximation and Integration · Machine Learning and Algorithms
