Chaining Bounds for Empirical Risk Minimization

G\'abor Bal\'azs; Andr\'as Gy\"orgy; Csaba Szepesv\'ari

arXiv:1609.01872·stat.ML·September 8, 2016·2 cites

Chaining Bounds for Empirical Risk Minimization

G\'abor Bal\'azs, Andr\'as Gy\"orgy, Csaba Szepesv\'ari

PDF

Open Access

TL;DR

This paper advances the theoretical understanding of empirical risk minimization by extending chaining bounds to unbounded noise settings, applicable to various loss functions and providing tight sample complexity results.

Contribution

It introduces a generalized chaining technique for excess risk bounds in unbounded noise scenarios, applicable to multiple loss functions and detailed analysis for linear regression.

Findings

01

Bounds apply to many loss functions beyond squared loss

02

Sample complexity bounds are tight up to logarithmic factors

03

Method handles unbounded noise and estimates without kurtosis assumptions

Abstract

This paper extends the standard chaining technique to prove excess risk upper bounds for empirical risk minimization with random design settings even if the magnitude of the noise and the estimates is unbounded. The bound applies to many loss functions besides the squared loss, and scales only with the sub-Gaussian or subexponential parameters without further statistical assumptions such as the bounded kurtosis condition over the hypothesis class. A detailed analysis is provided for slope constrained and penalized linear least squares regression with a sub-Gaussian setting, which often proves tight sample complexity bounds up to logartihmic factors.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Probabilistic and Robust Engineering Design · Mathematical Approximation and Integration