Learning without Concentration for General Loss Functions

Shahar Mendelson

arXiv:1410.3192·stat.ML·October 14, 2014

Learning without Concentration for General Loss Functions

Shahar Mendelson

PDF

TL;DR

This paper develops a framework for empirical risk minimization with general convex loss functions that achieves sharp error rates even in heavy-tailed or non-concentrated scenarios, by analyzing intrinsic complexity and interactions.

Contribution

It introduces a novel analysis that provides sharp error bounds without relying on concentration, applicable to heavy-tailed data and outlier robustness.

Findings

01

Error rates depend on class complexity and noise level.

02

Sharp bounds are obtained without concentration assumptions.

03

Method handles heavy-tailed and outlier-affected data.

Abstract

We study prediction and estimation problems using empirical risk minimization, relative to a general convex loss function. We obtain sharp error rates even when concentration is false or is very restricted, for example, in heavy-tailed scenarios. Our results show that the error rate depends on two parameters: one captures the intrinsic complexity of the class, and essentially leads to the error rate in a noise-free (or realizable) problem; the other measures interactions between class members the target and the loss, and is dominant when the problem is far from realizable. We also explain how one may deal with outliers by choosing the loss in a way that is calibrated to the intrinsic complexity of the class and to the noise-level of the problem (the latter is measured by the distance between the target and the class).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.