ERM and RERM are optimal estimators for regression problems when malicious outliers corrupt the labels

Geoffrey Chinot

arXiv:1910.10923·math.ST·September 28, 2020

TL;DR

This paper shows that ERM and RERM are optimal for regression when malicious outliers corrupt the labels: their error bounds remain minimax-rate-optimal under contamination, and the results apply to many non-regularized and regularized procedures.

Contribution

The paper establishes minimax-rate-optimal error bounds for ERM and RERM in contaminated regression settings, extending to heavy-tailed noise and regularized methods.

Findings

1. The $L_{2}$-error rate is bounded by the non-contaminated rate $r_{N}$ plus a contamination term $A L |\mathcal{O}| / N$.

2. Minimax-rate optimality is maintained when $|\mathcal{O}|$ outliers contaminate the labels.

3. The results apply to Huber's M-estimators and kernel methods.

Abstract

We study Empirical Risk Minimizers (ERM) and Regularized Empirical Risk Minimizers (RERM) for regression problems with convex and $L$-Lipschitz loss functions. We consider a setting where $|\mathcal{O}|$ malicious outliers contaminate the labels. In that case, under a local Bernstein condition, we show that the $L_{2}$-error rate is bounded by $r_{N} + A L |\mathcal{O}| / N$, where $N$ is the total number of observations, $r_{N}$ is the $L_{2}$-error rate in the non-contaminated setting and $A$ is a parameter coming from the local Bernstein condition. When $r_{N}$ is minimax-rate-optimal in a non-contaminated setting, the rate $r_{N} + A L |\mathcal{O}| / N$ is also minimax-rate-optimal when $|\mathcal{O}|$ outliers contaminate the label. The main results of the paper can be used for many non-regularized and regularized procedures under weak assumptions on the noise. We present results for Huber's M-estimators (without penalization or…
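The setting in the abstract can be illustrated with a small simulation. The sketch below is not taken from the paper: it assumes a linear model class, Gaussian design and noise, plain gradient descent as the solver, and an arbitrary outlier value of `1e3`, all chosen for illustration. It fits an unpenalized Huber ERM (a convex loss that is Lipschitz with constant `delta`) on data where a fraction $|\mathcal{O}|/N$ of the labels is corrupted, and compares it with least squares, whose loss is not Lipschitz.

```python
import numpy as np

def huber_grad(residual, delta):
    # Derivative of the Huber loss in the residual: the residual clipped to [-delta, delta].
    return np.clip(residual, -delta, delta)

def huber_erm(X, y, delta=1.0, n_iter=500):
    """Hypothetical helper: Huber ERM over linear predictors via gradient descent."""
    n, d = X.shape
    w = np.zeros(d)
    # The Huber loss has curvature at most 1, so the empirical risk is smooth with
    # constant at most lambda_max(X^T X / n); use the inverse as the step size.
    step = 1.0 / np.linalg.eigvalsh(X.T @ X / n)[-1]
    for _ in range(n_iter):
        residual = X @ w - y
        w -= step * X.T @ huber_grad(residual, delta) / n
    return w

# Assumed toy setup: N observations, of which n_outliers labels are corrupted.
rng = np.random.default_rng(0)
N, d, n_outliers = 1000, 5, 50
w_star = np.ones(d)
X = rng.standard_normal((N, d))
y = X @ w_star + 0.5 * rng.standard_normal(N)
y[:n_outliers] = 1e3  # corrupted labels (arbitrary large value, an assumption)

w_huber = huber_erm(X, y)
w_ls = np.linalg.lstsq(X, y, rcond=None)[0]

err_huber = np.linalg.norm(w_huber - w_star)
err_ls = np.linalg.norm(w_ls - w_star)
print(f"Huber ERM error:     {err_huber:.3f}")
print(f"Least squares error: {err_ls:.3f}")
```

Because each corrupted label can move the Huber gradient by at most `delta`, the influence of the outliers scales with their fraction rather than with their magnitude, which is the intuition behind a contamination term proportional to $L |\mathcal{O}| / N$. The simulation only illustrates that behavior; it does not verify the constants or the local Bernstein condition from the paper.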
