Robust Regression with Adaptive Contamination in Response: Optimal Rates and Computational Barriers
Ilias Diakonikolas, Chao Gao, Daniel M. Kane, Ankit Pensia, Dong Xie

TL;DR
This paper investigates robust regression with clean covariates under an adaptive contamination model, demonstrating improved estimation rates and fundamental computational barriers compared to classical models.
Contribution
It introduces an estimator exploiting clean covariates for better rates, establishes minimax lower bounds, and reveals computational-statistical gaps in robust regression.
Findings
Achieves better estimation rates than Huber's model with contaminated responses.
Establishes minimax lower bounds matching the upper bounds.
Demonstrates strong computational-statistical gaps via lower bounds.
Abstract
We study robust regression under a contamination model in which covariates are clean while the responses may be corrupted in an adaptive manner. Unlike the classical Huber's contamination model, where both covariates and responses may be contaminated and consistent estimation is impossible when the contamination proportion is a non-vanishing constant, it turns out that the clean-covariate setting admits strictly improved statistical guarantees. Specifically, we show that the additional information in the clean covariates can be carefully exploited to construct an estimator that achieves a better estimation rate than that attainable under Huber contamination. In contrast to the Huber model, this improved rate implies consistency even when the contamination is a constant. A matching minimax lower bound is established using Fano's inequality together with the construction of contamination…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
