A spectral least-squares-type method for heavy-tailed corrupted regression with unknown covariance \& heterogeneous noise
Roberto I. Oliveira, Zoraida F. Rico, Philip Thompson

TL;DR
This paper introduces a computationally efficient spectral least-squares method for robust linear regression in heavy-tailed, corrupted data scenarios with unknown covariance and heterogeneous noise, achieving near-optimal statistical rates.
Contribution
It proposes the first tractable algorithm that attains optimal robustness and statistical guarantees in heavy-tailed corrupted regression with unknown covariance.
Findings
Achieves near-optimal statistical rate with high probability.
Breakdown point is proportional to inverse of data hypercontractivity and condition number.
First algorithm to simultaneously guarantee robustness, efficiency, and optimal statistical performance.
Abstract
We revisit heavy-tailed corrupted least-squares linear regression assuming to have a corrupted -sized label-feature sample of at most arbitrary outliers. We wish to estimate a -dimensional parameter given such sample of a label-feature pair satisfying with heavy-tailed . We only assume is hypercontractive with constant and has covariance matrix with minimum eigenvalue and bounded condition number . The noise can be arbitrarily dependent on and nonsymmetric as long as has finite covariance matrix . We propose a near-optimal computationally tractable estimator, based on the power method, assuming no knowledge on nor the operator norm of . With probability at least , our proposed estimator attains the statistical rate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Statistical Methods and Inference · Advanced Statistical Methods and Models
MethodsLinear Regression
