Efficient Algorithms for Outlier-Robust Regression
Adam Klivans, Pravesh K. Kothari, Raghu Meka

TL;DR
This paper introduces the first polynomial-time algorithm for outlier-robust linear and polynomial regression that tolerates adversarial corruptions, leveraging the sum-of-squares method under certifiably hypercontractive distributions.
Contribution
It provides a novel polynomial-time algorithm for robust regression against adversarial corruptions using sum-of-squares, applicable to a broad class of distributions, and establishes a lower bound showing distributional assumptions are necessary.
Findings
First polynomial-time algorithm for outlier-robust regression.
Algorithm works under certifiably hypercontractive distributions.
Lower bound shows distributional assumptions are necessary.
Abstract
We give the first polynomial-time algorithm for performing linear or polynomial regression resilient to adversarial corruptions in both examples and labels. Given a sufficiently large (polynomial-size) training set drawn i.i.d. from distribution D and subsequently corrupted on some fraction of points, our algorithm outputs a linear function whose squared error is close to the squared error of the best-fitting linear function with respect to D, assuming that the marginal distribution of D over the input space is \emph{certifiably hypercontractive}. This natural property is satisfied by many well-studied distributions such as Gaussian, strongly log-concave distributions and, uniform distribution on the hypercube among others. We also give a simple statistical lower bound showing that some distributional assumption is necessary to succeed in this setting. These results are the first of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Efficient Algorithms for Outlier-Robust Regression· youtube
Taxonomy
TopicsMachine Learning and Algorithms · Adversarial Robustness in Machine Learning · Sparse and Compressive Sensing Techniques
