Provable Approximations for Constrained $\ell_p$ Regression
Ibrahim Jubran, David Cohn, Dan Feldman

TL;DR
This paper introduces the first provable constant factor approximation algorithm for constrained p regression that is efficient, handles outliers, and is applicable to streaming and distributed data scenarios.
Contribution
It presents a novel approximation algorithm for constrained p regression that is provably effective, efficient, and adaptable to large-scale and streaming data.
Findings
Provides the first provable constant factor approximation for constrained p regression.
Achieves an p regression solution in nearly linear time with core-sets.
Demonstrates effectiveness through experiments and open-source implementation.
Abstract
The linear regression problem is to minimize over , where , , and . To avoid overfitting and bound , the constrained regression minimizes over every unit vector . This makes the problem non-convex even for the simplest case . Instead, ridge regression is used to minimize the Lagrange form over , which yields a convex problem in the price of calibrating the regularization parameter . We provide the first provable constant factor approximation algorithm that solves the constrained regression directly, for every constant . Using core-sets, its running time is including extensions for streaming and distributed (big) data. In polynomial time, it can handle…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Control Systems and Identification · Statistical and numerical algorithms
MethodsLinear Regression
