Provable Approximations for Constrained $\ell_p$ Regression

Ibrahim Jubran; David Cohn; Dan Feldman

arXiv:1902.10407·cs.LG·February 28, 2019·1 cites

Provable Approximations for Constrained $\ell_p$ Regression

Ibrahim Jubran, David Cohn, Dan Feldman

PDF

Open Access

TL;DR

This paper introduces the first provable constant factor approximation algorithm for constrained p regression that is efficient, handles outliers, and is applicable to streaming and distributed data scenarios.

Contribution

It presents a novel approximation algorithm for constrained p regression that is provably effective, efficient, and adaptable to large-scale and streaming data.

Findings

01

Provides the first provable constant factor approximation for constrained p regression.

02

Achieves an p regression solution in nearly linear time with core-sets.

03

Demonstrates effectiveness through experiments and open-source implementation.

Abstract

The $ℓ_{p}$ linear regression problem is to minimize $f (x) = ∣∣ A x - b ∣ ∣_{p}$ over $x \in R^{d}$ , where $A \in R^{n \times d}$ , $b \in R^{n}$ , and $p > 0$ . To avoid overfitting and bound $∣∣ x ∣ ∣_{2}$ , the constrained $ℓ_{p}$ regression minimizes $f (x)$ over every unit vector $x \in R^{d}$ . This makes the problem non-convex even for the simplest case $d = p = 2$ . Instead, ridge regression is used to minimize the Lagrange form $f (x) + λ ∣∣ x ∣ ∣_{2}$ over $x \in R^{d}$ , which yields a convex problem in the price of calibrating the regularization parameter $λ > 0$ . We provide the first provable constant factor approximation algorithm that solves the constrained $ℓ_{p}$ regression directly, for every constant $p, d \geq 1$ . Using core-sets, its running time is $O (n lo g n)$ including extensions for streaming and distributed (big) data. In polynomial time, it can handle…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Control Systems and Identification · Statistical and numerical algorithms

MethodsLinear Regression