Active Linear Regression for $\ell_p$ Norms and Beyond

Cameron Musco; Christopher Musco; David P. Woodruff; Taisuke Yasuda

arXiv:2111.04888·cs.LG·September 28, 2022

Active Linear Regression for $\ell_p$ Norms and Beyond

Cameron Musco, Christopher Musco, David P. Woodruff, Taisuke Yasuda

PDF

Open Access

TL;DR

This paper develops optimal active sampling algorithms for $\, ext{ell}_p$ norm linear regression across all $p$ ranges, improving sample complexity bounds and extending to robust and polynomial growth loss functions, with applications in subspace approximation and dimension reduction.

Contribution

It introduces the first near-optimal active sampling bounds for $\, ext{ell}_p$ regression for all $p$, and establishes new sensitivity bounds for polynomial loss functions, enabling efficient algorithms for robust regression.

Findings

01

Optimal query complexity bounds for $0<p<2$ and $2<p<\infty$ $\, ext{ell}_p$ regression.

02

First total sensitivity bound for polynomial growth loss functions.

03

Sublinear time algorithms for Kronecker product regression under all $p$ norms.

Abstract

We study active sampling algorithms for linear regression, which aim to query only a few entries of a target vector $b \in R^{n}$ and output a near minimizer to $min_{x \in R^{d}} ∥ A x - b ∥$ , for a design matrix $A \in R^{n \times d}$ and loss $∥ \cdot ∥$ . For $p$ norm regression for any $0 < p < \infty$ , we give an algorithm based on Lewis weight sampling outputting a $(1 + ϵ)$ -approximate solution using just $\tilde{O} (d / ϵ^{2})$ queries to $b$ for $p \in (0, 1)$ , $\tilde{O} (d / ϵ)$ queries for $1 < p < 2$ , and $\tilde{O} (d^{p /2} / ϵ^{p})$ queries for $2 < p < \infty$ . For $0 < p < 2$ , our bounds are optimal up to log factors, settling the query complexity for this range. For $2 < p < \infty$ , our dependence on $d$ is optimal, while our dependence on $ϵ$ is off by at most $ϵ$ , up to log factors. Our result resolves an open question of [CD21], who gave near…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Markov Chains and Monte Carlo Methods · Complexity and Algorithms in Graphs

MethodsHuber loss