# Sparse Regression via Range Counting

**Authors:** Jean Cardinal, Aur\'elien Ooms

arXiv: 1908.00351 · 2020-01-01

## TL;DR

This paper presents a novel randomized approximation algorithm for sparse regression, achieving sub-$n^k$ runtime, and extends the approach to related regression problems, improving efficiency and avoiding preprocessing.

## Contribution

Introduces the first sub-$n^k$ time randomized approximation algorithm for sparse regression with near-optimal complexity, and adapts it to linear and convex regression tasks.

## Key findings

- Achieves $O(n^{k-1} 	ext{polylog} n)$ runtime for approximation.
- Matches conditional lower bounds up to polylogarithmic factors.
- Provides a deterministic exact algorithm for fixed small $k$ and $d$.

## Abstract

The sparse regression problem, also known as best subset selection problem, can be cast as follows: Given a set $S$ of $n$ points in $\mathbb{R}^d$, a point $y\in \mathbb{R}^d$, and an integer $2 \leq k \leq d$, find an affine combination of at most $k$ points of $S$ that is nearest to $y$. We describe a $O(n^{k-1} \log^{d-k+2} n)$-time randomized $(1+\varepsilon)$-approximation algorithm for this problem with \(d\) and \(\varepsilon\) constant. This is the first algorithm for this problem running in time $o(n^k)$. Its running time is similar to the query time of a data structure recently proposed by Har-Peled, Indyk, and Mahabadi (ICALP'18), while not requiring any preprocessing. Up to polylogarithmic factors, it matches a conditional lower bound relying on a conjecture about affine degeneracy testing. In the special case where $k = d = O(1)$, we also provide a simple $O_\delta(n^{d-1+\delta})$-time deterministic exact algorithm, for any \(\delta > 0\). Finally, we show how to adapt the approximation algorithm for the sparse linear regression and sparse convex regression problems with the same running time, up to polylogarithmic factors.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1908.00351/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/1908.00351/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/1908.00351/full.md

---
Source: https://tomesphere.com/paper/1908.00351