Compressed Regression

Shuheng Zhou; John Lafferty; Larry Wasserman

arXiv:0706.0534·stat.ML·January 11, 2012

Compressed Regression

Shuheng Zhou, John Lafferty, Larry Wasserman

PDF

TL;DR

This paper investigates the use of random linear compression in high-dimensional sparse regression, establishing conditions for successful model recovery, prediction accuracy, and data privacy preservation.

Contribution

It introduces a framework for sparse regression from compressed data, providing theoretical guarantees for model recovery, prediction, and privacy in the compressed setting.

Findings

01

Conditions for successful sparse model recovery from compressed data

02

Asymptotic prediction performance matching oracle models

03

Information-theoretic bounds on data privacy through compression

Abstract

Recent research has studied the role of sparsity in high dimensional regression and signal reconstruction, establishing theoretical limits for recovering sparse models from sparse data. This line of work shows that $ℓ_{1}$ -regularized least squares regression can accurately estimate a sparse linear model from $n$ noisy examples in $p$ dimensions, even if $p$ is much larger than $n$ . In this paper we study a variant of this problem where the original $n$ input variables are compressed by a random linear transformation to $m ≪ n$ examples in $p$ dimensions, and establish conditions under which a sparse linear model can be successfully recovered from the compressed data. A primary motivation for this compression procedure is to anonymize the data and preserve privacy by revealing little information about the original data. We characterize the number of random projections that are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.