Compressed Regression
Shuheng Zhou, John Lafferty, Larry Wasserman

TL;DR
This paper investigates the use of random linear compression in high-dimensional sparse regression, establishing conditions for successful model recovery, prediction accuracy, and data privacy preservation.
Contribution
It introduces a framework for sparse regression from compressed data, providing theoretical guarantees for model recovery, prediction, and privacy in the compressed setting.
Findings
Conditions for successful sparse model recovery from compressed data
Asymptotic prediction performance matching oracle models
Information-theoretic bounds on data privacy through compression
Abstract
Recent research has studied the role of sparsity in high dimensional regression and signal reconstruction, establishing theoretical limits for recovering sparse models from sparse data. This line of work shows that -regularized least squares regression can accurately estimate a sparse linear model from noisy examples in dimensions, even if is much larger than . In this paper we study a variant of this problem where the original input variables are compressed by a random linear transformation to examples in dimensions, and establish conditions under which a sparse linear model can be successfully recovered from the compressed data. A primary motivation for this compression procedure is to anonymize the data and preserve privacy by revealing little information about the original data. We characterize the number of random projections that are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
