High-dimensional regression with noisy and missing data: Provable guarantees with nonconvexity
Po-Ling Loh, Martin J. Wainwright

TL;DR
This paper develops provable guarantees for high-dimensional sparse linear regression with noisy, missing, or dependent data, demonstrating that simple algorithms can efficiently find near-global solutions.
Contribution
It introduces novel estimators and analyzes their statistical and computational properties, proving convergence and error bounds in nonconvex settings with noisy or missing data.
Findings
Projected gradient descent converges to a small neighborhood of global minimizers.
Nonasymptotic bounds hold with high probability for noisy, missing, and dependent data.
Simulations confirm theoretical predictions and scalings.
Abstract
Although the standard formulations of prediction problems involve fully-observed and noiseless data drawn in an i.i.d. manner, many applications involve noisy and/or missing data, possibly involving dependence, as well. We study these issues in the context of high-dimensional sparse linear regression, and propose novel estimators for the cases of noisy, missing and/or dependent data. Many standard approaches to noisy or missing data, such as those using the EM algorithm, lead to optimization problems that are inherently nonconvex, and it is difficult to establish theoretical guarantees on practical algorithms. While our approach also involves optimizing nonconvex programs, we are able to both analyze the statistical error associated with any global optimum, and more surprisingly, to prove that a simple algorithm based on projected gradient descent will converge in polynomial time to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Statistical Methods and Inference · Photoacoustic and Ultrasonic Imaging
