Sparse Principal Component Analysis with missing observations
Karim Lounici

TL;DR
This paper addresses the challenge of estimating the leading sparse principal component in high-dimensional data with missing observations, proposing new theoretical bounds and adaptive, computationally feasible methods.
Contribution
It establishes the first information-theoretic lower bound for sparse PCA with missing data and introduces adaptive procedures that do not require prior sparsity knowledge.
Findings
Proposed a simple, adaptive estimator achieving near-optimal rates.
Derived the first lower bounds for sparse PCA with missing observations.
Developed a data-driven, computationally feasible method for approximately low-rank covariance matrices.
Abstract
In this paper, we study the problem of sparse Principal Component Analysis (PCA) in the high-dimensional setting with missing observations. Our goal is to estimate the first principal component when we only have access to partial observations. Existing estimation techniques are usually derived for fully observed data sets and require a prior knowledge of the sparsity of the first principal component in order to achieve good statistical guarantees. Our contributions is threefold. First, we establish the first information-theoretic lower bound for the sparse PCA problem with missing observations. Second, we propose a simple procedure that does not require any prior knowledge on the sparsity of the unknown first principal component or any imputation of the missing observations, adapts to the unknown sparsity of the first principal component and achieves the optimal rate of estimation up to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Statistical Methods and Inference
