Support Recovery in Sparse PCA with Non-Random Missing Data
Hanbyul Lee, Qifan Song, Jean Honorio

TL;DR
This paper presents a semidefinite relaxation algorithm for support recovery in sparse PCA with non-random missing data, providing theoretical guarantees and empirical validation under various structural conditions.
Contribution
The paper introduces a novel semidefinite relaxation approach for sparse PCA with incomplete data, with theoretical support for support recovery under general sampling schemes.
Findings
High-probability support recovery under certain spectral gap and noise conditions
Algorithm outperforms existing methods on synthetic and real data
Theoretical results extend to deterministic sampling schemes
Abstract
We analyze a practical algorithm for sparse PCA on incomplete and noisy data under a general non-random sampling scheme. The algorithm is based on a semidefinite relaxation of the -regularized PCA problem. We provide theoretical justification that under certain conditions, we can recover the support of the sparse leading eigenvector with high probability by obtaining a unique solution. The conditions involve the spectral gap between the largest and second-largest eigenvalues of the true data matrix, the magnitude of the noise, and the structural properties of the observed entries. The concepts of algebraic connectivity and irregularity are used to describe the structural properties of the observed entries. We empirically justify our theorem with synthetic and real data analysis. We also show that our algorithm outperforms several other sparse PCA approaches especially when the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Blind Source Separation Techniques · Random Matrices and Applications
MethodsPrincipal Components Analysis
