Sparse PCA via Covariance Thresholding
Yash Deshpande, Andrea Montanari

TL;DR
This paper proves that a covariance thresholding algorithm can reliably recover sparse principal components in high-dimensional settings, outperforming previous methods and matching conjectured optimal support recovery thresholds.
Contribution
The paper provides a rigorous proof that covariance thresholding correctly recovers support for sparse PCA up to a support size of order support size, matching conjectures and extending results to higher rank and smaller sample sizes.
Findings
Proved covariance thresholding recovers support for s_0 up to order support size.
Established bounds on the norm of kernel random matrices in new regimes.
Supported conjecture that the method is optimal under computational constraints.
Abstract
In sparse principal component analysis we are given noisy observations of a low-rank matrix of dimension and seek to reconstruct it under additional sparsity assumptions. In particular, we assume here each of the principal components has at most non-zero entries. We are particularly interested in the high dimensional regime wherein is comparable to, or even much larger than . In an influential paper, \cite{johnstone2004sparse} introduced a simple algorithm that estimates the support of the principal vectors by the largest entries in the diagonal of the empirical covariance. This method can be shown to identify the correct support with high probability if , and to fail with high probability if for two constants . Despite a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Random Matrices and Applications · Blind Source Separation Techniques
