Sparse PCA via Covariance Thresholding

Yash Deshpande; Andrea Montanari

arXiv:1311.5179 · math.ST · April 27, 2016 · J. Mach. Learn. Res. · 45 cites

TL;DR

This paper proves that a covariance thresholding algorithm can reliably recover sparse principal components in high-dimensional settings, outperforming previous methods and matching conjectured optimal support recovery thresholds.

Contribution

The paper gives a rigorous proof that covariance thresholding correctly recovers the support in sparse PCA, with high probability, for support sizes $s_{0}$ up to order $\sqrt{n}$. This confirms an earlier conjecture and extends the guarantee to higher rank and to sample sizes $n$ much smaller than $p$.

Findings

01 Proved that covariance thresholding recovers the support for $s_{0}$ up to order $\sqrt{n}$.

02 Established bounds on the norm of kernel random matrices in regimes not previously considered.

03 Supported the conjecture that no computationally efficient method can do significantly better.

Abstract

In sparse principal component analysis we are given noisy observations of a low-rank matrix of dimension $n \times p$ and seek to reconstruct it under additional sparsity assumptions. In particular, we assume here that each of the principal components $v_{1}, \dots, v_{r}$ has at most $s_{0}$ non-zero entries. We are particularly interested in the high-dimensional regime wherein $p$ is comparable to, or even much larger than, $n$. In an influential paper, Johnstone and Lu (2004) introduced a simple algorithm that estimates the support of the principal vectors $v_{1}, \dots, v_{r}$ by the largest entries in the diagonal of the empirical covariance. This method can be shown to identify the correct support with high probability if $s_{0} \leq K_{1} \sqrt{n / \log p}$, and to fail with high probability if $s_{0} \geq K_{2} \sqrt{n / \log p}$, for two constants $0 < K_{1}, K_{2} < \infty$. Despite a…
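The diagonal-based support estimate described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the function names, the rank-one spiked data model, and all parameter values below are assumptions chosen for illustration only, and the sketch covers only the baseline diagonal method, not covariance thresholding itself.

```python
import numpy as np


def diagonal_thresholding_support(X, s0):
    """Estimate the support as the s0 coordinates with the largest
    diagonal entries of the empirical covariance X^T X / n."""
    n, p = X.shape
    # Diagonal of the empirical covariance: per-coordinate second moments.
    diag = np.einsum("ij,ij->j", X, X) / n
    # Indices of the s0 largest diagonal entries, returned in sorted order.
    top = np.argpartition(diag, -s0)[-s0:]
    return np.sort(top)


def simulate_spiked_data(n, p, support, beta, rng):
    """Hypothetical rank-one spiked model (an assumption for this sketch):
    row i is sqrt(beta) * u_i * v + z_i, with v supported on `support`."""
    v = np.zeros(p)
    v[support] = 1.0 / np.sqrt(len(support))
    u = rng.standard_normal(n)
    Z = rng.standard_normal((n, p))
    return np.sqrt(beta) * np.outer(u, v) + Z


# Illustrative parameters only; they are not taken from the paper.
rng = np.random.default_rng(0)
true_support = np.arange(5)
X = simulate_spiked_data(n=2000, p=200, support=true_support, beta=10.0, rng=rng)
estimated = diagonal_thresholding_support(X, s0=5)
```

Under this assumed model the coordinates in the support have variance $1 + \beta / s_{0} = 3$ while all others have variance $1$, so with $n = 2000$ samples the largest diagonal entries separate cleanly. The sketch only illustrates the mechanism; shrinking $n$ or enlarging $s_{0}$ narrows the gap between the two groups of diagonal entries.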

Taxonomy

Topics: Sparse and Compressive Sensing Techniques · Random Matrices and Applications · Blind Source Separation Techniques