Augmented sparse principal component analysis for high dimensional data

Debashis Paul; Iain M. Johnstone

arXiv:1202.1242·math.ST·February 7, 2012·71 cites

Augmented sparse principal component analysis for high dimensional data

Debashis Paul, Iain M. Johnstone

PDF

Open Access

TL;DR

This paper investigates the estimation of leading eigenvectors in high-dimensional covariance matrices, establishing convergence bounds, proposing a sparsity-aware estimator, and comparing its performance to traditional PCA.

Contribution

It introduces an augmented sparse PCA method with optimal convergence rates under sparsity constraints and compares it to standard PCA.

Findings

01

Proposed estimator achieves optimal convergence rate under sparsity.

02

Lower bounds on convergence rates established for eigenvector estimators.

03

Standard PCA can attain minimax rates in certain scenarios.

Abstract

We study the problem of estimating the leading eigenvectors of a high-dimensional population covariance matrix based on independent Gaussian observations. We establish lower bounds on the rates of convergence of the estimators of the leading eigenvectors under $l^{q}$ -sparsity constraints when an $l^{2}$ loss function is used. We also propose an estimator of the leading eigenvectors based on a coordinate selection scheme combined with PCA and show that the proposed estimator achieves the optimal rate of convergence under a sparsity regime. Moreover, we establish that under certain scenarios, the usual PCA achieves the minimax convergence rate.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Statistical Methods and Inference · Blind Source Separation Techniques