A General Framework For Consistency of Principal Component Analysis
Dan Shen, Haipeng Shen, J. S. Marron

TL;DR
This paper introduces a comprehensive asymptotic framework for analyzing the consistency of PCA, revealing how dimension, sample size, and eigenvalue spikes influence its performance across various scenarios.
Contribution
It develops a unified framework that encompasses existing asymptotic regimes and explores new scenarios, providing insights into PCA's consistency and convergence rates under general models.
Findings
Dimension hampers PCA consistency
Sample size and spike strength promote PCA consistency
The framework clarifies relationships among key factors affecting PCA
Abstract
A general asymptotic framework is developed for studying consis- tency properties of principal component analysis (PCA). Our frame- work includes several previously studied domains of asymptotics as special cases and allows one to investigate interesting connections and transitions among the various domains. More importantly, it enables us to investigate asymptotic scenarios that have not been considered before, and gain new insights into the consistency, subspace consistency and strong inconsistency regions of PCA and the boundaries among them. We also establish the corresponding convergence rate within each region. Under general spike covariance models, the dimension (or the number of variables) discourages the consistency of PCA, while the sample size and spike information (the relative size of the population eigenvalues) encourages PCA consistency. Our framework nicely illustrates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlind Source Separation Techniques · Theoretical and Computational Physics · Spectroscopy and Chemometric Analyses
