Statistical inference for principal components of spiked covariance matrices
Zhigang Bao, Xiucai Ding, Jingming Wang, Ke Wang

TL;DR
This paper analyzes the asymptotic behavior of eigenvalues and eigenvectors of high-dimensional spiked covariance matrices, deriving joint distributions and proposing improved hypothesis testing methods that outperform existing approaches.
Contribution
It provides a general framework for the joint distribution of eigenvalues and eigenvectors in spiked models, allowing for multiple, weak, and unstructured spikes, and introduces new adaptive hypothesis tests.
Findings
Joint distribution of eigenvalues and eigenvectors derived
Proposed hypothesis tests outperform existing methods
Methods effective for small spikes and large dimensions
Abstract
In this paper, we study the asymptotic behavior of the extreme eigenvalues and eigenvectors of the high dimensional spiked sample covariance matrices, in the supercritical case when a reliable detection of spikes is possible. Especially, we derive the joint distribution of the extreme eigenvalues and the generalized components of the associated eigenvectors, i.e., the projections of the eigenvectors onto arbitrary given direction, assuming that the dimension and sample size are comparably large. In general, the joint distribution is given in terms of linear combinations of finitely many Gaussian and Chi-square variables, with parameters depending on the projection direction and the spikes. Our assumption on the spikes is fully general. First, the strengths of spikes are only required to be slightly above the critical threshold and no upper bound on the strengths is needed. Second,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBlind Source Separation Techniques · Target Tracking and Data Fusion in Sensor Networks · Statistical Mechanics and Entropy
