Statistical Inference for Linear Functions of Eigenvectors with Small Eigengaps
Joshua Agterberg

TL;DR
This paper develops statistical inference methods for linear functions of eigenvectors in high-dimensional models with small eigengaps, providing Gaussian approximations and optimal confidence intervals without sample splitting.
Contribution
It introduces new Gaussian approximation results and confidence interval procedures for eigenvector functionals in challenging small eigengap regimes, filling a gap in existing spectral analysis literature.
Findings
Gaussian approximation for linear forms of eigenvectors under small eigengaps
Proposed confidence intervals are minimax optimal up to constants
Intervals can be computed directly from data without sample-splitting
Abstract
Spectral methods have myriad applications in high-dimensional statistics and data science, and while previous works have primarily focused on or eigenvector and singular vector perturbation theory, in many settings these analyses fall short of providing the fine-grained guarantees required for various inferential tasks. In this paper we study statistical inference for linear functions of eigenvectors and principal components with a particular emphasis on the setting where gaps between eigenvalues may be extremely small relative to the corresponding spiked eigenvalue, a regime which has been oft-neglected in the literature. First, we prove the approximate Gaussianity for debiased linear forms in the matrix denoising model and the spiked principal component analysis model, both under Gaussian noise. Based on this limiting behavior, we propose estimators for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
