Asymptotic properties of the first principal component and equality tests of covariance matrices in high-dimension, low-sample-size context
Aki Ishii, Kazuyoshi Yata, Makoto Aoshima

TL;DR
This paper investigates the asymptotic behavior of the first principal component in high-dimensional, low-sample-size data and applies these findings to develop tests for equality of covariance matrices, providing theoretical insights and numerical validation.
Contribution
It introduces an eigenvalue estimator using noise-reduction methodology and derives its asymptotic distribution in HDLSS settings, advancing covariance matrix comparison methods.
Findings
Asymptotic distribution of the largest eigenvalue derived
Confidence intervals for the first contribution ratio constructed
Performance of eigenvalue estimator and tests validated through numerical results
Abstract
A common feature of high-dimensional data is that the data dimension is high, however, the sample size is relatively low. We call such data HDLSS data. In this paper, we study asymptotic properties of the first principal component in the HDLSS context and apply them to equality tests of covariance matrices for high dimensional data sets. We consider HDLSS asymptotic theories as the dimension grows for both the cases when the sample size is fixed and the sample size goes to infinity. We introduce an eigenvalue estimator by the noise-reduction methodology and provide asymptotic distributions of the largest eigenvalue in the HDLSS context. We construct a confidence interval of the first contribution ratio. We give asymptotic properties both for the first PC direction and PC score as well. We apply the findings to equality tests of two covariance matrices in the HDLSS context. We provide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Statistical Methods and Inference · Bayesian Methods and Mixture Models
