A dichotomous behavior of Guttman-Kaiser criterion from equi-correlated normal population
Yohji Akama, Atina Husnaqilati

TL;DR
This paper analyzes the asymptotic behavior of the Guttman-Kaiser criterion in high-dimensional normal populations with equi-correlation, revealing a dichotomous outcome depending on the correlation coefficient.
Contribution
It provides a rigorous theoretical foundation for the behavior of the Guttman-Kaiser criterion in large, equi-correlated normal populations, connecting spectral distribution limits to factor retention rules.
Findings
For $ ho>0$, the criterion retains few variables, aligning with classical expectations.
For $ ho=0$, the criterion retains about half of the variables, matching simulation results.
The spectral distribution converges to the Marčenko-Pastur law under the specified conditions.
Abstract
We consider a -dimensional, centered normal population such that all variables have a positive variance and any correlation coefficient between different variables is a given nonnegative constant . Suppose that both the sample size and population dimension tend to infinity with . We prove that the limiting spectral distribution of a sample correlation matrix is Mar\v{c}enko-Pastur distribution of index and scale parameter . By the limiting spectral distributions, we rigorously show the limiting behavior of widespread stopping rules Guttman-Kaiser criterion and cumulative-percentage-of-variation rule in PCA and EFA. As a result, we establish the following dichotomous behavior of Guttman-Kaiser criterion when both and are large, but is small: (1) the criterion retains a small number of variables for , as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Consumer Market Behavior and Pricing · Statistical Methods and Bayesian Inference
