Bi-cross-validation for factor analysis
A. B. Owen, J. Wang

TL;DR
This paper introduces a bi-cross-validation method for selecting the number of factors in factor analysis, outperforming existing methods like parallel analysis and Kaiser's rule, especially in cases with heteroscedastic noise.
Contribution
The authors propose a novel bi-cross-validation approach for factor number selection that is based on hold-out submatrices, improving over traditional subjective and simulation-based methods.
Findings
Bi-cross-validation outperforms parallel analysis and Kaiser's rule.
The method is particularly effective for detecting and estimating large enough factors.
Early stopping regularization enhances signal matrix recovery.
Abstract
Factor analysis is over a century old, but it is still problematic to choose the number of factors for a given data set. The scree test is popular but subjective. The best performing objective methods are recommended on the basis of simulations. We introduce a method based on bi-cross-validation, using randomly held-out submatrices of the data to choose the number of factors. We find it performs better than the leading methods of parallel analysis (PA) and Kaiser's rule. Our performance criterion is based on recovery of the underlying factor-loading (signal) matrix rather than identifying the true number of factors. Like previous comparisons, our work is simulation based. Recent advances in random matrix theory provide principled choices for the number of factors when the noise is homoscedastic, but not for the heteroscedastic case. The simulations we choose are designed using guidance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Theoretical and Computational Physics · Advanced Neuroimaging Techniques and Applications
