Optimal rank-based testing for principal components
Marc Hallin, Davy Paindaveine, Thomas Verdebout

TL;DR
This paper develops optimal parametric and rank-based tests for eigenvectors and eigenvalues of covariance and scatter matrices in elliptical distributions, broadening applicability beyond Gaussian assumptions.
Contribution
It introduces new rank-based tests for principal components that are valid even when covariance matrices do not exist, extending classical Gaussian-based methods.
Findings
Rank-based tests outperform traditional methods in validity and efficiency.
Proposed tests are applicable to broader classes of scatter matrices.
Theoretical results on curved experiments are of independent interest.
Abstract
This paper provides parametric and rank-based optimal tests for eigenvectors and eigenvalues of covariance or scatter matrices in elliptical families. The parametric tests extend the Gaussian likelihood ratio tests of Anderson (1963) and their pseudo-Gaussian robustifications by Davis (1977) and Tyler (1981, 1983). The rank-based tests address a much broader class of problems, where covariance matrices need not exist and principal components are associated with more general scatter matrices. The proposed tests are shown to outperform daily practice both from the point of view of validity as from the point of view of efficiency. This is achieved by utilizing the Le Cam theory of locally asymptotically normal experiments, in the nonstandard context, however, of a curved parametrization. The results we derive for curved experiments are of independent interest, and likely to apply in other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
