A New Two-Sample Test for Covariance Matrices in High Dimensions: U-Statistics Meet Leading Eigenvalues
Thomas Lam, Nina D\"ornemann, Holger Dette

TL;DR
This paper introduces a novel high-dimensional two-sample test for covariance matrices that combines Frobenius norm and leading eigenvalue statistics, offering improved sensitivity to various alternatives.
Contribution
It develops a hybrid test using Fisher's method that integrates two different statistics and proves their asymptotic independence, enhancing detection capabilities in high-dimensional settings.
Findings
The test controls the significance level asymptotically.
It is consistent against both sparse and dense alternatives.
Numerical results show superior performance over existing methods.
Abstract
We propose a two-sample test for covariance matrices in the high-dimensional regime, where the dimension diverges proportionally to the sample size. Our hybrid test combines a Frobenius-norm-based statistic as considered in Li and Chen (2012) with the leading eigenvalue approach proposed in Zhang et al. (2022), making it sensitive to both dense and sparse alternatives. The two statistics are combined via Fisher's method, leveraging our key theoretical result: a joint central limit theorem showing the asymptotic independence of the leading eigenvalues of the sample covariance matrix and an estimator of the Frobenius norm of the difference of the two population covariance matrices, under suitable signal conditions. The level of the test can be controlled asymptotically, and we show consistency against certain types of both sparse and dense alternatives. A comprehensive numerical study…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Matrix Theory and Algorithms · Statistical Methods and Inference
