Distribution of singular values in large sample cross-covariance matrices
Arabind Swain, Sean Alexander Ridout, Ilya Nemenman

TL;DR
This paper derives the probability distribution of singular values of large cross-covariance matrices with Gaussian entries, extending classical results to better assess statistical significance of cross-correlations in data analysis.
Contribution
It extends the Marchenko-Pastur law to the singular values of empirical cross-covariance matrices for large Gaussian datasets.
Findings
Distribution derived for different parameter regimes.
Provides a theoretical basis for significance testing of cross-correlations.
Enhances understanding of spectral properties of large cross-covariance matrices.
Abstract
For two large matrices and with Gaussian i.i.d.\ entries and dimensions and , respectively, we derive the probability distribution of the singular values of in different parameter regimes. This extends the Marchenko-Pastur result for the distribution of eigenvalues of empirical sample covariance matrices to singular values of empirical cross-covariances. Our results will help to establish statistical significance of cross-correlations in many data-science applications.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Sensory Analysis and Statistical Methods · Random Matrices and Applications
