High Dimensional Correlation Matrices: CLT and Its Applications
Jiti Gao, Xiao Han, Guangming Pan, Yanrong Yang

TL;DR
This paper establishes a new CLT for spectral statistics of high-dimensional correlation matrices, enabling improved independence testing and factor analysis in large datasets, with demonstrated practical effectiveness.
Contribution
It introduces a novel CLT for linear spectral statistics of high-dimensional correlation matrices, advancing theoretical understanding and practical testing methods.
Findings
New CLT for spectral statistics in high dimensions
Effective independence and factor loadings tests
Empirical validation on household income data
Abstract
Statistical inferences for sample correlation matrices are important in high dimensional data analysis. Motivated by this, this paper establishes a new central limit theorem (CLT) for a linear spectral statistic (LSS) of high dimensional sample correlation matrices for the case where the dimension p and the sample size are comparable. This result is of independent interest in large dimensional random matrix theory. Meanwhile, we apply the linear spectral statistic to an independence test for random variables, and then an equivalence test for p factor loadings and factors in a factor model. The finite sample performance of the proposed test shows its applicability and effectiveness in practice. An empirical application to test the independence of household incomes from different cities in China is also conducted.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
