Bootstraps Regularize Singular Correlation Matrices
Christian Bongiorno

TL;DR
This paper demonstrates that averaging multiple bootstrapped correlation matrices can effectively regularize singular matrices, ensuring positive-definiteness especially when the number of bootstraps exceeds a certain threshold related to data dimensions.
Contribution
It provides an analytical proof that bootstrapping can rapidly regularize singular correlation matrices, with explicit bounds on the number of bootstraps needed for positive-definiteness.
Findings
Average of k bootstrapped correlation matrices becomes positive-definite as k increases.
The required number of bootstraps is less than the number of objects n.
Method is applicable in high-dimensional data fields like finance and genetics.
Abstract
I show analytically that the average of bootstrapped correlation matrices rapidly becomes positive-definite as increases, which provides a simple approach to regularize singular Pearson correlation matrices. If is the number of objects and the number of features, the averaged correlation matrix is almost surely positive-definite if in the limit of large and . The probability of obtaining a positive-definite correlation matrix with bootstraps is also derived for finite and . Finally, I demonstrate that the number of required bootstraps is always smaller than . This method is particularly relevant in fields where is orders of magnitude larger than the size of data points , e.g., in finance, genetics, social science, or image processing.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Fractal and DNA sequence analysis · Face and Expression Recognition
