Bulk Spectra of Truncated Sample Covariance Matrices
Subhroshekhar Ghosh, Soumendu Sundar Mukherjee, Himasish Talukdar

TL;DR
This paper analyzes the spectral properties of truncated sample covariance matrices, modeled as matrix-valued U-statistics, revealing their bulk spectra and connecting them to generalized Marčenko-Pastur laws and free probability.
Contribution
It provides a complete description of the bulk spectra of matrix-valued U-statistics in the null setting, extending to kernelized random matrices with dependent Laplacian matrices.
Findings
Bulk spectra characterized via Stieltjes transforms.
Connections established to generalized Marčenko-Pastur laws.
Applicable to broader class of kernelized random matrices.
Abstract
Determinantal Point Processes (DPPs), which originate from quantum and statistical physics, are known for modelling diversity. Recent research [Ghosh and Rigollet (2020)] has demonstrated that certain matrix-valued -statistics (that are truncated versions of the usual sample covariance matrix) can effectively estimate parameters in the context of Gaussian DPPs and enhance dimension reduction techniques, outperforming standard methods like PCA in clustering applications. This paper explores the spectral properties of these matrix-valued -statistics in the null setting of an isotropic design. These matrices may be represented as , where is a data matrix and is the Laplacian matrix of a random geometric graph associated to . The main mathematically interesting twist here is that the matrix is dependent on . We give complete descriptions of the bulk…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Advanced Mathematical Theories and Applications · Random Matrices and Applications
