Detecting the large entries of a sparse covariance matrix in sub-quadratic time
Ofer Shwartz, Boaz Nadler

TL;DR
This paper introduces two randomized algorithms that efficiently detect large entries in approximately sparse covariance matrices using sub-quadratic time, significantly speeding up computations in high-dimensional data analysis.
Contribution
The paper proposes and analyzes novel randomized algorithms for fast detection of large entries in sparse covariance matrices, reducing computational complexity from quadratic to sub-quadratic time.
Findings
Algorithms operate in O(np poly log p) time.
Conditions established for sample size and data distribution to ensure sparsity assumptions hold.
Simulations demonstrate the effectiveness of the proposed methods.
Abstract
The covariance matrix of a -dimensional random variable is a fundamental quantity in data analysis. Given i.i.d. observations, it is typically estimated by the sample covariance matrix, at a computational cost of operations. When are large, this computation may be prohibitively slow. Moreover, in several contemporary applications, the population matrix is approximately sparse, and only its few large entries are of interest. This raises the following question, at the focus of our work: Assuming approximate sparsity of the covariance matrix, can its large entries be detected much faster, say in sub-quadratic time, without explicitly computing all its entries? In this paper, we present and theoretically analyze two randomized algorithms that detect the large entries of an approximately sparse sample covariance matrix using only …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
