Detecting the large entries of a sparse covariance matrix in sub-quadratic time

Ofer Shwartz; Boaz Nadler

arXiv:1505.03001·stat.CO·November 13, 2018

TL;DR

This paper introduces two randomized algorithms that detect the large entries of an approximately sparse sample covariance matrix in sub-quadratic time, without explicitly computing all $p^{2}$ of its entries.

Contribution

The paper proposes and theoretically analyzes two randomized algorithms that detect the large entries of an approximately sparse covariance matrix, reducing the cost from the $O(np^{2})$ operations needed to form the full sample covariance matrix to $O(np \operatorname{poly}\log p)$ operations.

Findings

01

Both algorithms use only $O(np \operatorname{poly}\log p)$ operations.

02

Conditions on the sample size and on the data distribution are established under which the approximate sparsity assumptions hold.

03

Simulations illustrate the effectiveness of the proposed methods.

Abstract

The covariance matrix of a $p$-dimensional random variable is a fundamental quantity in data analysis. Given $n$ i.i.d. observations, it is typically estimated by the sample covariance matrix, at a computational cost of $O(np^{2})$ operations. When $n, p$ are large, this computation may be prohibitively slow. Moreover, in several contemporary applications, the population matrix is approximately sparse, and only its few large entries are of interest. This raises the following question, at the focus of our work: Assuming approximate sparsity of the covariance matrix, can its large entries be detected much faster, say in sub-quadratic time, without explicitly computing all its $p^{2}$ entries? In this paper, we present and theoretically analyze two randomized algorithms that detect the large entries of an approximately sparse sample covariance matrix using only $O(np \operatorname{poly}\log p)$ …
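
For reference, the $O(np^{2})$ baseline that the abstract contrasts against can be sketched as below. This is only the naive approach (form every entry of the sample covariance matrix, then keep the large ones), not either of the paper's randomized algorithms. The function names `sample_covariance` and `large_entries`, the absolute-value `threshold` rule, and the toy data are assumptions made for illustration.

```python
from itertools import combinations


def sample_covariance(X):
    """Naive sample covariance of n observations (rows) of p variables.

    Every one of the p*(p+1)/2 distinct entries costs O(n), so the total
    cost is O(n p^2), the quadratic baseline mentioned in the abstract.
    """
    n, p = len(X), len(X[0])
    means = [sum(row[j] for row in X) / n for j in range(p)]
    centered = [[row[j] - means[j] for j in range(p)] for row in X]
    S = [[0.0] * p for _ in range(p)]
    for i in range(p):
        for j in range(i, p):
            s = sum(r[i] * r[j] for r in centered) / (n - 1)
            S[i][j] = S[j][i] = s
    return S


def large_entries(S, threshold):
    """Off-diagonal index pairs (i, j), i < j, with |S[i][j]| >= threshold.

    The threshold rule is an illustrative assumption, not the paper's
    definition of a "large" entry.
    """
    p = len(S)
    return [(i, j) for i, j in combinations(range(p), 2)
            if abs(S[i][j]) >= threshold]


# Toy data (assumed): variables 0 and 1 move together, variable 2 does not.
X = [[1, 2, 0], [2, 4, 1], [3, 6, 0], [4, 8, 1]]
S = sample_covariance(X)
print(large_entries(S, threshold=1.0))
```

Because this baseline touches all pairs of variables, its running time grows quadratically in $p$, which is the cost a sub-quadratic method aims to avoid.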
