Fast approximation of matrix coherence and statistical leverage
Petros Drineas, Malik Magdon-Ismail, Michael W. Mahoney and, David P. Woodruff

TL;DR
This paper introduces a fast randomized algorithm to approximate statistical leverage scores and coherence of large matrices, significantly reducing computational complexity for applications in matrix approximation and data analysis.
Contribution
The paper presents a novel $O(nd \, \log n)$ time randomized algorithm for relative-error approximation of leverage scores, improving over the traditional $O(nd^2)$ approach.
Findings
Algorithm achieves relative-error approximations efficiently.
Extends to cross-leverage scores and streaming data.
Applicable to matrices with $n \approx d$.
Abstract
The statistical leverage scores of a matrix are the squared row-norms of the matrix containing its (top) left singular vectors and the coherence is the largest leverage score. These quantities are of interest in recently-popular problems such as matrix completion and Nystr\"{o}m-based low-rank matrix approximation as well as in large-scale statistical data analysis applications more generally; moreover, they are of interest since they define the key structural nonuniformity that must be dealt with in developing fast randomized matrix algorithms. Our main result is a randomized algorithm that takes as input an arbitrary matrix , with , and that returns as output relative-error approximations to all of the statistical leverage scores. The proposed algorithm runs (under assumptions on the precise values of and ) in time, as opposed to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Tensor decomposition and applications
