On the Estimation of Coherence
Mehryar Mohri, Ameet Talwalkar

TL;DR
This paper introduces an efficient algorithm to estimate matrix coherence from limited data, enabling better low-rank approximations and sampling strategies in large-scale machine learning tasks.
Contribution
The paper presents a novel, theoretically analyzed algorithm for estimating matrix coherence from few columns, improving practical low-rank approximation methods.
Findings
The algorithm accurately estimates coherence across diverse datasets.
Coherence estimates strongly predict sampling-based approximation success.
Theoretical bounds are supported by extensive experiments.
Abstract
Low-rank matrix approximations are often used to help scale standard machine learning algorithms to large-scale problems. Recently, matrix coherence has been used to characterize the ability to extract global information from a subset of matrix entries in the context of these low-rank approximations and other sampling-based algorithms, e.g., matrix com- pletion, robust PCA. Since coherence is defined in terms of the singular vectors of a matrix and is expensive to compute, the practical significance of these results largely hinges on the following question: Can we efficiently and accurately estimate the coherence of a matrix? In this paper we address this question. We propose a novel algorithm for estimating coherence from a small number of columns, formally analyze its behavior, and derive a new coherence-based matrix approximation bound based on this analysis. We then present…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Tensor decomposition and applications · Stochastic Gradient Optimization Techniques
