Revisiting the Nystrom Method for Improved Large-Scale Machine Learning
Alex Gittens, Michael W. Mahoney

TL;DR
This paper empirically evaluates randomized low-rank approximation algorithms for SPSD matrices, compares sampling and projection methods, and provides improved theoretical bounds to guide large-scale machine learning applications.
Contribution
It offers a comprehensive empirical comparison of sampling and projection methods and introduces improved theoretical bounds for these algorithms.
Findings
Sampling and projection methods have complementary strengths.
Data preprocessing significantly affects algorithm performance.
Existing theory poorly predicts practical performance.
Abstract
We reconsider randomized algorithms for the low-rank approximation of symmetric positive semi-definite (SPSD) matrices such as Laplacian and kernel matrices that arise in data analysis and machine learning applications. Our main results consist of an empirical evaluation of the performance quality and running time of sampling and projection methods on a diverse suite of SPSD matrices. Our results highlight complementary aspects of sampling versus projection methods; they characterize the effects of common data preprocessing steps on the performance of these algorithms; and they point to important differences between uniform sampling and nonuniform sampling methods based on leverage scores. In addition, our empirical results illustrate that existing theory is so weak that it does not provide even a qualitative guide to practice. Thus, we complement our empirical results with a suite of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods
