New Subset Selection Algorithms for Low Rank Approximation: Offline and Online
David P. Woodruff, Taisuke Yasuda

TL;DR
This paper develops new algorithms for subset selection in low-rank matrix approximation, achieving near-optimal trade-offs for various loss functions in both offline and online settings, with significant theoretical improvements.
Contribution
It introduces nearly optimal offline bicriteria algorithms for entrywise losses and the first online algorithms for b5_pb5_p subspace and low-rank approximation, advancing the theoretical understanding.
Findings
Improved bicriteria algorithms for entrywise b5_pb5_p losses, tight for b5_1.
First oblivious b5_pb5_p subspace embeddings for 1<p<2 with nearly optimal distortion.
First online algorithms for b5_pb5_p subspace and low-rank approximation.
Abstract
Subset selection for the rank approximation of an matrix offers improvements in the interpretability of matrices, as well as a variety of computational savings. This problem is well-understood when the error measure is the Frobenius norm, with various tight algorithms known even in challenging models such as the online model, where an algorithm must select the column subset irrevocably when the columns arrive one by one. In contrast, for other matrix losses, optimal trade-offs between the subset size and approximation quality have not been settled, even in the offline setting. We give a number of results towards closing these gaps. In the offline setting, we achieve nearly optimal bicriteria algorithms in two settings. First, we remove a factor from a result of [SWZ19] when the loss function is any entrywise loss with an approximate triangle inequality…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Machine Learning and Algorithms · Blind Source Separation Techniques
