An Improved Approximation Algorithm for the Column Subset Selection Problem
Christos Boutsidis, Michael W. Mahoney, and Petros Drineas

TL;DR
This paper introduces a novel two-stage randomized and deterministic algorithm for the column subset selection problem, achieving improved approximation bounds in Frobenius and spectral norms compared to previous methods.
Contribution
The paper presents a new two-stage algorithm that improves approximation bounds for the column subset selection problem, with probabilistic guarantees and better theoretical performance.
Findings
Achieves a Frobenius norm error within a factor of .8 of the best rank- approximation.
Provides spectral norm bounds that depend on .8 of the Frobenius norm error.
Offers an algorithm with runtime complexity of O(min\u00a0rac{mn^2,m^2n}) for selecting columns.
Abstract
We consider the problem of selecting the best subset of exactly columns from an matrix . We present and analyze a novel two-stage algorithm that runs in time and returns as output an matrix consisting of exactly columns of . In the first (randomized) stage, the algorithm randomly selects columns according to a judiciously-chosen probability distribution that depends on information in the top- right singular subspace of . In the second (deterministic) stage, the algorithm applies a deterministic column-selection procedure to select and return exactly columns from the set of columns selected in the first stage. Let be the matrix containing those columns, let denote the projection matrix onto the span of those columns, and let denote the best rank-…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Optimization and Search Problems · Complexity and Algorithms in Graphs
