Determinant-Based Error Bounds for CUR Matrix Approximation: Oversampling and Volume Sampling
Frank de Hoog, Markus Hegland

TL;DR
This paper introduces determinant-based error bounds for CUR matrix approximation, demonstrating how oversampling improves approximation quality and providing a unified analysis for CUR and Nyström methods.
Contribution
It develops a probabilistic framework using volume sampling to quantify the benefits of oversampling in CUR approximation, connecting local errors to global bounds.
Findings
Expected error reduces linearly with oversampling
Determinant identities relate projection errors to geometric volume
Bounded error by optimal low-rank approximation
Abstract
We derive error bounds for CUR matrix approximation using determinant-based methods that relate local projection errors to global approximation quality. For general matrices, we establish determinant identities for bordered Gramian matrices that decompose CUR approximation errors into interpretable local components. These identities connect projection errors onto submatrix column spaces directly to determinants, providing geometric insight into approximation degradation. We develop a probabilistic framework based on volume sampling that yields interpolation-type error bounds quantifying the benefits of oversampling: when rows are selected for columns, the expected error factor transitions linearly from (no oversampling) to (full oversampling). Our analysis establishes that the expected squared error is bounded by this interpolation factor times the squared…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Matrix Theory and Algorithms
