Consistent Low-Rank Approximation
David P. Woodruff, Samson Zhou

TL;DR
This paper studies the problem of maintaining low-rank approximations of a matrix as its rows arrive sequentially, aiming to minimize the total change in the solutions while achieving near-optimal approximation quality.
Contribution
It introduces bounds on recourse for both additive and multiplicative approximation goals in the online low-rank approximation setting, with improved bounds for specific cases.
Findings
Achieves $ ilde{O}(k/ ext{epsilon})$ recourse for additive approximation.
Provides upper bounds of $rac{k^{3/2}}{ ext{epsilon}^2} ext{poly}\log(nd)$ for multiplicative approximation.
Establishes lower bounds showing $ ilde{ ext{Omega}}(k/ ext{epsilon})$ recourse is necessary.
Abstract
We introduce and study the problem of consistent low-rank approximation, in which rows of an input matrix arrive sequentially and the goal is to provide a sequence of subspaces that well-approximate the optimal rank- approximation to the submatrix that has arrived at each time , while minimizing the recourse, i.e., the overall change in the sequence of solutions. We first show that when the goal is to achieve a low-rank cost within an additive factor of the optimal cost, roughly recourse is feasible. For the more challenging goal of achieving a relative -multiplicative approximation of the optimal rank- cost, we show that a simple upper bound in this setting is…
Peer Reviews
Decision·ICLR 2026 Poster
- The problem setting is interesting, i.e., studying of the subspace corresponding to streaming updates and understand how subspace can differ for different algorithms is an interesting idea. Mostly because one can imagine having to reconstruct the approximation matrix again and again if the subspace is changing significantly (e.g., as stated for the Frequent directions method). - I have only glossed over the proofs, which are pretty simple, and believe they are correct. Given the authors pres
- There is a significant body of work on rank-$k$ approximation algorithms. However, only frequent directions has been empirically compared against. I am surprised as why this is the case. - Most of the theoretical contributions are really derivative of prior work. While I really appreciate the problem setting, the contributions are really understanding how the subspace are drifting with time given the subspace approximation algorithm. - Algorithm 2 requires computing SVD at each round in the
The paper introduces a novel model for studying low rank approximation of consistency. Consistent and low recourse algorithms have been studied for other problems in data science thus making low rank approximation a natural problem to study from a theoretical perspective. Moreover the authors show good upper and lower bounds for low rank approximation in this model.
The paper does not have many weaknesses but one is that although the problem has a natural theoretical motivation, it would be interesting for the authors to discuss more concrete practical motivations for studying low recourse algorithms for low rank approximation.
Standard streaming subspace approximation algorithms like FrequentDirections and Ridge Leverage Score Sampling can have very large recourse as shown theoretically, and empirically on real data. That means they can bounce between solutions. The algorithm is subtle yet simple. It is careful about when to update the estimate with extra care to not to change the subspace too much if it does not have to. It reminds me of distributed streaming algorithms (e.g., https://arxiv.org/abs/1404.7571) t
Nothing to note.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Tensor decomposition and applications
