Consistent Low-Rank Approximation

David P. Woodruff; Samson Zhou

arXiv:2603.02148·cs.DS·March 3, 2026

Consistent Low-Rank Approximation

David P. Woodruff, Samson Zhou

PDF

Open Access 3 Reviews

TL;DR

This paper studies the problem of maintaining low-rank approximations of a matrix as its rows arrive sequentially, aiming to minimize the total change in the solutions while achieving near-optimal approximation quality.

Contribution

It introduces bounds on recourse for both additive and multiplicative approximation goals in the online low-rank approximation setting, with improved bounds for specific cases.

Findings

01

Achieves $ ilde{O}(k/ ext{epsilon})$ recourse for additive approximation.

02

Provides upper bounds of $rac{k^{3/2}}{ ext{epsilon}^2} ext{poly}\log(nd)$ for multiplicative approximation.

03

Establishes lower bounds showing $ ilde{ ext{Omega}}(k/ ext{epsilon})$ recourse is necessary.

Abstract

We introduce and study the problem of consistent low-rank approximation, in which rows of an input matrix $A \in R^{n \times d}$ arrive sequentially and the goal is to provide a sequence of subspaces that well-approximate the optimal rank- $k$ approximation to the submatrix $A^{(t)}$ that has arrived at each time $t$ , while minimizing the recourse, i.e., the overall change in the sequence of solutions. We first show that when the goal is to achieve a low-rank cost within an additive $ε \cdot ∣∣ A^{(t)} ∣ ∣_{F}^{2}$ factor of the optimal cost, roughly $O (\frac{k}{ε} lo g (n d))$ recourse is feasible. For the more challenging goal of achieving a relative $(1 + ε)$ -multiplicative approximation of the optimal rank- $k$ cost, we show that a simple upper bound in this setting is…

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 4Confidence 3

Strengths

- The problem setting is interesting, i.e., studying of the subspace corresponding to streaming updates and understand how subspace can differ for different algorithms is an interesting idea. Mostly because one can imagine having to reconstruct the approximation matrix again and again if the subspace is changing significantly (e.g., as stated for the Frequent directions method). - I have only glossed over the proofs, which are pretty simple, and believe they are correct. Given the authors pres

Weaknesses

- There is a significant body of work on rank-$k$ approximation algorithms. However, only frequent directions has been empirically compared against. I am surprised as why this is the case. - Most of the theoretical contributions are really derivative of prior work. While I really appreciate the problem setting, the contributions are really understanding how the subspace are drifting with time given the subspace approximation algorithm. - Algorithm 2 requires computing SVD at each round in the

Reviewer 02Rating 8Confidence 2

Strengths

The paper introduces a novel model for studying low rank approximation of consistency. Consistent and low recourse algorithms have been studied for other problems in data science thus making low rank approximation a natural problem to study from a theoretical perspective. Moreover the authors show good upper and lower bounds for low rank approximation in this model.

Weaknesses

The paper does not have many weaknesses but one is that although the problem has a natural theoretical motivation, it would be interesting for the authors to discuss more concrete practical motivations for studying low recourse algorithms for low rank approximation.

Reviewer 03Rating 8Confidence 5

Strengths

Standard streaming subspace approximation algorithms like FrequentDirections and Ridge Leverage Score Sampling can have very large recourse as shown theoretically, and empirically on real data. That means they can bounce between solutions. The algorithm is subtle yet simple. It is careful about when to update the estimate with extra care to not to change the subspace too much if it does not have to. It reminds me of distributed streaming algorithms (e.g., https://arxiv.org/abs/1404.7571) t

Weaknesses

Nothing to note.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Tensor decomposition and applications