TL;DR
This paper introduces scaled subgradient methods for low-rank matrix recovery that achieve fast, robust convergence independent of the matrix's condition number and dimension, even with corrupted data.
Contribution
It proposes a novel nonsmooth, nonconvex optimization approach using scaled subgradients that overcomes ill-conditioning and robustness issues in low-rank matrix estimation.
Findings
Convergence rate is nearly dimension-free and independent of condition number.
Effective in robust low-rank matrix sensing and quadratic sampling.
Achieves state-of-the-art guarantees under certain restricted isometry conditions.
Abstract
Many problems in data science can be treated as estimating a low-rank matrix from highly incomplete, sometimes even corrupted, observations. One popular approach is to resort to matrix factorization, where the low-rank matrix factors are optimized via first-order methods over a smooth loss function, such as the residual sum of squares. While tremendous progresses have been made in recent years, the natural smooth formulation suffers from two sources of ill-conditioning, where the iteration complexity of gradient descent scales poorly both with the dimension as well as the condition number of the low-rank matrix. Moreover, the smooth formulation is not robust to corruptions. In this paper, we propose scaled subgradient methods to minimize a family of nonsmooth and nonconvex formulations -- in particular, the residual sum of absolute errors -- which is guaranteed to converge at a fast…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
