Low-Rank Matrix Recovery with Scaled Subgradient Methods: Fast and   Robust Convergence Without the Condition Number

Tian Tong; Cong Ma; Yuejie Chi

arXiv:2010.13364·cs.LG·April 23, 2021

Low-Rank Matrix Recovery with Scaled Subgradient Methods: Fast and Robust Convergence Without the Condition Number

Tian Tong, Cong Ma, Yuejie Chi

PDF

2 Repos

TL;DR

This paper introduces scaled subgradient methods for low-rank matrix recovery that achieve fast, robust convergence independent of the matrix's condition number and dimension, even with corrupted data.

Contribution

It proposes a novel nonsmooth, nonconvex optimization approach using scaled subgradients that overcomes ill-conditioning and robustness issues in low-rank matrix estimation.

Findings

01

Convergence rate is nearly dimension-free and independent of condition number.

02

Effective in robust low-rank matrix sensing and quadratic sampling.

03

Achieves state-of-the-art guarantees under certain restricted isometry conditions.

Abstract

Many problems in data science can be treated as estimating a low-rank matrix from highly incomplete, sometimes even corrupted, observations. One popular approach is to resort to matrix factorization, where the low-rank matrix factors are optimized via first-order methods over a smooth loss function, such as the residual sum of squares. While tremendous progresses have been made in recent years, the natural smooth formulation suffers from two sources of ill-conditioning, where the iteration complexity of gradient descent scales poorly both with the dimension as well as the condition number of the low-rank matrix. Moreover, the smooth formulation is not robust to corruptions. In this paper, we propose scaled subgradient methods to minimize a family of nonsmooth and nonconvex formulations -- in particular, the residual sum of absolute errors -- which is guaranteed to converge at a fast…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.