Beyond Procrustes: Balancing-Free Gradient Descent for Asymmetric   Low-Rank Matrix Sensing

Cong Ma; Yuanxin Li; Yuejie Chi

arXiv:2101.05113·eess.SP·April 7, 2021

Beyond Procrustes: Balancing-Free Gradient Descent for Asymmetric Low-Rank Matrix Sensing

Cong Ma, Yuanxin Li, Yuejie Chi

PDF

TL;DR

This paper demonstrates that for asymmetric low-rank matrix sensing, gradient descent with spectral initialization converges linearly without explicit regularization, as the factors naturally stay balanced during optimization, simplifying the approach.

Contribution

The paper provides a theoretical justification that balancing regularization is unnecessary in asymmetric matrix sensing when using gradient descent with spectral initialization.

Findings

01

Gradient descent converges linearly without balancing regularization.

02

Factors remain balanced automatically during the optimization process.

03

Analysis based on a new distance metric accounting for invertible transform ambiguity.

Abstract

Low-rank matrix estimation plays a central role in various applications across science and engineering. Recently, nonconvex formulations based on matrix factorization are provably solved by simple gradient descent algorithms with strong computational and statistical guarantees. However, when the low-rank matrices are asymmetric, existing approaches rely on adding a regularization term to balance the scale of the two matrix factors which in practice can be removed safely without hurting the performance when initialized via the spectral method. In this paper, we provide a theoretical justification to this for the matrix sensing problem, which aims to recover a low-rank matrix from a small number of linear measurements. As long as the measurement ensemble satisfies the restricted isometry property, gradient descent -- in conjunction with spectral initialization -- converges linearly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.