Global Convergence of Gradient Descent for Asymmetric Low-Rank Matrix Factorization

Tian Ye; Simon S. Du

arXiv:2106.14289 · math.OC · June 29, 2021 · 6 cites

TL;DR

This paper proves that plain gradient descent, started from random initialization, finds a global minimum of the asymmetric low-rank matrix factorization problem in polynomial time, without artificial modifications to the algorithm.

Contribution

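It provides the first rigorous proof that unmodified, randomly initialized gradient descent converges in polynomial time on this non-convex, non-smooth problem. The proof introduces two analytical techniques: a symmetrization technique and a quantitative perturbation analysis.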

Findings

01

Gradient descent converges to a global minimum in polynomial time.

02

A new symmetrization technique captures the magnitudes of the symmetry and asymmetry.

03

A quantitative perturbation analysis approximates matrix derivatives.

Abstract

We study the asymmetric low-rank factorization problem: \[\min_{\mathbf{U} \in \mathbb{R}^{m \times d}, \mathbf{V} \in \mathbb{R}^{n \times d}} \frac{1}{2}\|\mathbf{U}\mathbf{V}^\top - \mathbf{\Sigma}\|_F^2\] where $\mathbf{\Sigma}$ is a given matrix of size $m \times n$ and rank $d$. This is a canonical problem that admits two difficulties in optimization: 1) non-convexity and 2) non-smoothness (due to unbalancedness of $\mathbf{U}$ and $\mathbf{V}$). This is also a prototype for more complex problems such as asymmetric matrix sensing and matrix completion. Despite being non-convex and non-smooth, it has been observed empirically that the randomly initialized gradient descent algorithm can solve this problem in polynomial time. Existing theories to explain this phenomenon all require artificial modifications of the algorithm, such as adding noise in each iteration and adding a balancing…
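The setting in the abstract can be illustrated with a minimal NumPy sketch of unmodified gradient descent on the objective $\frac{1}{2}\|\mathbf{U}\mathbf{V}^\top - \mathbf{\Sigma}\|_F^2$, using the gradients $(\mathbf{U}\mathbf{V}^\top - \mathbf{\Sigma})\mathbf{V}$ and $(\mathbf{U}\mathbf{V}^\top - \mathbf{\Sigma})^\top\mathbf{U}$. This is not the authors' code. The function names, the step size rule, the initialization scale, and the iteration count are all assumptions chosen for illustration and are not taken from the paper.

```python
import numpy as np


def make_rank_d_matrix(m, n, singular_values, seed=0):
    """Hypothetical helper: build an m x n matrix with the given singular values."""
    rng = np.random.default_rng(seed)
    d = len(singular_values)
    # Orthonormal column bases obtained from reduced QR factorizations.
    A, _ = np.linalg.qr(rng.standard_normal((m, d)))
    B, _ = np.linalg.qr(rng.standard_normal((n, d)))
    return A @ np.diag(singular_values) @ B.T


def loss(U, V, Sigma):
    """Objective from the abstract: 0.5 * ||U V^T - Sigma||_F^2."""
    return 0.5 * np.linalg.norm(U @ V.T - Sigma, "fro") ** 2


def gradient_descent(Sigma, d, step_size=None, init_scale=1e-2,
                     num_iters=5000, seed=0):
    """Plain gradient descent: no added noise, no balancing term.

    step_size, init_scale, and num_iters are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    m, n = Sigma.shape
    if step_size is None:
        # Assumed heuristic: a small step relative to the spectral norm.
        step_size = 0.1 / np.linalg.norm(Sigma, 2)
    # Small random Gaussian initialization of both factors.
    U = init_scale * rng.standard_normal((m, d))
    V = init_scale * rng.standard_normal((n, d))
    for _ in range(num_iters):
        residual = U @ V.T - Sigma
        grad_U = residual @ V        # gradient with respect to U
        grad_V = residual.T @ U      # gradient with respect to V
        # Simultaneous update of both factors.
        U, V = U - step_size * grad_U, V - step_size * grad_V
    return U, V


if __name__ == "__main__":
    Sigma = make_rank_d_matrix(8, 6, [2.0, 1.0], seed=1)
    U, V = gradient_descent(Sigma, d=2)
    print("final loss:", loss(U, V, Sigma))
```

On a small, well-conditioned example like the one in the `__main__` block, the final loss should be close to zero. The sketch only demonstrates the empirical behavior that the paper sets out to explain; it says nothing about the convergence rate proved there.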

Videos

Global Convergence of Gradient Descent for Asymmetric Low-Rank Matrix Factorization · SlidesLive

Taxonomy

Topics: Sparse and Compressive Sensing Techniques · Antenna Design and Optimization · Advanced Adaptive Filtering Techniques