Preconditioned Gradient Descent for Over-Parameterized Nonconvex Matrix   Factorization

Gavin Zhang; Salar Fattahi; Richard Y. Zhang

arXiv:2504.09708·math.OC·April 15, 2025·5 cites

Preconditioned Gradient Descent for Over-Parameterized Nonconvex Matrix Factorization

Gavin Zhang, Salar Fattahi, Richard Y. Zhang

PDF

Open Access 1 Video

TL;DR

This paper introduces PrecGD, a preconditioned gradient descent method that restores linear convergence in over-parameterized nonconvex matrix factorization, even with ill-conditioned ground truth, by using a specific regularization-based preconditioning.

Contribution

The paper proposes a novel preconditioning technique for nonconvex matrix factorization that achieves linear convergence in over-parameterized settings, addressing issues of slow convergence and ill-conditioning.

Findings

01

PrecGD restores linear convergence in over-parameterized matrix factorization.

02

The method is stable under noise and converges to an optimal error bound.

03

Numerical experiments confirm PrecGD's effectiveness across variants.

Abstract

In practical instances of nonconvex matrix factorization, the rank of the true solution $r^{⋆}$ is often unknown, so the rank $r$ of the model can be overspecified as $r > r^{⋆}$ . This over-parameterized regime of matrix factorization significantly slows down the convergence of local search algorithms, from a linear rate with $r = r^{⋆}$ to a sublinear rate when $r > r^{⋆}$ . We propose an inexpensive preconditioner for the matrix sensing variant of nonconvex matrix factorization that restores the convergence rate of gradient descent back to linear, even in the over-parameterized case, while also making it agnostic to possible ill-conditioning in the ground truth. Classical gradient descent in a neighborhood of the solution slows down due to the need for the model matrix factor to become singular. Our key result is that this singularity can be corrected by $ℓ_{2}$ …

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Preconditioned Gradient Descent for Over-Parameterized Nonconvex Matrix Factorization· slideslive

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Face and Expression Recognition · Medical Image Segmentation Techniques