Implicit Regularization in Nonconvex Statistical Estimation: Gradient Descent Converges Linearly for Phase Retrieval, Matrix Completion, and Blind Deconvolution
Cong Ma, Kaizheng Wang, Yuejie Chi, Yuxin Chen

TL;DR
This paper demonstrates that gradient descent inherently regularizes itself in nonconvex statistical estimation problems, enabling efficient convergence without explicit regularization across phase retrieval, matrix completion, and blind deconvolution.
Contribution
It reveals the implicit regularization phenomenon in gradient descent, providing a unified analysis framework for its convergence in key nonconvex estimation problems without explicit regularization.
Findings
Gradient descent stays within a well-behaved geometric basin.
It achieves near-optimal statistical and computational guarantees.
It attains near-optimal error bounds in noisy matrix completion.
Abstract
Recent years have seen a flurry of activities in designing provably efficient nonconvex procedures for solving statistical estimation problems. Due to the highly nonconvex nature of the empirical loss, state-of-the-art procedures often require proper regularization (e.g. trimming, regularized cost, projection) in order to guarantee fast convergence. For vanilla procedures such as gradient descent, however, prior theory either recommends highly conservative learning rates to avoid overshooting, or completely lacks performance guarantees. This paper uncovers a striking phenomenon in nonconvex optimization: even in the absence of explicit regularization, gradient descent enforces proper regularization implicitly under various statistical models. In fact, gradient descent follows a trajectory staying within a basin that enjoys nice geometry, consisting of points incoherent with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAffine Coupling · Normalizing Flows
