On the Convergence of Projected-Gradient Methods with Low-Rank Projections for Smooth Convex Minimization over Trace-Norm Balls and Related Problems
Dan Garber

TL;DR
This paper analyzes conditions under which low-rank projections in projected-gradient methods for smooth convex minimization over trace-norm balls still guarantee convergence, enabling efficient large-scale low-rank matrix recovery.
Contribution
It provides theoretical conditions and bounds for the convergence of low-rank projected-gradient methods, extending to regularized and positive semidefinite cases, supported by empirical evidence.
Findings
Optimal solutions are centers of Euclidean balls with bounded rank.
Radius of the convergence ball depends on the spectral gap of the gradient.
Using higher-rank SVDs can significantly enlarge the convergence region.
Abstract
Smooth convex minimization over the unit trace-norm ball is an important optimization problem in machine learning, signal processing, statistics and other fields, that underlies many tasks in which one wishes to recover a low-rank matrix given certain measurements. While first-order methods for convex optimization enjoy optimal convergence rates, they require in worst-case to compute a full-rank SVD on each iteration, in order to compute the projection onto the trace-norm ball. These full-rank SVD computations however prohibit the application of such methods to large problems. A simple and natural heuristic to reduce the computational cost is to approximate the projection using only a low-rank SVD. This raises the question if, and under what conditions, this simple heuristic can indeed result in provable convergence to the optimal solution. In this paper we show that any optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research · Stochastic Gradient Optimization Techniques
