Dropping Convexity for Faster Semi-definite Optimization
Srinadh Bhojanapalli, Anastasios Kyrillidis, Sujay Sanghavi

TL;DR
This paper analyzes the convergence of Factored Gradient Descent (FGD) for semi-definite optimization, showing it achieves rates comparable to standard gradient descent and providing initialization strategies for global convergence.
Contribution
It introduces convergence guarantees for FGD on general convex functions, including step size rules and initialization procedures, under standard convex assumptions.
Findings
FGD attains $O(1/k)$ convergence for smooth functions.
FGD converges exponentially fast for strongly convex functions.
Proper initialization ensures global convergence in certain cases.
Abstract
We study the minimization of a convex function over the set of positive semi-definite matrices, but when the problem is recast as , with and . We study the performance of gradient descent on ---which we refer to as Factored Gradient Descent (FGD)---under standard assumptions on the original function . We provide a rule for selecting the step size and, with this choice, show that the local convergence rate of FGD mirrors that of standard gradient descent on the original : i.e., after steps, the error is for smooth , and exponentially small in when is (restricted) strongly convex. In addition, we provide a procedure to initialize FGD for (restricted) strongly convex objectives and when one only has access to via a first-order oracle; for several problem instances,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research
