Orthogonal Directions Constrained Gradient Method: from non-linear equality constraints to Stiefel manifold
Sholom Schechtman, Daniil Tiapkin, Michael Muehlebach, Eric Moulines

TL;DR
This paper introduces ODCGM, a simple and efficient algorithm for non-convex optimization on manifolds that avoids complex retractions and achieves near-optimal convergence rates, extending previous methods for Stiefel manifold optimization.
Contribution
The paper proposes ODCGM, a novel projection-based gradient method that simplifies manifold optimization and extends the analysis of existing algorithms with near-optimal complexity guarantees.
Findings
ODCGM converges towards the manifold with simple projections.
The method achieves near-optimal oracle complexities in deterministic and stochastic settings.
Numerical experiments demonstrate high-dimensional efficiency of ODCGM.
Abstract
We consider the problem of minimizing a non-convex function over a smooth manifold . We propose a novel algorithm, the Orthogonal Directions Constrained Gradient Method (ODCGM) which only requires computing a projection onto a vector space. ODCGM is infeasible but the iterates are constantly pulled towards the manifold, ensuring the convergence of ODCGM towards . ODCGM is much simpler to implement than the classical methods which require the computation of a retraction. Moreover, we show that ODCGM exhibits the near-optimal oracle complexities and in the deterministic and stochastic cases, respectively. Furthermore, we establish that, under an appropriate choice of the projection metric, our method recovers the landing algorithm of Ablin and Peyr\'e (2022), a recently introduced algorithm for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Point processes and geometric inequalities
