Optimization without Retraction on the Random Generalized Stiefel Manifold
Simon Vary, Pierre Ablin, Bin Gao, P.-A. Absil

TL;DR
This paper introduces a stochastic iterative optimization method for the generalized Stiefel manifold that converges to critical points without retraction, reducing computational costs in applications like CCA, ICA, and GEVP.
Contribution
It presents a novel stochastic algorithm that avoids retraction, converging in expectation with lower per-iteration costs for problems involving generalized orthogonality constraints.
Findings
Method achieves convergence rates comparable to full-matrix approaches.
Experiments validate effectiveness in machine learning applications.
Requires only matrix multiplications, reducing computational complexity.
Abstract
Optimization over the set of matrices that satisfy , referred to as the generalized Stiefel manifold, appears in many applications involving sampled covariance matrices such as the canonical correlation analysis (CCA), independent component analysis (ICA), and the generalized eigenvalue problem (GEVP). Solving these problems is typically done by iterative methods that require a fully formed . We propose a cheap stochastic iterative method that solves the optimization problem while having access only to random estimates of . Our method does not enforce the constraint in every iteration; instead, it produces iterations that converge to critical points on the generalized Stiefel manifold defined in expectation. The method has lower per-iteration cost, requires only matrix multiplications, and has the same convergence rates as its Riemannian optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Game Theory and Cooperation · Topological and Geometric Data Analysis · Opinion Dynamics and Social Influence
MethodsSparse Evolutionary Training · Independent Component Analysis
