A second-order method landing on the Stiefel manifold via Newton$\unicode{x2013}$Schulz iteration
Xinhui Xiong, Bin Gao, P.-A. Absil

TL;DR
This paper introduces a second-order, retraction-free optimization method on the Stiefel manifold that achieves quadratic convergence by combining tangent and normal components, utilizing Newton–Schulz iteration.
Contribution
It proposes a novel second-order method on the Stiefel manifold that avoids retractions and leverages Newton–Schulz iteration for improved efficiency and convergence.
Findings
The method achieves local quadratic or superlinear convergence.
Numerical experiments show it outperforms existing methods.
It effectively reduces both objective and infeasibility on test problems.
Abstract
Retraction-free approaches offer attractive low-cost alternatives to Riemannian methods on the Stiefel manifold, but they are often first-order, which may limit the efficiency under high-accuracy requirements. To this end, we propose a second-order method landing on the Stiefel manifold without invoking retractions, which is proved to enjoy local quadratic (or superlinear for its inexact variant) convergence. The update consists of the sum of (i) a component tangent to the level set of the constraint-defining function that aims to reduce the objective and (ii) a component normal to the same level set that reduces the infeasibility. Specifically, we construct the normal component via NewtonSchulz, a fixed-point iteration for orthogonalization. Moreover, we establish a geometric connection between the NewtonSchulz iteration and Stiefel manifolds, in which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
