Nonconvex Stochastic Scaled-Gradient Descent and Generalized Eigenvector Problems
Chris Junchi Li, Michael I. Jordan

TL;DR
This paper introduces the Stochastic Scaled-Gradient Descent (SSGD) algorithm for Riemannian optimization, providing theoretical guarantees and applying it to online canonical correlation analysis with optimal convergence rates.
Contribution
The paper develops SSGD, a novel Riemannian stochastic optimization method with minimax optimal rates and asymptotic normality, specifically applied to generalized eigenvector problems and online CCA.
Findings
Achieves a $rac{1}{ oot T}$ convergence rate in spherical constraints.
Establishes minimax optimality up to polylog factors.
Provides the first explicit rate of local asymptotic normality for online CCA.
Abstract
Motivated by the problem of online canonical correlation analysis, we propose the \emph{Stochastic Scaled-Gradient Descent} (SSGD) algorithm for minimizing the expectation of a stochastic function over a generic Riemannian manifold. SSGD generalizes the idea of projected stochastic gradient descent and allows the use of scaled stochastic gradients instead of stochastic gradients. In the special case of a spherical constraint, which arises in generalized eigenvector problems, we establish a nonasymptotic finite-sample bound of , and show that this rate is minimax optimal, up to a polylogarithmic factor of relevant parameters. On the asymptotic side, a novel trajectory-averaging argument allows us to achieve local asymptotic normality with a rate that matches that of Ruppert-Polyak-Juditsky averaging. We bring these ideas together in an application to online canonical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Markov Chains and Monte Carlo Methods · Statistical Mechanics and Entropy
