Averaging Stochastic Gradient Descent on Riemannian Manifolds

Nilesh Tripuraneni; Nicolas Flammarion; Francis Bach; Michael I. Jordan

arXiv:1802.09128·cs.LG·June 11, 2018·40 cites

TL;DR

This paper introduces a geometric framework for averaging stochastic gradient descent iterates on Riemannian manifolds, achieving a fast $O(1/n)$ convergence rate and accelerating streaming $k$-PCA to the optimal rate without prior knowledge of the eigengap.

Contribution

The paper develops a geometric averaging method that turns slowly converging SGD iterates on a Riemannian manifold into an averaged sequence with a fast $O(1/n)$ rate, and applies it to problems such as streaming $k$-PCA.

Findings

01. Achieves an $O(1/n)$ convergence rate for the averaged iterates on manifolds.

02. Accelerates streaming $k$-PCA to the optimal convergence rate, without requiring knowledge of the eigengap.

03. Provides a robust framework applicable to geodesically-strongly-convex problems.

Abstract

We consider the minimization of a function defined on a Riemannian manifold $M$ accessible only through unbiased estimates of its gradients. We develop a geometric framework to transform a sequence of slowly converging iterates generated from stochastic gradient descent (SGD) on $M$ to an averaged iterate sequence with a robust and fast $O(1/n)$ convergence rate. We then present an application of our framework to geodesically-strongly-convex (and possibly Euclidean non-convex) problems. Finally, we demonstrate how these ideas apply to the case of streaming $k$-PCA, where we show how to accelerate the slow rate of the randomized power method (without requiring knowledge of the eigengap) into a robust algorithm achieving the optimal rate of convergence.
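To make the idea in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes the simplest setting (the unit sphere, i.e. $k = 1$ PCA), uses the sphere's exponential and logarithm maps in place of a general retraction, and picks a hypothetical step-size schedule $\gamma_n = \gamma_0 n^{-\alpha}$. The function names and default values (`step0`, `alpha`) are illustrative assumptions. It runs Riemannian SGD on $f(x) = -\tfrac{1}{2} x^\top \mathbb{E}[h h^\top] x$ and maintains a streaming geodesic average by moving the running average a fraction $1/(n+1)$ of the way toward each new iterate.

```python
import numpy as np

def sphere_exp(x, v):
    """Exponential map on the unit sphere: follow the geodesic from x along tangent v."""
    norm_v = np.linalg.norm(v)
    if norm_v < 1e-12:
        return x
    y = np.cos(norm_v) * x + np.sin(norm_v) * (v / norm_v)
    return y / np.linalg.norm(y)  # renormalize to guard against floating-point drift

def sphere_log(x, y):
    """Inverse exponential map: tangent vector at x pointing toward y along the geodesic."""
    c = np.clip(x @ y, -1.0, 1.0)
    u = y - c * x  # component of y orthogonal to x (lies in the tangent space at x)
    norm_u = np.linalg.norm(u)
    if norm_u < 1e-12:
        return np.zeros_like(x)
    return np.arccos(c) * u / norm_u

def averaged_rsgd_top_eigvec(samples, step0=0.5, alpha=0.75, seed=0):
    """Riemannian SGD with streaming geodesic averaging for the top eigenvector.

    step0 and alpha are hypothetical choices for this sketch, not values from the paper.
    Returns (last SGD iterate, averaged iterate).
    """
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(samples.shape[1])
    x /= np.linalg.norm(x)
    x_avg = x.copy()
    for n, h in enumerate(samples, start=1):
        g = -(h @ x) * h            # stochastic Euclidean gradient of -0.5 * (h.x)^2
        rgrad = g - (g @ x) * x     # project onto the tangent space at x
        x = sphere_exp(x, -step0 * n ** (-alpha) * rgrad)
        # Streaming average: move x_avg a fraction 1/(n+1) of the way toward x.
        x_avg = sphere_exp(x_avg, sphere_log(x_avg, x) / (n + 1))
    return x, x_avg
```

A usage example under the same assumptions: draw samples `h` whose covariance has a dominant first coordinate, call `averaged_rsgd_top_eigvec(samples)`, and compare both returned vectors with the first standard basis vector up to sign. In the Euclidean case this update reduces to the ordinary running mean of the iterates, which is why it is a natural manifold analogue of iterate averaging.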

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Numerical methods in inverse problems · Topological and Geometric Data Analysis