Riemannian Adaptive Optimization Methods

Gary B\'ecigneul; Octavian-Eugen Ganea

arXiv:1810.00760·cs.LG·February 19, 2019·93 cites

Riemannian Adaptive Optimization Methods

Gary B\'ecigneul, Octavian-Eugen Ganea

PDF

Open Access 1 Repo

TL;DR

This paper extends popular adaptive optimization algorithms like Adam and Adagrad to Riemannian manifolds, providing theoretical guarantees and demonstrating improved convergence in embedding tasks.

Contribution

It introduces Riemannian versions of Adam, Adagrad, and Amsgrad with convergence proofs for geodesically convex functions on product manifolds.

Findings

01

Faster convergence observed in Riemannian adaptive methods.

02

Lower train loss achieved on WordNet taxonomy embedding.

03

Algorithms reduce to standard Euclidean methods when applied to flat spaces.

Abstract

Several first order stochastic optimization methods commonly used in the Euclidean domain such as stochastic gradient descent (SGD), accelerated gradient descent or variance reduced methods have already been adapted to certain Riemannian settings. However, some of the most popular of these optimization tools - namely Adam , Adagrad and the more recent Amsgrad - remain to be generalized to Riemannian manifolds. We discuss the difficulty of generalizing such adaptive schemes to the most agnostic Riemannian setting, and then provide algorithms and convergence proofs for geodesically convex objectives in the particular case of a product of Riemannian manifolds, in which adaptivity is implemented across manifolds in the cartesian product. Our generalization is tight in the sense that choosing the Euclidean space as Riemannian manifold yields the same algorithms and regret bounds as those…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

geoopt/geoopt
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · 3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis

MethodsAdaGrad · Adam