Riemannian Stochastic Gradient Method for Nested Composition   Optimization

Dewei Zhang; Sam Davanloo Tajbakhsh

arXiv:2207.09350·math.OC·March 20, 2024

Riemannian Stochastic Gradient Method for Nested Composition Optimization

Dewei Zhang, Sam Davanloo Tajbakhsh

PDF

Open Access

TL;DR

This paper introduces a Riemannian stochastic gradient method for nested composition optimization problems on manifolds, addressing bias issues and providing convergence guarantees with applications in reinforcement learning.

Contribution

It proposes the R-SCGD algorithm for two-level and multi-level nested compositional problems on Riemannian manifolds, with proven convergence rates.

Findings

01

Converges to an approximate stationary point in $O( ext{epsilon}^{-2})$ calls.

02

Generalizes to multi-level nested structures with same complexity.

03

Numerical evaluation demonstrates effectiveness in reinforcement learning.

Abstract

This work considers optimization of composition of functions in a nested form over Riemannian manifolds where each function contains an expectation. This type of problems is gaining popularity in applications such as policy evaluation in reinforcement learning or model customization in meta-learning. The standard Riemannian stochastic gradient methods for non-compositional optimization cannot be directly applied as stochastic approximation of inner functions create bias in the gradients of the outer functions. For two-level composition optimization, we present a Riemannian Stochastic Composition Gradient Descent (R-SCGD) method that finds an approximate stationary point, with expected squared Riemannian gradient smaller than $ϵ$ , in $O (ϵ^{- 2})$ calls to the stochastic gradient oracle of the outer function and stochastic function and gradient oracles of the inner function.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Stochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods