Multi-Level Stochastic Gradient Methods for Nested Composition   Optimization

Shuoguang Yang; Mengdi Wang; Ethan X. Fang

arXiv:1801.03600·math.OC·January 15, 2018·SIAM J. Optim.·5 cites

Multi-Level Stochastic Gradient Methods for Nested Composition Optimization

Shuoguang Yang, Mengdi Wang, Ethan X. Fang

PDF

Open Access

TL;DR

This paper introduces multi-level stochastic gradient methods for complex nested optimization problems, achieving improved convergence rates especially when component functions are smooth, with applications in risk-averse optimization.

Contribution

It develops a multi-level stochastic gradient framework with accelerated methods and provides convergence guarantees for both convex and nonconvex cases.

Findings

01

Convergence rate of $O(n^{-1/2^T})$ for basic multi-level methods.

02

Accelerated methods improve convergence to $O(n^{-4/(7+T)})$ for smooth functions.

03

Validated effectiveness through numerical experiments.

Abstract

Stochastic gradient methods are scalable for solving large-scale optimization problems that involve empirical expectations of loss functions. Existing results mainly apply to optimization problems where the objectives are one- or two-level expectations. In this paper, we consider the multi-level compositional optimization problem that involves compositions of multi-level component functions and nested expectations over a random path. It finds applications in risk-averse optimization and sequential planning. We propose a class of multi-level stochastic gradient methods that are motivated from the method of multi-timescale stochastic approximation. First we propose a basic $T$ -level stochastic compositional gradient algorithm, establish its almost sure convergence and obtain an $n$ -iteration error bound $O (n^{- 1/ 2^{T}})$ . Then we develop accelerated multi-level stochastic gradient methods…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization · Stochastic Gradient Optimization Techniques · Statistical Methods and Inference