Optimal Algorithms for Stochastic Multi-Level Compositional Optimization

Wei Jiang; Bokun Wang; Yibo Wang; Lijun Zhang; Tianbao Yang

arXiv:2202.07530·cs.LG·October 20, 2022

Optimal Algorithms for Stochastic Multi-Level Compositional Optimization

Wei Jiang, Bokun Wang, Yibo Wang, Lijun Zhang, Tianbao Yang

PDF

Open Access

TL;DR

This paper introduces the SMVR algorithm for stochastic multi-level compositional optimization, achieving optimal sample complexities for non-convex, convex, and PL-condition objectives without large batch sizes.

Contribution

The paper proposes the SMVR method and its variants, achieving optimal sample complexities for various objective conditions in stochastic multi-level compositional optimization.

Findings

01

Achieves $ ilde{O}(1/ ext{epsilon}^3)$ sample complexity for non-convex objectives.

02

Improves to $ ilde{O}(1/ ext{epsilon}^2)$ for convex functions.

03

Achieves $ ilde{O}(1/( ext{mu} ext{epsilon}))$ for functions satisfying the PL condition.

Abstract

In this paper, we investigate the problem of stochastic multi-level compositional optimization, where the objective function is a composition of multiple smooth but possibly non-convex functions. Existing methods for solving this problem either suffer from sub-optimal sample complexities or need a huge batch size. To address these limitations, we propose a Stochastic Multi-level Variance Reduction method (SMVR), which achieves the optimal sample complexity of $O (1/ ϵ^{3})$ to find an $ϵ$ -stationary point for non-convex objectives. Furthermore, when the objective function satisfies the convexity or Polyak-{\L}ojasiewicz (PL) condition, we propose a stage-wise variant of SMVR and improve the sample complexity to $O (1/ ϵ^{2})$ for convex functions or $O (1/ (μ ϵ))$ for non-convex…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research