Optimal Algorithms for Stochastic Multi-Level Compositional Optimization
Wei Jiang, Bokun Wang, Yibo Wang, Lijun Zhang, Tianbao Yang

TL;DR
This paper introduces the SMVR algorithm for stochastic multi-level compositional optimization, achieving optimal sample complexities for non-convex, convex, and PL-condition objectives without large batch sizes.
Contribution
The paper proposes the SMVR method and its variants, achieving optimal sample complexities for various objective conditions in stochastic multi-level compositional optimization.
Findings
Achieves $ ilde{O}(1/ ext{epsilon}^3)$ sample complexity for non-convex objectives.
Improves to $ ilde{O}(1/ ext{epsilon}^2)$ for convex functions.
Achieves $ ilde{O}(1/( ext{mu} ext{epsilon}))$ for functions satisfying the PL condition.
Abstract
In this paper, we investigate the problem of stochastic multi-level compositional optimization, where the objective function is a composition of multiple smooth but possibly non-convex functions. Existing methods for solving this problem either suffer from sub-optimal sample complexities or need a huge batch size. To address these limitations, we propose a Stochastic Multi-level Variance Reduction method (SMVR), which achieves the optimal sample complexity of to find an -stationary point for non-convex objectives. Furthermore, when the objective function satisfies the convexity or Polyak-{\L}ojasiewicz (PL) condition, we propose a stage-wise variant of SMVR and improve the sample complexity to for convex functions or for non-convex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research
