Stochastic Multi-level Composition Optimization Algorithms with Level-Independent Convergence Rates
Krishnakumar Balasubramanian, Saeed Ghadimi, Anthony Nguyen

TL;DR
This paper introduces new stochastic algorithms for multi-level composition optimization problems, achieving level-independent convergence rates and improved sample complexity, with practical advantages like being parameter-free and easy to implement.
Contribution
The paper proposes two novel algorithms for stochastic multi-level optimization, with the second algorithm achieving improved sample complexity and practical benefits over previous methods.
Findings
First algorithm achieves $ ilde{O}(1/ ext{epsilon}^6)$ sample complexity.
Modified algorithm reduces complexity to $ ilde{O}(1/ ext{epsilon}^4)$ and is parameter-free.
First to match single-level convergence rates in multi-level stochastic optimization.
Abstract
In this paper, we study smooth stochastic multi-level composition optimization problems, where the objective function is a nested composition of functions. We assume access to noisy evaluations of the functions and their gradients, through a stochastic first-order oracle. For solving this class of problems, we propose two algorithms using moving-average stochastic estimates, and analyze their convergence to an -stationary point of the problem. We show that the first algorithm, which is a generalization of \cite{GhaRuswan20} to the level case, can achieve a sample complexity of by using mini-batches of samples in each iteration. By modifying this algorithm using linearized stochastic estimates of the function values, we improve the sample complexity to . {\color{black}This modification not only removes the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research
