Solving Stochastic Compositional Optimization is Nearly as Easy as Solving Stochastic Optimization
Tianyi Chen, Yuejiao Sun, Wotao Yin

TL;DR
This paper introduces a new stochastic compositional gradient method (SCSC) that converges at the same rate as standard SGD, simplifying and accelerating compositional optimization in applications like reinforcement learning.
Contribution
The paper proposes SCSC, a single-loop, fixed batch size method that achieves SGD-like convergence rates for stochastic compositional optimization, with easy integration of acceleration techniques.
Findings
SCSC converges at the same rate as SGD for non-compositional optimization.
Applying Adam to SCSC yields state-of-the-art performance.
SCSC performs well on portfolio management and meta-learning tasks.
Abstract
Stochastic compositional optimization generalizes classic (non-compositional) stochastic optimization to the minimization of compositions of functions. Each composition may introduce an additional expectation. The series of expectations may be nested. Stochastic compositional optimization is gaining popularity in applications such as reinforcement learning and meta learning. This paper presents a new Stochastically Corrected Stochastic Compositional gradient method (SCSC). SCSC runs in a single-time scale with a single loop, uses a fixed batch size, and guarantees to converge at the same rate as the stochastic gradient descent (SGD) method for non-compositional stochastic optimization. This is achieved by making a careful improvement to a popular stochastic compositional gradient method. It is easy to apply SGD-improvement techniques to accelerate SCSC. This helps SCSC achieve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAdam
