Katyusha Acceleration for Convex Finite-Sum Compositional Optimization
Yibo Xu, Yangyang Xu

TL;DR
This paper introduces accelerated algorithms for convex finite-sum compositional optimization problems, improving convergence rates and complexity bounds by leveraging Katyusha acceleration and mini-batch sampling.
Contribution
It develops new algorithms that incorporate Katyusha acceleration for convex and strongly-convex finite-sum COPs, achieving improved dependence on condition number and accuracy.
Findings
Achieves linear convergence for strongly-convex COPs.
Improves complexity dependence on condition number from $ au^3$ to $ au^{2.5}$.
Reduces complexity dependence on $ ext{epsilon}$ from $ ext{epsilon}^{-3}$ to $ ext{epsilon}^{-2.5}$.
Abstract
Structured problems arise in many applications. To solve these problems, it is important to leverage the structure information. This paper focuses on convex problems with a finite-sum compositional structure. Finite-sum problems appear as the sample average approximation of a stochastic optimization problem and also arise in machine learning with a huge amount of training data. One popularly used numerical approach for finite-sum problems is the stochastic gradient method (SGM). However, the additional compositional structure prohibits easy access to unbiased stochastic approximation of the gradient, so directly applying the SGM to a finite-sum compositional optimization problem (COP) is often inefficient. We design new algorithms for solving strongly-convex and also convex two-level finite-sum COPs. Our design incorporates the Katyusha acceleration technique and adopts the mini-batch…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Domain Adaptation and Few-Shot Learning
