Katyusha Acceleration for Convex Finite-Sum Compositional Optimization

Yibo Xu; Yangyang Xu

arXiv:1910.11217·math.OC·March 26, 2021·5 cites

Katyusha Acceleration for Convex Finite-Sum Compositional Optimization

Yibo Xu, Yangyang Xu

PDF

Open Access

TL;DR

This paper introduces accelerated algorithms for convex finite-sum compositional optimization problems, improving convergence rates and complexity bounds by leveraging Katyusha acceleration and mini-batch sampling.

Contribution

It develops new algorithms that incorporate Katyusha acceleration for convex and strongly-convex finite-sum COPs, achieving improved dependence on condition number and accuracy.

Findings

01

Achieves linear convergence for strongly-convex COPs.

02

Improves complexity dependence on condition number from $ au^3$ to $ au^{2.5}$.

03

Reduces complexity dependence on $ ext{epsilon}$ from $ ext{epsilon}^{-3}$ to $ ext{epsilon}^{-2.5}$.

Abstract

Structured problems arise in many applications. To solve these problems, it is important to leverage the structure information. This paper focuses on convex problems with a finite-sum compositional structure. Finite-sum problems appear as the sample average approximation of a stochastic optimization problem and also arise in machine learning with a huge amount of training data. One popularly used numerical approach for finite-sum problems is the stochastic gradient method (SGM). However, the additional compositional structure prohibits easy access to unbiased stochastic approximation of the gradient, so directly applying the SGM to a finite-sum compositional optimization problem (COP) is often inefficient. We design new algorithms for solving strongly-convex and also convex two-level finite-sum COPs. Our design incorporates the Katyusha acceleration technique and adopts the mini-batch…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Domain Adaptation and Few-Shot Learning