Accelerated Method for Stochastic Composition Optimization with   Nonsmooth Regularization

Zhouyuan Huo; Bin Gu; Ji Liu; Heng Huang

arXiv:1711.03937·cs.LG·January 1, 2018·21 cites

Accelerated Method for Stochastic Composition Optimization with Nonsmooth Regularization

Zhouyuan Huo, Bin Gu, Ji Liu, Heng Huang

PDF

Open Access

TL;DR

This paper introduces a new stochastic composition optimization method with variance reduction that achieves the fastest known convergence rates, including linear convergence for strongly convex problems and significant improvements for general cases, validated by experiments.

Contribution

The paper proposes a novel variance-reduction based stochastic method for nonsmooth composition optimization with the fastest convergence rates to date.

Findings

01

Achieves linear convergence for strongly convex problems.

02

Improves convergence rate from O(T^{-1/2}) to O((n_1+n_2)^{2/3} T^{-1}).

03

Experimental results confirm theoretical advantages.

Abstract

Stochastic composition optimization draws much attention recently and has been successful in many emerging applications of machine learning, statistical analysis, and reinforcement learning. In this paper, we focus on the composition problem with nonsmooth regularization penalty. Previous works either have slow convergence rate or do not provide complete convergence analysis for the general problem. In this paper, we tackle these two issues by proposing a new stochastic composition optimization method for composition problem with nonsmooth regularization penalty. In our method, we apply variance reduction technique to accelerate the speed of convergence. To the best of our knowledge, our method admits the fastest convergence rate for stochastic composition optimization: for strongly convex composition problem, our algorithm is proved to admit linear convergence; for general composition…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Optimization and Search Problems · Machine Learning and ELM

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings