Accelerated Method for Stochastic Composition Optimization with Nonsmooth Regularization
Zhouyuan Huo, Bin Gu, Ji Liu, Heng Huang

TL;DR
This paper introduces a new stochastic composition optimization method with variance reduction that achieves the fastest known convergence rates, including linear convergence for strongly convex problems and significant improvements for general cases, validated by experiments.
Contribution
The paper proposes a novel variance-reduction based stochastic method for nonsmooth composition optimization with the fastest convergence rates to date.
Findings
Achieves linear convergence for strongly convex problems.
Improves convergence rate from O(T^{-1/2}) to O((n_1+n_2)^{2/3} T^{-1}).
Experimental results confirm theoretical advantages.
Abstract
Stochastic composition optimization draws much attention recently and has been successful in many emerging applications of machine learning, statistical analysis, and reinforcement learning. In this paper, we focus on the composition problem with nonsmooth regularization penalty. Previous works either have slow convergence rate or do not provide complete convergence analysis for the general problem. In this paper, we tackle these two issues by proposing a new stochastic composition optimization method for composition problem with nonsmooth regularization penalty. In our method, we apply variance reduction technique to accelerate the speed of convergence. To the best of our knowledge, our method admits the fastest convergence rate for stochastic composition optimization: for strongly convex composition problem, our algorithm is proved to admit linear convergence; for general composition…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Optimization and Search Problems · Machine Learning and ELM
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
