On the Convergence Rate of Stochastic Mirror Descent for Nonsmooth   Nonconvex Optimization

Siqi Zhang; Niao He

arXiv:1806.04781·math.OC·June 14, 2018·39 cites

On the Convergence Rate of Stochastic Mirror Descent for Nonsmooth Nonconvex Optimization

Siqi Zhang, Niao He

PDF

Open Access

TL;DR

This paper analyzes the convergence rate of Stochastic Mirror Descent (SMD) in nonconvex, nonsmooth stochastic optimization, establishing a rate of O(1/√t) for convergence to stationary points without mini-batch use.

Contribution

It provides the first non-asymptotic convergence guarantees for SMD in weakly convex, nonsmooth nonconvex problems, including deterministic and proximal variants.

Findings

01

SMD converges at a rate of O(1/√t) to stationary points.

02

The analysis applies to both stochastic and deterministic SMD variants.

03

Efficiency matches existing stochastic subgradient results under stronger stationarity measures.

Abstract

In this paper, we investigate the non-asymptotic stationary convergence behavior of Stochastic Mirror Descent (SMD) for nonconvex optimization. We focus on a general class of nonconvex nonsmooth stochastic optimization problems, in which the objective can be decomposed into a relatively weakly convex function (possibly non-Lipschitz) and a simple non-smooth convex regularizer. We prove that SMD, without the use of mini-batch, is guaranteed to converge to a stationary point in a convergence rate of $O (1/ t)$ . The efficiency estimate matches with existing results for stochastic subgradient method, but is evaluated under a stronger stationarity measure. Our convergence analysis applies to both the original SMD and its proximal version, as well as the deterministic variants, for solving relatively weakly convex problems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research