A Unified Convergence Theorem for Stochastic Optimization Methods

Xiao Li; Andre Milzarek

arXiv:2206.03907·math.OC·October 20, 2022·5 cites

A Unified Convergence Theorem for Stochastic Optimization Methods

Xiao Li, Andre Milzarek

PDF

Open Access 1 Video

TL;DR

This paper introduces a unified convergence theorem applicable to various stochastic optimization methods, enabling simplified analysis and extending convergence guarantees to new algorithms in nonconvex, nonsmooth settings.

Contribution

The paper presents a general convergence theorem that applies to multiple stochastic optimization algorithms without algorithm-specific tailoring, including new results for prox-SGD and SMM.

Findings

01

Unified convergence theorem for stochastic methods

02

Extended convergence results for SGD and RR

03

New convergence guarantees for prox-SGD and SMM

Abstract

In this work, we provide a fundamental unified convergence theorem used for deriving expected and almost sure convergence results for a series of stochastic optimization methods. Our unified theorem only requires to verify several representative conditions and is not tailored to any specific algorithm. As a direct application, we recover expected and almost sure convergence results of the stochastic gradient method (SGD) and random reshuffling (RR) under more general settings. Moreover, we establish new expected and almost sure convergence results for the stochastic proximal gradient method (prox-SGD) and stochastic model-based methods (SMM) for nonsmooth nonconvex optimization problems. These applications reveal that our unified theorem provides a plugin-type convergence analysis and strong convergence guarantees for a wide class of stochastic optimization methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

A Unified Convergence Theorem for Stochastic Optimization Methods· slideslive

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Markov Chains and Monte Carlo Methods