Nonsmooth optimization using Taylor-like models: error bounds,   convergence, and termination criteria

Dmitriy Drusvyatskiy; Alexander D. Ioffe; Adrian S. Lewis

arXiv:1610.03446·math.OC·October 12, 2016·Math. Program.

Nonsmooth optimization using Taylor-like models: error bounds, convergence, and termination criteria

Dmitriy Drusvyatskiy, Alexander D. Ioffe, Adrian S. Lewis

PDF

TL;DR

This paper develops a theoretical framework for nonsmooth optimization algorithms based on Taylor-like models, establishing relationships between step-sizes, convergence, and error bounds, with practical termination criteria and convergence guarantees.

Contribution

It provides explicit links between step-sizes and function slopes, enabling reliable termination and convergence analysis for nonsmooth optimization methods.

Findings

01

Step-sizes can reliably determine algorithm termination.

02

Limit points of the iterates are stationary if step-sizes tend to zero.

03

Error bounds imply linear convergence rates.

Abstract

We consider optimization algorithms that successively minimize simple Taylor-like models of the objective function. Methods of Gauss-Newton type for minimizing the composition of a convex function and a smooth map are common examples. Our main result is an explicit relationship between the step-size of any such algorithm and the slope of the function at a nearby point. Consequently, we (1) show that the step-sizes can be reliably used to terminate the algorithm, (2) prove that as long as the step-sizes tend to zero, every limit point of the iterates is stationary, and (3) show that conditions, akin to classical quadratic growth, imply that the step-sizes linearly bound the distance of the iterates to the solution set. The latter so-called error bound property is typically used to establish linear (or faster) convergence guarantees. Analogous results hold when the step-size is replaced…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.