Survey Descent: A Multipoint Generalization of Gradient Descent for   Nonsmooth Optimization

X.Y. Han; Adrian S. Lewis

arXiv:2111.15645·math.OC·January 19, 2023

Survey Descent: A Multipoint Generalization of Gradient Descent for Nonsmooth Optimization

X.Y. Han, Adrian S. Lewis

PDF

Open Access

TL;DR

This paper introduces a multipoint generalization of gradient descent tailored for nonsmooth optimization, achieving linear convergence for max-of-smooth objectives and offering a new theoretical framework for such problems.

Contribution

It proposes a novel multipoint descent method that extends gradient descent to nonsmooth settings and proves linear convergence for max-of-smooth objectives.

Findings

01

Proves linear convergence for max-of-smooth objectives.

02

Demonstrates the method's effectiveness through experiments.

03

Addresses challenges in nonsmooth optimization theory.

Abstract

For strongly convex objectives that are smooth, the classical theory of gradient descent ensures linear convergence relative to the number of gradient evaluations. An analogous nonsmooth theory is challenging. Even when the objective is smooth at every iterate, the corresponding local models are unstable and the number of cutting planes invoked by traditional remedies is difficult to bound, leading to convergences guarantees that are sublinear relative to the cumulative number of gradient evaluations. We instead propose a multipoint generalization of the gradient descent iteration for local optimization. While designed with general objectives in mind, we are motivated by a ``max-of-smooth'' model that captures the subdifferential dimension at optimality. We prove linear convergence when the objective is itself max-of-smooth, and experiments suggest a more general phenomenon.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research