TL;DR
This paper investigates the convergence speed of nonsmooth optimization algorithms using the Goldstein subdifferential, providing theoretical insights and numerical validation for gradient sampling methods.
Contribution
It introduces a detailed analysis of the convergence rate of $(oldsymbol{ ext{ε,δ}})$-critical points in nonsmooth optimization using the Goldstein subdifferential, with applications to gradient sampling.
Findings
Convergence speed of $( ext{ε,δ})$-critical points analyzed
Theoretical results supported by simple illustrative examples
Numerical experiments demonstrate practical implications
Abstract
The Goldstein -subdifferential is a relaxed version of the Clarke subdifferential which has recently appeared in several algorithms for nonsmooth optimization. With it comes the notion of -critical points, which are points in which the element with the smallest norm in the -subdifferential has norm at most . To obtain points that are critical in the classical sense, and must vanish. In this article, we analyze at which speed the distance of -critical points to the minimum vanishes with respect to and . Afterwards, we apply our results to gradient sampling methods and perform numerical experiments. Throughout the article, we put a special emphasis on supporting the theoretical results with simple examples that visualize them.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
