Convergence of Nonmonotone Proximal Gradient Methods under the Kurdyka-Lojasiewicz Property without a Global Lipschitz Assumption
Christian Kanzow, Leo Lehmann

TL;DR
This paper proves that nonmonotone proximal gradient methods converge globally with favorable rates for composite optimization problems under the Kurdyka-Lojasiewicz property, without requiring a global Lipschitz condition or prior boundedness of iterates.
Contribution
It establishes that nonmonotone proximal gradient methods share convergence properties with monotone methods under the Kurdyka-Lojasiewicz property, without strong assumptions.
Findings
Global convergence of nonmonotone methods proven
Convergence rates comparable to monotone methods shown
No need for global Lipschitz or boundedness assumptions
Abstract
We consider the composite minimization problem with the objective function being the sum of a continuously differentiable and a merely lower semicontinuous and extended-valued function. The proximal gradient method is probably the most popular solver for this class of problems. Its convergence theory typically requires that either the gradient of the smooth part of the objective function is globally Lipschitz continuous or the (implicit or explicit) a priori assumption that the iterates generated by this method are bounded. Some recent results show that, without these assumptions, the proximal gradient method, combined with a monotone stepsize strategy, is still globally convergent with a suitable rate-of-convergence under the Kurdyka-Lojasiewicz property. For a nonmonotone stepsize strategy, there exist some attempts to verify similar convergence results, but, so far, they need…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIterative Methods for Nonlinear Equations · Advanced Optimization Algorithms Research · Optimization and Variational Analysis
