Approaching nonsmooth nonconvex optimization problems through first order dynamical systems with hidden acceleration and Hessian driven damping terms
Radu Ioan Bot, Ern\"o Robert Csetnek

TL;DR
This paper analyzes a first-order dynamical system combining proximal-gradient steps with hidden acceleration and damping terms, showing convergence to critical points of a nonsmooth, nonconvex optimization problem under Kurdyka-{}ojasiewicz conditions.
Contribution
It introduces a novel dynamical system framework for nonsmooth nonconvex optimization with acceleration and damping, proving convergence to critical points.
Findings
Trajectories approach the set of critical points.
Convergence rates depend on the Kurdyka-{}ojasiewicz exponent.
Results apply under mild regularity assumptions.
Abstract
In this paper we carry out an asymptotic analysis of the proximal-gradient dynamical system \begin{equation*}\left\{ \begin{array}{ll} \dot x(t) +x(t) = \prox_{\gamma f}\big[x(t)-\gamma\nabla\Phi(x(t))-ax(t)-by(t)\big],\\ \dot y(t)+ax(t)+by(t)=0 \end{array}\right.\end{equation*} where is a proper, convex and lower semicontinuous function, a possibly nonconvex smooth function and and are positive real numbers. We show that the generated trajectories approach the set of critical points of , here understood as zeros of its limiting subdifferential, under the premise that a regularization of this sum function satisfies the Kurdyka-\L{}ojasiewicz property. We also establish convergence rates for the trajectories, formulated in terms of the \L{}ojasiewicz exponent of the considered regularization function.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
