arXiv:1703.01339·math.OC·March 7, 2017

Newton-like dynamics associated to nonconvex optimization problems

Radu Ioan Boţ, Ernö Robert Csetnek

TL;DR

This paper introduces a Newton-like dynamical system for nonconvex optimization, shows that the limit points of its trajectory are critical points of the objective, and proves convergence of the whole trajectory, with rates, under the Kurdyka-Łojasiewicz property.

Contribution

It proposes a Newton-like dynamical system for nonconvex optimization and establishes convergence results and rates under the Kurdyka-Łojasiewicz property.

Findings

01

Limit points are contained in the set of critical points.

02

Trajectory convergence to a critical point is proven under the Kurdyka-Łojasiewicz property.

03

Convergence rates depend on the Łojasiewicz exponent of the objective function.

Abstract

We consider the dynamical system \begin{equation*}\left\{ \begin{array}{l} v(t)\in\partial\phi(x(t))\\ \lambda\dot{x}(t) + \dot{v}(t) + v(t) + \nabla \psi(x(t))=0, \end{array}\right.\end{equation*} where $\phi:\mathbb{R}^{n}\to\mathbb{R}\cup\{+\infty\}$ is a proper, convex and lower semicontinuous function, $\psi:\mathbb{R}^{n}\to\mathbb{R}$ is a (possibly nonconvex) smooth function and $\lambda>0$ is a parameter which controls the velocity. We show that the set of limit points of the trajectory $x$ is contained in the set of critical points of the objective function $\phi+\psi$, which is here seen as the set of the zeros of its limiting subdifferential. If the objective function satisfies the Kurdyka-Łojasiewicz property, then we can prove convergence of the whole trajectory $x$ to a critical point. Furthermore, convergence rates for the orbits are obtained in terms of the Łojasiewicz exponent of the objective…

Equations

$$\left\{\begin{array}{l}v(t)\in\partial\phi(x(t))\\ \lambda\dot{x}(t)+\dot{v}(t)+v(t)+\nabla\psi(x(t))=0,\end{array}\right.$$

$$\left\{\begin{array}{l}v(t)\in T(x(t))\\ \lambda(t)\dot{x}(t)+\dot{v}(t)+v(t)=0,\end{array}\right.$$

$$\inf_{x\in\mathbb{R}^{n}}\{\phi(x)+\psi(x)\},$$

$$\left\{\begin{array}{l}v(t)\in\partial\phi(x(t))\\ \lambda(t)\dot{x}(t)+\dot{v}(t)+v(t)+\nabla\psi(x(t))=0,\end{array}\right.$$

$$\hat{\partial}f(x)=\left\{v\in\mathbb{R}^{n}:\liminf_{y\to x}\frac{f(y)-f(x)-\langle v,y-x\rangle}{\|y-x\|}\geq 0\right\}.$$

$$\partial_{L}f(x)=\{v\in\mathbb{R}^{n}:\exists x_{k}\to x,\ f(x_{k})\to f(x)\mbox{ and }\exists v_{k}\in\hat{\partial}f(x_{k}),\ v_{k}\to v\mbox{ as }k\to+\infty\},$$

$$\operatorname*{crit}(f)=\{x\in\mathbb{R}^{n}:0\in\partial_{L}f(x)\}$$

$$\frac{d}{dt}F(t)\leq G(t).$$

$$\frac{d}{dt}F(t)\leq G(t),$$

$$\frac{d}{dt}f(x(t))=\langle\dot{x}(t),h\rangle\ \ \forall h\in\partial f(x(t)).$$

$$\left\{\begin{array}{l}v(t)\in\partial\phi(x(t))\\ \lambda\dot{x}(t)+\dot{v}(t)+v(t)+\nabla\psi(x(t))=0\\ x(0)=x_{0},\ v(0)=v_{0}\in\partial\phi(x_{0}),\end{array}\right.$$

$$\frac{d}{dt}\left(\frac{1}{2}\|v(t)+\nabla\psi(x(t))\|^{2}\right)$$

$$\frac{d}{dt}\left(\frac{1}{2}\|v(t)+\nabla\psi(x(t))\|^{2}\right)+\frac{3}{4}\|\dot{v}(t)\|^{2}\leq L(\lambda+L)\|\dot{x}(t)\|^{2}.$$

$$\lim_{t\to+\infty}\big(\lambda\dot{x}(t)+\dot{v}(t)\big)=0.$$

$$\|\dot{v}(t)\|^{2}$$

$$\frac{d}{dt}(\phi+\psi)(x(t))\leq 0$$

$$0\in\partial_{L}(\phi+\psi)(\overline{x}).$$

$$v(t_{k})+\nabla\psi(x(t_{k}))\in\partial\phi(x(t_{k}))+\nabla\psi(x(t_{k}))=\partial_{L}(\phi+\psi)(x(t_{k})).$$

$$x(t_{k})\to\overline{x}\mbox{ as }k\to+\infty$$

$$v(t_{k})+\nabla\psi(x(t_{k}))\to 0\mbox{ as }k\to+\infty.$$

$$(\phi+\psi)(x(t_{k}))\to(\phi+\psi)(\overline{x})\mbox{ as }k\to+\infty.$$

$$v(t_{k})\to-\nabla\psi(\overline{x})\mbox{ as }k\to+\infty.$$

$$\phi(\overline{x})\geq\phi(x(t_{k}))+\langle v(t_{k}),\overline{x}-x(t_{k})\rangle\ \ \forall k\in\mathbb{N}.$$

$$\limsup_{k\to+\infty}\phi(x(t_{k}))\leq\phi(\overline{x}).$$

$$\lim_{k\to+\infty}\phi(x(t_{k}))=\phi(\overline{x}),$$

$$\omega(x):=\{\overline{x}\in\mathbb{R}^{n}:\exists t_{k}\to+\infty\mbox{ such that }x(t_{k})\to\overline{x}\mbox{ as }k\to+\infty\}.$$

$$\lim_{\|u\|\to+\infty}(\phi+\psi)(u)=+\infty.$$

$$(\phi+\psi)(x(T))\leq(\phi+\psi)(x_{0})\ \ \forall T\geq 0.$$

$$U\cap\{x\in\mathbb{R}^{n}:f(\overline{x})<f(x)<f(\overline{x})+\eta\}$$

$$\varphi^{\prime}(f(x)-f(\overline{x}))\operatorname*{dist}(0,\partial_{L}f(x))\geq 1.$$

Taxonomy

Topics: Functional Equations Stability Results

Full text

Newton-like dynamics associated to nonconvex optimization problems

Radu Ioan Boţ University of Vienna, Faculty of Mathematics, Oskar-Morgenstern-Platz 1, A-1090 Vienna, Austria, email: [email protected]. Research partially supported by FWF (Austrian Science Fund), project I 2419-N32.

Ernö Robert Csetnek University of Vienna, Faculty of Mathematics, Oskar-Morgenstern-Platz 1, A-1090 Vienna, Austria, email: [email protected]. Research supported by FWF (Austrian Science Fund), project P 29809-N32.

Abstract. We consider the dynamical system

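$$\left\{\begin{array}{l}v(t)\in\partial\phi(x(t))\\ \lambda\dot{x}(t)+\dot{v}(t)+v(t)+\nabla\psi(x(t))=0,\end{array}\right.$$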

where $\phi:\mathbb{R}^{n}\to\mathbb{R}\cup\{+\infty\}$ is a proper, convex and lower semicontinuous function, $\psi:\mathbb{R}^{n}\to\mathbb{R}$ is a (possibly nonconvex) smooth function and $\lambda>0$ is a parameter which controls the velocity. We show that the set of limit points of the trajectory $x$ is contained in the set of critical points of the objective function $\phi+\psi$ , which is here seen as the set of the zeros of its limiting subdifferential. If the objective function satisfies the Kurdyka-Łojasiewicz property, then we can prove convergence of the whole trajectory $x$ to a critical point. Furthermore, convergence rates for the orbits are obtained in terms of the Łojasiewicz exponent of the objective function, provided the latter satisfies the Łojasiewicz property.

Key Words. dynamical systems, Newton-like methods, Lyapunov analysis, nonsmooth optimization, limiting subdifferential, Kurdyka-Łojasiewicz property

AMS subject classification. 34G25, 47J25, 47H05, 90C26, 90C30, 65K10

1 Introduction and preliminaries

The dynamical system

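$$\left\{\begin{array}{l}v(t)\in T(x(t))\\ \lambda(t)\dot{x}(t)+\dot{v}(t)+v(t)=0,\end{array}\right.\tag{1}$$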

where $\lambda:[0,+\infty)\to[0,+\infty)$ and $T:\mathbb{R}^{n}\rightrightarrows\mathbb{R}^{n}$ is a (set-valued) maximally monotone operator, has been introduced and investigated in [10] as a continuous version of Newton and Levenberg-Marquardt-type algorithms. It has been shown that under mild conditions on $\lambda$ the trajectory $x(t)$ converges weakly to a zero of the operator $T$ , while $v(t)$ converges to zero as $t\rightarrow+\infty$ .

These investigations have been continued in [2] in the context of solving optimization problems of the form

$$\inf_{x\in\mathbb{R}^{n}}\{\phi(x)+\psi(x)\},\tag{2}$$

where $\phi:\mathbb{R}^{n}\to\mathbb{R}\cup\{+\infty\}$ is a proper, convex and lower semicontinuous function and $\psi:\mathbb{R}^{n}\to\mathbb{R}$ is a convex and differentiable function with locally Lipschitz-continuous gradient. More precisely, problem (2) has been approached via the dynamical system

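$$\left\{\begin{array}{l}v(t)\in\partial\phi(x(t))\\ \lambda(t)\dot{x}(t)+\dot{v}(t)+v(t)+\nabla\psi(x(t))=0,\end{array}\right.\tag{3}$$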

where $\partial\phi$ is the convex subdifferential of $\phi$ . It has been shown in [2] that if the set of minimizers of (2) is nonempty and some mild conditions on the damping function $\lambda$ are satisfied, then the trajectory $x(t)$ converges to a minimizer of (2) as $t\rightarrow+\infty$ . Further investigations on dynamical systems of similar type have been reported in [1] and [21].

The aim of this paper is to perform an asymptotic analysis of the dynamical system (3) in the absence of the convexity of $\psi$ , for constant damping function $\lambda$ and by assuming that the objective function of (2) satisfies the Kurdyka-Łojasiewicz property, in other words is a KL function. To the class of KL functions belong semialgebraic, real subanalytic, uniformly convex and convex functions satisfying a growth condition. The convergence analysis relies on methods of real algebraic geometry introduced by Łojasiewicz [30] and Kurdyka [28] and developed recently in the nonsmooth setting by Attouch, Bolte and Svaiter [7] and Bolte, Sabach and Teboulle [16].

Optimization problems involving KL functions have attracted the interest of the community since the works of Łojasiewicz [30], Simon [34], Haraux and Jendoubi [26]. The most important contributions of the last years in the field include the works of Alvarez, Attouch, Bolte and Redont [3, Section 4] and Bolte, Daniilidis and Lewis [12, Section 4]. Since then, interest in this topic has grown steadily (see [5, 6, 7, 15, 16, 20, 18, 19, 23, 24, 27, 32]).

In the first part of the paper we show that the set of limit points of the trajectory $x$ generated by (3) is entirely contained in the set of critical points of the objective function $\phi+\psi$ , which is seen as the set of zeros of its limiting subdifferential. Under some supplementary conditions, including the Kurdyka-Łojasiewicz property, we prove the convergence of the trajectory $x$ to a critical point of $\phi+\psi$ . Furthermore, convergence rates for the orbits are obtained in terms of the Łojasiewicz exponent of the objective function, provided the latter satisfies the Łojasiewicz property.

In the following we recall some notions and results which are needed throughout the paper. We consider on $\mathbb{R}^{n}$ the Euclidean scalar product and the corresponding norm denoted by $\langle\cdot,\cdot\rangle$ and $\|\cdot\|$ , respectively.

The domain of the function $f:\mathbb{R}^{n}\rightarrow\mathbb{R}\cup\{+\infty\}$ is defined by $\operatorname*{dom}f=\{x\in\mathbb{R}^{n}:f(x)<+\infty\}$ and we say that $f$ is proper, if it has a nonempty domain. For the following generalized subdifferential notions and their basic properties we refer to [17, 31, 33]. Let $f:\mathbb{R}^{n}\rightarrow\mathbb{R}\cup\{+\infty\}$ be a proper and lower semicontinuous function. The Fréchet (viscosity) subdifferential of $f$ at $x\in\operatorname*{dom}f$ is the set

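$$\hat{\partial}f(x)=\left\{v\in\mathbb{R}^{n}:\liminf_{y\to x}\frac{f(y)-f(x)-\langle v,y-x\rangle}{\|y-x\|}\geq 0\right\}.$$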

If $x\notin\operatorname*{dom}f$ , we set $\hat{\partial}f(x):=\emptyset$ . The limiting (Mordukhovich) subdifferential is defined at $x\in\operatorname*{dom}f$ by

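$$\partial_{L}f(x)=\{v\in\mathbb{R}^{n}:\exists x_{k}\to x,\ f(x_{k})\to f(x)\mbox{ and }\exists v_{k}\in\hat{\partial}f(x_{k}),\ v_{k}\to v\mbox{ as }k\to+\infty\},$$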

while for $x\notin\operatorname*{dom}f$ , we set $\partial_{L}f(x):=\emptyset$ . Obviously, $\hat{\partial}f(x)\subseteq\partial_{L}f(x)$ for each $x\in\mathbb{R}^{n}$ .

When $f$ is convex, these subdifferential notions coincide with the convex subdifferential, thus $\hat{\partial}f(x)=\partial_{L}f(x)=\partial f(x)=\{v\in\mathbb{R}^{n}:f(y)\geq f(x)+\left\langle v,y-x\right\rangle\ \forall y\in\mathbb{R}^{n}\}$ for all $x\in\mathbb{R}^{n}$ .
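The convex subgradient inequality above can be probed numerically. The following minimal sketch is an illustration only and is not taken from the paper: it assumes the one-dimensional function $f(x)=|x|$, for which $\partial f(0)=[-1,1]$, and tests candidate slopes on a finite grid (the helper `is_subgradient` and the grid are hypothetical names introduced here). A finite grid can falsify a candidate subgradient but cannot prove membership.

```python
# Illustrative check (assumption: f(x) = |x| on R, not an example from the paper).
# The convex subdifferential of f at 0 is the interval [-1, 1].

def f(x):
    return abs(x)

def is_subgradient(v, x, test_points, tol=1e-12):
    """Return True if f(y) >= f(x) + v*(y - x) holds at every test point y."""
    return all(f(y) >= f(x) + v * (y - x) - tol for y in test_points)

# Hypothetical test grid on [-2, 2].
grid = [k / 100.0 for k in range(-200, 201)]

# Slopes inside [-1, 1] pass the inequality, slopes outside fail it.
inside = [is_subgradient(v, 0.0, grid) for v in (-1.0, -0.5, 0.0, 0.5, 1.0)]
outside = [is_subgradient(v, 0.0, grid) for v in (-1.5, 1.2)]
print(inside, outside)
```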

The following closedness criterion of the graph of the limiting subdifferential will be used in the convergence analysis: if $(x_{k})_{k\in\mathbb{N}}$ and $(v_{k})_{k\in\mathbb{N}}$ are sequences in $\mathbb{R}^{n}$ such that $v_{k}\in\partial_{L}f(x_{k})$ for all $k\in\mathbb{N}$ , $(x_{k},v_{k})\rightarrow(x,v)$ and $f(x_{k})\rightarrow f(x)$ as $k\rightarrow+\infty$ , then $v\in\partial_{L}f(x)$ .

The Fermat rule reads in this nonsmooth setting as follows: if $x\in\mathbb{R}^{n}$ is a local minimizer of $f$ , then $0\in\partial_{L}f(x)$ . We denote by

$$\operatorname*{crit}(f)=\{x\in\mathbb{R}^{n}:0\in\partial_{L}f(x)\}$$

the set of (limiting)-critical points of $f$ .

When $f$ is continuously differentiable around $x\in\mathbb{R}^{n}$ we have $\partial_{L}f(x)=\{\nabla f(x)\}$ . We will also make use of the following subdifferential sum rule: if $f:\mathbb{R}^{n}\rightarrow\mathbb{R}\cup\{+\infty\}$ is proper and lower semicontinuous and $h:\mathbb{R}^{n}\rightarrow\mathbb{R}$ is a continuously differentiable function, then $\partial_{L}(f+h)(x)=\partial_{L}f(x)+\nabla h(x)$ for all $x\in\mathbb{R}^{n}$ .

Further, we recall the notion of a locally absolutely continuous function and state two of its basic properties.

Definition 1

(see [10, 2]) A function $x:[0,+\infty)\rightarrow\mathbb{R}^{n}$ is said to be locally absolutely continuous, if it is absolutely continuous on every interval $[0,T]$ for $T>0$ .

Remark 1

(a) An absolutely continuous function is differentiable almost everywhere, its derivative coincides with its distributional derivative almost everywhere and one can recover the function from its derivative $\dot{x}=y$ by integration.

(b) If $x:[0,T]\rightarrow\mathbb{R}^{n}$ is absolutely continuous for $T>0$ and $B:\mathbb{R}^{n}\rightarrow\mathbb{R}^{n}$ is $L$ -Lipschitz continuous for $L\geq 0$ , then the function $z=B\circ x$ is absolutely continuous, too. Moreover, $z$ is differentiable almost everywhere on $[0,T]$ and the inequality $\|\dot{z}(t)\|\leq L\|\dot{x}(t)\|$ holds for almost every $t\in[0,T]$ .

The following two results, which can be interpreted as continuous versions of the quasi-Fejér monotonicity for sequences, will play an important role in the asymptotic analysis of the trajectories of the dynamical system (3). For their proofs we refer the reader to [2, Lemma 5.1] and [2, Lemma 5.2], respectively.

Lemma 2

Suppose that $F:[0,+\infty)\rightarrow\mathbb{R}$ is locally absolutely continuous and bounded from below and that there exists $G\in L^{1}([0,+\infty))$ such that for almost every $t\in[0,+\infty)$

$$\frac{d}{dt}F(t)\leq G(t).$$

Then there exists $\lim_{t\rightarrow\infty}F(t)\in\mathbb{R}$ .

Lemma 3

If $1\leq p<\infty$ , $1\leq r\leq\infty$ , $F:[0,+\infty)\rightarrow[0,+\infty)$ is locally absolutely continuous, $F\in L^{p}([0,+\infty))$ , $G:[0,+\infty)\rightarrow\mathbb{R}$ , $G\in L^{r}([0,+\infty))$ and for almost every $t\in[0,+\infty)$

$$\frac{d}{dt}F(t)\leq G(t),$$

then $\lim_{t\rightarrow+\infty}F(t)=0$ .

The following result, which is due to Brézis ([22, Lemme 3.3, p. 73]; see also [8, Lemma 3.2]), provides an expression for the derivative of the composition of convex functions with absolutely continuous trajectories.

Lemma 4

Let $f:\mathbb{R}^{n}\rightarrow\mathbb{R}\cup\{+\infty\}$ be a proper, convex and lower semicontinuous function. Let $x\in L^{2}([0,T],\mathbb{R}^{n})$ be absolutely continuous such that $\dot{x}\in L^{2}([0,T],\mathbb{R}^{n})$ and $x(t)\in\operatorname*{dom}f$ for almost every $t\in[0,T]$ . Assume that there exists $\xi\in L^{2}([0,T],\mathbb{R}^{n})$ such that $\xi(t)\in\partial f(x(t))$ for almost every $t\in[0,T]$ . Then the function $t\mapsto f(x(t))$ is absolutely continuous and for almost every $t$ such that $x(t)\in\operatorname*{dom}\partial f$ we have

$$\frac{d}{dt}f(x(t))=\langle\dot{x}(t),h\rangle\ \ \forall h\in\partial f(x(t)).$$

2 Asymptotic analysis

In this paper we investigate the dynamical system

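$$\left\{\begin{array}{l}v(t)\in\partial\phi(x(t))\\ \lambda\dot{x}(t)+\dot{v}(t)+v(t)+\nabla\psi(x(t))=0\\ x(0)=x_{0},\ v(0)=v_{0}\in\partial\phi(x_{0}),\end{array}\right.\tag{4}$$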

where $x_{0},v_{0}\in\mathbb{R}^{n}$ and $\lambda>0$ . We assume that $\phi:\mathbb{R}^{n}\rightarrow\mathbb{R}\cup\{+\infty\}$ is proper, convex and lower semicontinuous and $\psi:\mathbb{R}^{n}\rightarrow\mathbb{R}$ is possibly nonconvex and Fréchet differentiable with $L$ -Lipschitz continuous gradient, for $L>0$ ; in other words, $\|\nabla\psi(x)-\nabla\psi(y)\|\leq L\|x-y\|$ for all $x,y\in\mathbb{R}^{n}$ .

In the following we specify what we mean by a solution of the dynamical system (4).

Definition 2

Let $x_{0},v_{0}\in\mathbb{R}^{n}$ and $\lambda>0$ be such that $v_{0}\in\partial\phi(x_{0})$ . We say that the pair $(x,v)$ is a strong global solution of (4) if the following properties are satisfied:

(i) $x,v:[0,+\infty)\rightarrow\mathbb{R}^{n}$ are locally absolutely continuous functions;

(ii) $v(t)\in\partial\phi(x(t))$ for every $t\in[0,+\infty)$ ;

(iii) $\lambda\dot{x}(t)+\dot{v}(t)+v(t)+\nabla\psi(x(t))=0$ for almost every $t\in[0,+\infty)$ ;

(iv) $x(0)=x_{0},v(0)=v_{0}$ .

The existence and uniqueness of the trajectories generated by (4) have been investigated in [2]. A careful look at the proofs in [2] reveals that the existence results there do not use the convexity of $\psi$ , only the Lipschitz continuity of its gradient.
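To get a feeling for the trajectories of (4) when $\phi$ is nonsmooth, one can integrate the system numerically. The sketch below is an illustration under stated assumptions and not the construction used in the paper or in [2]: it introduces the auxiliary variable $z=\lambda x+v$, for which $v\in\partial\phi(x)$ is equivalent to $x=(\lambda I+\partial\phi)^{-1}(z)$, so that the second relation in (4) becomes $\dot{z}=-(v+\nabla\psi(x))$. The choices $\phi(x)=|x|$, $\psi(x)=2\cos x$, $\lambda=1$, the explicit Euler scheme and the names `resolvent` and `simulate` are all hypothetical.

```python
import math

lam = 1.0  # constant damping parameter lambda > 0 (assumed value)

def grad_psi(x):
    """Gradient of the assumed nonconvex psi(x) = 2*cos(x); it is 2-Lipschitz."""
    return -2.0 * math.sin(x)

def resolvent(z):
    """Solve z = lam*x + v with v in the subdifferential of |.| at x.

    x is obtained by soft-thresholding and v by clipping z to [-1, 1].
    """
    x = math.copysign(max(abs(z) - 1.0, 0.0), z) / lam
    v = max(-1.0, min(1.0, z))
    return x, v

def simulate(x0, v0, step=0.01, n_steps=2000):
    """Explicit Euler on z' = -(v + grad_psi(x)), where z = lam*x + v."""
    z = lam * x0 + v0
    for _ in range(n_steps):
        x, v = resolvent(z)
        z -= step * (v + grad_psi(x))
    return resolvent(z)

# v0 = 1 is the only element of the subdifferential of |.| at x0 = 2.
x_end, v_end = simulate(x0=2.0, v0=1.0)
residual = v_end + grad_psi(x_end)  # should be close to 0 at a critical point
print(x_end, v_end, residual)
```

For this starting point the computed trajectory settles near $5\pi/6$, where $1-2\sin x=0$, that is, at a critical point of $|x|+2\cos x$ on the positive half-line.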

We start our convergence analysis with the following technical result.

Lemma 5

Let $x_{0},v_{0}\in\mathbb{R}^{n}$ and $\lambda>0$ be such that $v_{0}\in\partial\phi(x_{0})$ . Let $(x,v):[0,+\infty)\rightarrow\mathbb{R}^{n}\times\mathbb{R}^{n}$ be the unique strong global solution of the dynamical system (4). Then the following statements are true:

(i) $\langle\dot{x}(t),\dot{v}(t)\rangle\geq 0$ for almost every $t\in[0,+\infty)$ ;

(ii) $\frac{d}{dt}\phi(x(t))=\langle\dot{x}(t),v(t)\rangle$ for almost every $t\in[0,+\infty)$ .

Proof.

(i) See [10, Proposition 3.1]. The proof relies on the first relation in (4) and the monotonicity of the convex subdifferential.

(ii) The proof makes use of Lemma 4. This relation has been already stated in [2, relation (51)] without making use in its proof of the convexity of $\psi$ . $\blacksquare$

Lemma 6

Let $x_{0},v_{0}\in\mathbb{R}^{n}$ and $\lambda>0$ be such that $v_{0}\in\partial\phi(x_{0})$ . Let $(x,v):[0,+\infty)\rightarrow\mathbb{R}^{n}\times\mathbb{R}^{n}$ be the unique strong global solution of the dynamical system (4). Suppose that $\phi+\psi$ is bounded from below. Then the following statements are true:

(i) $\frac{d}{dt}(\phi+\psi)(x(t))+\lambda\|\dot{x}(t)\|^{2}+\langle\dot{x}(t),\dot{v}(t)\rangle=0$ for almost every $t\geq 0$ ;

(ii) $\dot{x},\dot{v},v+\nabla\psi(x)\in L^{2}([0,+\infty);\mathbb{R}^{n})$ , $\langle\dot{x}(\cdot),\dot{v}(\cdot)\rangle\in L^{1}([0,+\infty);\mathbb{R})$ and $\lim_{t\rightarrow+\infty}\dot{x}(t)=\lim_{t\rightarrow+\infty}\dot{v}(t)=\lim_{t\rightarrow+\infty}\big(v(t)+\nabla\psi(x(t))\big)=0$ ;

(iii) $\exists\lim_{t\rightarrow+\infty}(\phi+\psi)\big(x(t)\big)\in\mathbb{R}$ .

Proof.

(i) The statement follows by taking the inner product of both sides of the second relation in (4) with $\dot{x}(t)$ and then using Lemma 5(ii).

(ii) After integrating the relation (i) and by taking into account that $\phi+\psi$ is bounded from below, we easily derive $\dot{x}\in L^{2}([0,+\infty);\mathbb{R}^{n})$ and $\langle\dot{x}(\cdot),\dot{v}(\cdot)\rangle\in L^{1}([0,+\infty);\mathbb{R})$ (see also Lemma 5(i)). Further, by using the second relation in (4), Remark 1(b) and Lemma 5(i), we obtain for almost every $t\geq 0$ :

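$$\begin{aligned}\frac{d}{dt}\left(\frac{1}{2}\|v(t)+\nabla\psi(x(t))\|^{2}\right)&=\left\langle\dot{v}(t)+\frac{d}{dt}\nabla\psi(x(t)),v(t)+\nabla\psi(x(t))\right\rangle\\ &=-\left\langle\dot{v}(t)+\frac{d}{dt}\nabla\psi(x(t)),\lambda\dot{x}(t)+\dot{v}(t)\right\rangle\\ &\leq-\|\dot{v}(t)\|^{2}+\lambda L\|\dot{x}(t)\|^{2}+L\|\dot{x}(t)\|\|\dot{v}(t)\|\\ &\leq-\|\dot{v}(t)\|^{2}+\lambda L\|\dot{x}(t)\|^{2}+L^{2}\|\dot{x}(t)\|^{2}+\frac{1}{4}\|\dot{v}(t)\|^{2},\end{aligned}$$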

hence

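$$\frac{d}{dt}\left(\frac{1}{2}\|v(t)+\nabla\psi(x(t))\|^{2}\right)+\frac{3}{4}\|\dot{v}(t)\|^{2}\leq L(\lambda+L)\|\dot{x}(t)\|^{2}.\tag{5}$$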

Since $\dot{x}\in L^{2}([0,+\infty);\mathbb{R}^{n})$ , a simple integration argument yields that $\dot{v}\in L^{2}([0,+\infty);\mathbb{R}^{n})$ . Considering the second equation in (4), we further obtain that $v+\nabla\psi(x)\in L^{2}([0,+\infty);\mathbb{R}^{n})$ . This fact combined with Lemma 3 and (5) implies that $\lim_{t\rightarrow+\infty}\big{(}v(t)+\nabla\psi(x(t))\big{)}=0$ . From the second equation in (4) we obtain

$$\lim_{t\to+\infty}\big(\lambda\dot{x}(t)+\dot{v}(t)\big)=0.\tag{6}$$

Further, from Lemma 5(i) we have for almost every $t\geq 0$

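$$\|\dot{v}(t)\|^{2}\leq\lambda^{2}\|\dot{x}(t)\|^{2}+2\lambda\langle\dot{x}(t),\dot{v}(t)\rangle+\|\dot{v}(t)\|^{2}=\|\lambda\dot{x}(t)+\dot{v}(t)\|^{2},$$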

hence from (6) we get $\lim_{t\rightarrow+\infty}\dot{v}(t)=0$ . Combining this with (6) we conclude that $\lim_{t\rightarrow+\infty}\dot{x}(t)=0$ .

(iii) From (i) and Lemma 5(i) it follows that

$$\frac{d}{dt}(\phi+\psi)(x(t))\leq 0\tag{7}$$

for almost every $t\geq 0$ . The conclusion follows by applying Lemma 2. $\blacksquare$

Lemma 7

Let $x_{0},v_{0}\in\mathbb{R}^{n}$ and $\lambda>0$ be such that $v_{0}\in\partial\phi(x_{0})$ . Let $(x,v):[0,+\infty)\rightarrow\mathbb{R}^{n}\times\mathbb{R}^{n}$ be the unique strong global solution of the dynamical system (4). Suppose that $\phi+\psi$ is bounded from below. Let $(t_{k})_{k\in\mathbb{N}}$ be a sequence such that $t_{k}\rightarrow+\infty$ and $x(t_{k})\rightarrow\overline{x}\in\mathbb{R}^{n}\mbox{ as }k\rightarrow+\infty$ . Then

$$0\in\partial_{L}(\phi+\psi)(\overline{x}).$$

Proof.

From the first relation in (4) and the subdifferential sum rule of the limiting subdifferential we derive for any $k\in\mathbb{N}$

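$$v(t_{k})+\nabla\psi(x(t_{k}))\in\partial\phi(x(t_{k}))+\nabla\psi(x(t_{k}))=\partial_{L}(\phi+\psi)(x(t_{k})).\tag{8}$$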

Further, we have

$$x(t_{k})\to\overline{x}\mbox{ as }k\to+\infty\tag{9}$$

and (see Lemma 6(ii))

$$v(t_{k})+\nabla\psi(x(t_{k}))\to 0\mbox{ as }k\to+\infty.\tag{10}$$

According to the closedness property of the limiting subdifferential, the proof is complete as soon as we show that

$$(\phi+\psi)(x(t_{k}))\to(\phi+\psi)(\overline{x})\mbox{ as }k\to+\infty.\tag{11}$$

From (9), (10) and the continuity of $\nabla\psi$ we get

$$v(t_{k})\to-\nabla\psi(\overline{x})\mbox{ as }k\to+\infty.\tag{12}$$

Further, since $v({t_{k}})\in\partial\phi(x(t_{k}))$ , we have

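$$\phi(\overline{x})\geq\phi(x(t_{k}))+\langle v(t_{k}),\overline{x}-x(t_{k})\rangle\ \ \forall k\in\mathbb{N}.$$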

Combining this with (9) and (12) we derive

$$\limsup_{k\to+\infty}\phi(x(t_{k}))\leq\phi(\overline{x}).$$

A direct consequence of the lower semicontinuity of $\phi$ is the relation

$$\lim_{k\to+\infty}\phi(x(t_{k}))=\phi(\overline{x}),$$

which combined with (9) and the continuity of $\psi$ yields (11). $\blacksquare$

We define the limit set of $x$ as

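$$\omega(x):=\{\overline{x}\in\mathbb{R}^{n}:\exists t_{k}\to+\infty\mbox{ such that }x(t_{k})\to\overline{x}\mbox{ as }k\to+\infty\}.$$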

We use also the distance function to a set, defined for $A\subseteq\mathbb{R}^{n}$ as $\operatorname*{dist}(x,A)=\inf_{y\in A}\|x-y\|$ for all $x\in\mathbb{R}^{n}$ .

Lemma 8

Let $x_{0},v_{0}\in\mathbb{R}^{n}$ and $\lambda>0$ be such that $v_{0}\in\partial\phi(x_{0})$ . Let $(x,v):[0,+\infty)\rightarrow\mathbb{R}^{n}\times\mathbb{R}^{n}$ be the unique strong global solution of the dynamical system (4). Suppose that $\phi+\psi$ is bounded from below and $x$ is bounded. Then the following statements are true:

(i) $\omega(x)\subseteq\operatorname*{crit}(\phi+\psi)$ ;

(ii) $\omega(x)$ is nonempty, compact and connected;

(iii) $\lim_{t\to+\infty}\operatorname*{dist}\big(x(t),\omega(x)\big)=0$ ;

(iv) $\phi+\psi$ is finite and constant on $\omega(x)$ .

Proof.

Statement (i) is a direct consequence of Lemma 7.

Statement (ii) is a classical result from [25]. We also refer the reader to the proof of Theorem 4.1 in [3], where it is shown that the properties of $\omega(x)$ of being nonempty, compact and connected are generic for bounded trajectories fulfilling $\lim_{t\rightarrow+\infty}{\dot{x}(t)}=0$ .

Statement (iii) follows immediately since $\omega(x)$ is nonempty.

(iv) According to Lemma 6(iii), there exists $\lim_{t\rightarrow+\infty}(\phi+\psi)\big(x(t)\big)\in\mathbb{R}$ . Let us denote by $l\in\mathbb{R}$ this limit. Take $\overline{x}\in\omega(x)$ . Then there exists $t_{k}\rightarrow+\infty$ such that $x(t_{k})\rightarrow\overline{x}$ as $k\rightarrow+\infty$ . From the proof of Lemma 7 we have that $(\phi+\psi)(x(t_{k}))\rightarrow(\phi+\psi)(\overline{x})\mbox{ as }k\rightarrow+\infty$ , hence $(\phi+\psi)(\overline{x})=l$ . $\blacksquare$

Remark 9

Suppose that $\phi+\psi$ is coercive, in other words,

$$\lim_{\|u\|\to+\infty}(\phi+\psi)(u)=+\infty.$$

Let $x_{0},v_{0}\in\mathbb{R}^{n}$ and $\lambda>0$ be such that $v_{0}\in\partial\phi(x_{0})$ . Let $(x,v):[0,+\infty)\rightarrow\mathbb{R}^{n}\times\mathbb{R}^{n}$ be the unique strong global solution of the dynamical system (4). Then $\phi+\psi$ is bounded from below and $x$ is bounded.

Indeed, since $\phi+\psi$ is a proper, lower semicontinuous and coercive function, it follows that $\inf_{u\in\mathbb{R}^{n}}[\phi(u)+\psi(u)]$ is finite and the infimum is attained. Hence $\phi+\psi$ is bounded from below. On the other hand, from (7) it follows

$$(\phi+\psi)(x(T))\leq(\phi+\psi)(x_{0})\ \ \forall T\geq 0.$$

Since $\phi+\psi$ is coercive, the lower level sets of $\phi+\psi$ are bounded, hence the above inequality yields that $x$ is bounded. Notice that in this case $v$ is bounded too, due to the relation $\lim_{t\rightarrow+\infty}\big{(}v(t)+\nabla\psi(x(t))\big{)}=0$ (Lemma 6(ii)) and the Lipschitz continuity of $\nabla\psi$ .

3 Convergence of the trajectory when the objective function satisfies the Kurdyka-Łojasiewicz property

In order to enforce the convergence of the whole trajectory $x(t)$ to a critical point of the objective function as $t\rightarrow+\infty$ more involved analytic features of the functions have to be considered.

A crucial role in the asymptotic analysis of the dynamical system (4) is played by the class of functions satisfying the Kurdyka-Łojasiewicz property. For $\eta\in(0,+\infty]$ , we denote by $\Theta_{\eta}$ the class of concave and continuous functions $\varphi:[0,\eta)\rightarrow[0,+\infty)$ such that $\varphi(0)=0$ , $\varphi$ is continuously differentiable on $(0,\eta)$ , continuous at $0$ and $\varphi^{\prime}(s)>0$ for all $s\in(0,\eta)$ .

Definition 3

(Kurdyka-Łojasiewicz property) Let $f:\mathbb{R}^{n}\rightarrow\mathbb{R}\cup\{+\infty\}$ be a proper and lower semicontinuous function. We say that $f$ satisfies the Kurdyka-Łojasiewicz (KL) property at $\overline{x}\in\operatorname*{dom}\partial_{L}f=\{x\in\mathbb{R}^{n}:\partial_{L}f(x)\neq\emptyset\}$ , if there exist $\eta\in(0,+\infty]$ , a neighborhood $U$ of $\overline{x}$ and a function $\varphi\in\Theta_{\eta}$ such that for all $x$ in the intersection

$$U\cap\{x\in\mathbb{R}^{n}:f(\overline{x})<f(x)<f(\overline{x})+\eta\}$$

the following inequality holds

$$\varphi^{\prime}(f(x)-f(\overline{x}))\operatorname*{dist}(0,\partial_{L}f(x))\geq 1.$$

If $f$ satisfies the KL property at each point in $\operatorname*{dom}\partial_{L}f$ , then $f$ is called a KL function.

The origins of this notion go back to the pioneering work of Łojasiewicz [30], where it is proved that for a real-analytic function $f:\mathbb{R}^{n}\rightarrow\mathbb{R}$ and a critical point $\overline{x}\in\mathbb{R}^{n}$ (that is $\nabla f(\overline{x})=0$ ), there exists $\theta\in[1/2,1)$ such that the function $|f-f(\overline{x})|^{\theta}\|\nabla f\|^{-1}$ is bounded around $\overline{x}$ . This corresponds to the situation when $\varphi(s)=Cs^{1-\theta}$ for $C>0$ . The result of Łojasiewicz allows the interpretation of the KL property as a re-parametrization of the function values in order to avoid flatness around the critical points. Kurdyka [28] extended this property to differentiable functions definable in o-minimal structures. Further extensions to the nonsmooth setting can be found in [12, 6, 13, 14].
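The correspondence between the Łojasiewicz exponent $\theta$ and the desingularizing function $\varphi(s)=Cs^{1-\theta}$ can be checked on a simple family. The sketch below is an illustration only and is not taken from the paper: it assumes $f(x)=|x|^{p}$ on $\mathbb{R}$ with $p\geq 2$, critical point $\overline{x}=0$, exponent $\theta=1-1/p$ and $C=1$, for which the product $\varphi^{\prime}(f(x)-f(\overline{x}))\,|f^{\prime}(x)|$ equals $C$ for every $x\neq 0$. The function name `kl_product` is hypothetical.

```python
def kl_product(x, p, C=1.0):
    """Evaluate phi'(f(x) - f(0)) * |f'(x)| for f(x) = |x|**p, x != 0.

    Uses phi(s) = C * s**(1 - theta) with theta = 1 - 1/p (assumed choice).
    """
    theta = 1.0 - 1.0 / p
    gap = abs(x) ** p                              # f(x) - f(0)
    dphi = C * (1.0 - theta) * gap ** (-theta)     # derivative of phi at gap
    grad_norm = p * abs(x) ** (p - 1)              # |f'(x)|
    return dphi * grad_norm

# p = 2 gives theta = 1/2, p = 4 gives theta = 3/4; both lie in [1/2, 1).
products = [kl_product(x, p) for p in (2, 4) for x in (1e-3, 0.1, 0.5, 2.0)]
print(products)
```

The flatter the function is near the critical point (larger $p$), the closer $\theta$ is to $1$, which is the sense in which $\varphi$ re-parametrizes the function values to avoid flatness.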

One of the remarkable properties of the KL functions is their ubiquity in applications (see [16]). We refer the reader to [12, 6, 14, 16, 13, 7, 5] and the references therein for more properties of the KL functions and illustrating examples.

In the analysis below the following uniform KL property given in [16, Lemma 6] will be used.

Lemma 10

Let $\Omega\subseteq\mathbb{R}^{n}$ be a compact set and let $f:\mathbb{R}^{n}\rightarrow\mathbb{R}\cup\{+\infty\}$ be a proper and lower semicontinuous function. Assume that $f$ is constant on $\Omega$ and that it satisfies the KL property at each point of $\Omega$ . Then there exist $\varepsilon,\eta>0$ and $\varphi\in\Theta_{\eta}$ such that for all $\overline{x}\in\Omega$ and all $x$ in the intersection

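$$\{x\in\mathbb{R}^{n}:\operatorname*{dist}(x,\Omega)<\varepsilon\}\cap\{x\in\mathbb{R}^{n}:f(\overline{x})<f(x)<f(\overline{x})+\eta\}$$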

the inequality

$$\varphi^{\prime}(f(x)-f(\overline{x}))\operatorname*{dist}(0,\partial_{L}f(x))\geq 1$$

holds.

For reasons outlined in Remark 14 below, we prove the convergence of the trajectory $x(t)$ generated by (4) as $t\rightarrow+\infty$ under the assumption that $\phi:\mathbb{R}^{n}\to\mathbb{R}$ is convex and differentiable with $\rho^{-1}$ -Lipschitz continuous gradient for $\rho>0.$ In these circumstances the dynamical system (4) reads

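$$\left\{\begin{array}{l}v(t)=\nabla\phi(x(t))\\ \lambda\dot{x}(t)+\dot{v}(t)+v(t)+\nabla\psi(x(t))=0\\ x(0)=x_{0},\ v(0)=v_{0}=\nabla\phi(x_{0}),\end{array}\right.\tag{15}$$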

where $x_{0},v_{0}\in\mathbb{R}^{n}$ and $\lambda>0$ .

Remark 11

We notice that we do not require second order assumptions for $\phi$ . However, we note that if $\phi$ is a twice continuously differentiable function, then the dynamical system (15) can be equivalently written as

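$$\left\{\begin{array}{l}\lambda\dot{x}(t)+\nabla^{2}\phi(x(t))\dot{x}(t)+\nabla\phi(x(t))+\nabla\psi(x(t))=0\\ x(0)=x_{0},\ v(0)=v_{0}=\nabla\phi(x_{0}),\end{array}\right.\tag{16}$$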

where $x_{0},v_{0}\in\mathbb{R}^{n}$ and $\lambda>0$ . This is a differential equation with a Hessian-driven damping term. We refer the reader to [3] and [9] for more insights into dynamical systems with Hessian-driven damping terms and for motivations for considering them. Moreover, as in [9], the driving forces have been split as $\nabla\phi+\nabla\psi$ , where $\nabla\psi$ stands for classical smooth driving forces and $\nabla\phi$ incorporates the contact forces.
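When $\phi$ is twice continuously differentiable, the Hessian-driven form $\lambda\dot{x}(t)+\nabla^{2}\phi(x(t))\dot{x}(t)+\nabla\phi(x(t))+\nabla\psi(x(t))=0$ can be integrated directly by solving for $\dot{x}$. The sketch below is an illustration under stated assumptions and not a scheme from the paper: it takes the one-dimensional choices $\phi(x)=x^{2}/2$ (so $\rho=1$), $\psi(x)=2\cos x$, $\lambda=1$ and an explicit Euler step, all of which are hypothetical. It records the objective values along the discrete trajectory so that their monotone decrease can be inspected.

```python
import math

lam = 1.0  # constant damping parameter lambda > 0 (assumed value)

def grad_phi(x):
    return x                       # phi(x) = x**2 / 2, convex

def hess_phi(x):
    return 1.0                     # second derivative of phi

def grad_psi(x):
    return -2.0 * math.sin(x)      # psi(x) = 2*cos(x), nonconvex

def objective(x):
    return 0.5 * x * x + 2.0 * math.cos(x)

def integrate(x0, step=0.01, n_steps=3000):
    """Explicit Euler on (lam + phi''(x)) * x' = -(grad_phi(x) + grad_psi(x))."""
    xs = [x0]
    for _ in range(n_steps):
        x = xs[-1]
        x_dot = -(grad_phi(x) + grad_psi(x)) / (lam + hess_phi(x))
        xs.append(x + step * x_dot)
    return xs

traj = integrate(0.5)
values = [objective(x) for x in traj]
x_end = traj[-1]
grad_end = grad_phi(x_end) + grad_psi(x_end)   # close to 0 at a critical point
print(x_end, grad_end)
```

In this example the origin is a local maximizer of $\phi+\psi$, and the trajectory started at $0.5$ moves away from it towards the positive solution of $x=2\sin x$.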

In this context, an improved version of Lemma 5(i) can be stated.

Lemma 12

Let $x_{0},v_{0}\in\mathbb{R}^{n}$ and $\lambda>0$ be such that $v_{0}=\nabla\phi(x_{0})$ . Let $(x,v):[0,+\infty)\rightarrow\mathbb{R}^{n}\times\mathbb{R}^{n}$ be the unique strong global solution of the dynamical system (15). Then:

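$$\langle\dot{x}(t),\dot{v}(t)\rangle\geq\rho\|\dot{v}(t)\|^{2}\mbox{ for almost every }t\in[0,+\infty).\tag{17}$$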

Proof.

Take an arbitrary $\delta>0$ . For $t\geq 0$ we have

$$\langle x(t+\delta)-x(t),v(t+\delta)-v(t)\rangle\geq\rho\|v(t+\delta)-v(t)\|^{2},\tag{18}$$

where the inequality follows from the Baillon-Haddad Theorem [11, Corollary 18.16]. The conclusion follows by dividing (18) by $\delta^{2}$ and by taking the limit as $\delta$ converges to zero from above. $\blacksquare$
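The Baillon-Haddad Theorem states that the gradient of a convex function with $\rho^{-1}$ -Lipschitz continuous gradient is $\rho$ -cocoercive, that is $\langle\nabla\phi(x)-\nabla\phi(y),x-y\rangle\geq\rho\|\nabla\phi(x)-\nabla\phi(y)\|^{2}$. The sketch below probes this inequality numerically. It is an illustration only: the choice $\phi(x)=\sum_{i}\log(1+e^{x_{i}})$, whose gradient is the componentwise sigmoid and is $\frac{1}{4}$ -Lipschitz (so $\rho=4$), and the helper `cocoercivity_gap` are assumptions made here, not objects from the paper.

```python
import math
import random

RHO = 4.0  # gradient of the assumed phi is (1/4)-Lipschitz, so rho = 4

def grad_phi(x):
    """Gradient of phi(x) = sum_i log(1 + exp(x_i)): componentwise sigmoid."""
    return [1.0 / (1.0 + math.exp(-c)) for c in x]

def cocoercivity_gap(x, y):
    """<grad phi(x) - grad phi(y), x - y> - rho * ||grad phi(x) - grad phi(y)||^2."""
    gx, gy = grad_phi(x), grad_phi(y)
    inner = sum((a - b) * (c - d) for a, b, c, d in zip(gx, gy, x, y))
    sq_norm = sum((a - b) ** 2 for a, b in zip(gx, gy))
    return inner - RHO * sq_norm

rng = random.Random(0)  # fixed seed so the sampled points are reproducible
gaps = []
for _ in range(5000):
    x = [rng.uniform(-5.0, 5.0) for _ in range(3)]
    y = [rng.uniform(-5.0, 5.0) for _ in range(3)]
    gaps.append(cocoercivity_gap(x, y))

smallest_gap = min(gaps)   # cocoercivity predicts a nonnegative value
print(smallest_gap >= -1e-12)
```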

We are now in the position to prove the convergence of the trajectories generated by (15).

Theorem 13

Let $x_{0},v_{0}\in\mathbb{R}^{n}$ and $\lambda>0$ be such that $v_{0}=\nabla\phi(x_{0})$ . Let $(x,v):[0,+\infty)\rightarrow\mathbb{R}^{n}\times\mathbb{R}^{n}$ be the unique strong global solution of the dynamical system (15). Suppose that $\phi+\psi$ is a KL function which is bounded from below and $x$ is bounded. Then the following statements are true:

(i) $\dot{x},\dot{v},\nabla\phi(x)+\nabla\psi(x)\in L^{2}([0,+\infty);\mathbb{R}^{n})$ , $\langle\dot{x}(\cdot),\dot{v}(\cdot)\rangle\in L^{1}([0,+\infty);\mathbb{R})$ and $\lim_{t\rightarrow+\infty}\dot{x}(t)=\lim_{t\rightarrow+\infty}\dot{v}(t)=\lim_{t\rightarrow+\infty}\big(\nabla\phi(x(t))+\nabla\psi(x(t))\big)=0$ ;

(ii) there exists $\overline{x}\in\operatorname*{crit}(\phi+\psi)$ (that is $\nabla(\phi+\psi)(\overline{x})=0$ ) such that $\lim_{t\rightarrow+\infty}x(t)=\overline{x}$ .

Proof.

According to Lemma 8, we can choose an element $\overline{x}\in\operatorname*{crit}(\phi+\psi)$ (that is $\nabla(\phi+\psi)(\overline{x})=0$ ) such that $\overline{x}\in\omega(x)$ . According to Lemma 6(iii), the proof of Lemma 7 and the proof of Lemma 8(iv), we have

$$\lim_{t\to+\infty}(\phi+\psi)(x(t))=(\phi+\psi)(\overline{x}).$$

We consider the following two cases.

I. There exists $\overline{t}\geq 0$ such that

$$(\phi+\psi)(x(\overline{t}))=(\phi+\psi)(\overline{x}).$$

From (7) we obtain for every $t\geq\overline{t}$ that

$$(\phi+\psi)(x(t))\leq(\phi+\psi)(x(\overline{t}))=(\phi+\psi)(\overline{x}).$$

Thus $(\phi+\psi)(x(t))=(\phi+\psi)(\overline{x})$ for every $t\geq\overline{t}$ . According to Lemma 6(i) and (17), it follows that $\dot{x}(t)=\dot{v}(t)=0$ for almost every $t\in[\overline{t},+\infty)$ , hence $x$ and $v$ are constant on $[\overline{t},+\infty)$ and the conclusion follows.

II. For every $t\geq 0$ it holds $(\phi+\psi)(x(t))>(\phi+\psi)(\overline{x})$ . Take $\Omega:=\omega(x)$ .

By using Lemma 8(ii), (iv) and the fact that $\phi+\psi$ is a KL function, by Lemma 10, there exist positive numbers $\epsilon$ and $\eta$ and a concave function $\varphi\in\Theta_{\eta}$ such that for all $u$ belonging to the intersection

[TABLE]

one has

[TABLE]

Let $t_{1}\geq 0$ be such that $(\phi+\psi)(x(t))<(\phi+\psi)(\overline{x})+\eta$ for all $t\geq t_{1}$ . Since $\lim_{t\to+\infty}\operatorname*{dist}\big{(}x(t),\Omega\big{)}=0$ (see Lemma 8(iii)), there exists $t_{2}\geq 0$ such that for all $t\geq t_{2}$ the inequality $\operatorname*{dist}\big{(}x(t),\Omega\big{)}<\epsilon$ holds. Hence for all $t\geq T:=\max\{t_{1},t_{2}\}$ , $x(t)$ belongs to the intersection in (19). Thus, according to (20), for every $t\geq T$ we have

[TABLE]

From the second equation in (15) we obtain for almost every $t\in[T,+\infty)$

[TABLE]

By using Lemma 6(i), the fact that $\varphi^{\prime}>0$ and

[TABLE]

we further deduce that for almost every $t\in[T,+\infty)$ it holds

[TABLE]

We now invoke Lemma 12 and obtain

[TABLE]

Let $\alpha>0$ (not depending on $t$ ) be such that

[TABLE]

One can for instance choose $\alpha>0$ such that $2\alpha\max(\lambda,1)\leq\min(\lambda,\rho)$ . From (24) we derive the inequality

[TABLE]

which holds for almost every $t\geq T$ . Since $\varphi$ is bounded from below, it follows by integration that $\dot{x},\dot{v}\in L^{1}([0,+\infty);\mathbb{R}^{n})$ . From here we obtain that $\lim_{t\rightarrow+\infty}x(t)$ exists and the conclusion follows from the results obtained in the previous section. $\blacksquare$
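The statement of Theorem 13 can be illustrated numerically. The following is a minimal sketch, not the authors' method: it assumes that (15) is the smooth version of the system in the abstract, with $v(t)=\nabla\phi(x(t))$, and that $\phi$ is twice differentiable, so that $\dot{v}=\nabla^{2}\phi(x)\dot{x}$ and the system reads $(\lambda I+\nabla^{2}\phi(x))\dot{x}=-(\nabla\phi(x)+\nabla\psi(x))$. The test functions $\phi(x)=\frac{1}{2}\|x\|^{2}$ and $\psi(x)=2\sum_{i}\cos(x_{i})$, the explicit Euler scheme, the step size and the name `simulate` are all illustrative choices.

```python
import numpy as np

def simulate(x0, lam=1.0, h=1e-2, steps=5000):
    """Explicit Euler for (lam*I + Hess phi(x)) x' = -(grad phi(x) + grad psi(x)).

    Illustrative choices (not from the paper):
      phi(x) = 0.5*||x||^2      (convex, Lipschitz gradient)
      psi(x) = 2*sum(cos(x_i))  (smooth, nonconvex, Lipschitz gradient)
    """
    grad_phi = lambda x: x
    hess_phi = lambda x: np.eye(x.size)
    grad_psi = lambda x: -2.0 * np.sin(x)

    x = np.asarray(x0, dtype=float)
    for _ in range(steps):
        g = grad_phi(x) + grad_psi(x)
        # Newton-like direction: solve the regularized linear system
        x_dot = np.linalg.solve(lam * np.eye(x.size) + hess_phi(x), -g)
        x = x + h * x_dot
    return x, np.linalg.norm(grad_phi(x) + grad_psi(x))

x_final, grad_norm = simulate([0.5, -3.0])
print(x_final, grad_norm)
```

In this example each coordinate of the discrete trajectory settles at a nonzero solution of $x=2\sin x$, which is a critical point of $\phi+\psi$, and the gradient norm of $\phi+\psi$ tends to zero, in line with statements (i) and (ii).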

Remark 14

Taking a closer look at the above proof, one can notice that the inequality (23) can also be obtained when $\phi:\mathbb{R}^{n}\to\mathbb{R}\cup\{+\infty\}$ is a (possibly nonsmooth) proper, convex and lower semicontinuous function. However, in order to conclude that $\dot{x}\in L^{1}([0,+\infty);\mathbb{R}^{n})$ , the inequality obtained in Lemma 5(i) is not enough. The improved version stated in Lemma 12 is crucial in the convergence analysis.

If one attempts to obtain in the nonsmooth setting the inequality stated in Lemma 12, from the proof of Lemma 12 it becomes clear that one would need the inequality

[TABLE]

for all $(x_{1},x_{2})\in\mathbb{R}^{n}\times\mathbb{R}^{n}$ and all $(\xi_{1}^{*},\xi^{*}_{2})\in\mathbb{R}^{n}\times\mathbb{R}^{n}$ such that $\xi_{1}^{*}\in\partial\phi(x_{1})$ and $\xi_{2}^{*}\in\partial\phi(x_{2})$ . This is nothing else than (see for example [11])

[TABLE]

for all $(x_{1},x_{2})\in\mathbb{R}^{n}\times\mathbb{R}^{n}$ and all $(\xi_{1}^{*},\xi^{*}_{2})\in\mathbb{R}^{n}\times\mathbb{R}^{n}$ such that $x_{1}\in\partial\phi^{*}(\xi_{1}^{*})$ and $x_{2}\in\partial\phi^{*}(\xi_{2}^{*})$ . Here $\phi^{*}:\mathbb{R}^{n}\to\overline{\mathbb{R}}$ denotes the Fenchel conjugate of $\phi$ , defined for all $x^{*}\in\mathbb{R}^{n}$ by $\phi^{*}(x^{*})=\sup_{x\in\mathbb{R}^{n}}\{\langle x^{*},x\rangle-\phi(x)\}$ . The latter inequality is equivalent to $\partial\phi^{*}$ being $\rho$ -strongly monotone, which is further equivalent (see [35, Theorem 3.5.10] or [11]) to $\phi^{*}$ being strongly convex. This is the same as asking that $\phi$ be differentiable on the whole of $\mathbb{R}^{n}$ with Lipschitz-continuous gradient (see [11, Theorem 18.15]). In conclusion, the smooth setting provides the necessary prerequisites for obtaining the result in Lemma 12 and, finally, Theorem 13.
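The chain of equivalences in this remark can be checked by hand on a quadratic. The following worked example is only an illustration: the function $\phi(x)=\frac{L}{2}\|x\|^{2}$ with $L>0$ is an assumed test case, and the identification $\rho=1/L$ holds for this example only.

```latex
\begin{align*}
\phi(x) &= \tfrac{L}{2}\|x\|^{2},
  \qquad \nabla\phi(x) = Lx
  \quad\text{(Lipschitz continuous with constant } L\text{)},\\
\phi^{*}(\xi) &= \sup_{x\in\mathbb{R}^{n}}\big\{\langle\xi,x\rangle-\tfrac{L}{2}\|x\|^{2}\big\}
  = \tfrac{1}{2L}\|\xi\|^{2},
  \qquad \partial\phi^{*}(\xi)=\big\{\tfrac{1}{L}\xi\big\},\\
\langle x_{1}-x_{2},\,\xi_{1}-\xi_{2}\rangle
  &= \tfrac{1}{L}\|\xi_{1}-\xi_{2}\|^{2}
  \qquad\text{for } x_{i}=\tfrac{1}{L}\xi_{i}\in\partial\phi^{*}(\xi_{i}),\ i=1,2.
\end{align*}
```

Here $\partial\phi^{*}$ is strongly monotone with modulus $1/L$ and $\phi^{*}$ is strongly convex, while $\phi$ is differentiable with $L$-Lipschitz gradient, as the remark predicts.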

4 Convergence rates

In this section we investigate the convergence rates of the trajectories $(x(t),v(t))$ generated by the dynamical system (15) as $t\rightarrow+\infty$ . When solving optimization problems involving KL functions, convergence rates have been proved to depend on the so-called Łojasiewicz exponent (see [30, 12, 5, 24]). The main result of this section refers to the KL functions which satisfy Definition 3 for $\varphi(s)=Cs^{1-\theta}$ , where $C>0$ and $\theta\in(0,1)$ . We recall the following definition considered in [5].

Definition 4

Let $f:\mathbb{R}^{n}\rightarrow\mathbb{R}\cup\{+\infty\}$ be a proper and lower semicontinuous function. The function $f$ is said to have the Łojasiewicz property, if for every $\overline{x}\in\operatorname*{crit}f$ there exist $C,\varepsilon>0$ and $\theta\in(0,1)$ such that

[TABLE]

According to [6, Lemma 2.1 and Remark 3.2(b)], the KL property is automatically satisfied at any noncritical point, a fact which motivates the restriction to critical points in the above definition. The real number $\theta$ in the above definition is called the Łojasiewicz exponent of the function $f$ at the critical point $\overline{x}$ .
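A one-dimensional example may help to make the exponent concrete. The sketch below assumes that the inequality in Definition 4 has the standard form $|f(x)-f(\overline{x})|^{\theta}\leq C\|\nabla f(x)\|$ near $\overline{x}$ (in the smooth case). It checks numerically that $f(x)=|x|^{p}$ with $p\geq 2$ satisfies this form at $\overline{x}=0$ with $\theta=1-1/p$ and $C=1/p$. The function, the grid and the helper name `lojasiewicz_ratio` are illustrative.

```python
import numpy as np

def lojasiewicz_ratio(p, xs):
    """Return theta = 1 - 1/p and the ratio |f(x)|^theta / |f'(x)| for f(x) = |x|^p."""
    theta = 1.0 - 1.0 / p
    f_vals = np.abs(xs) ** p
    grad_vals = p * np.abs(xs) ** (p - 1)
    return theta, f_vals ** theta / grad_vals

# Even number of grid points, so the critical point x = 0 is not sampled
xs = np.linspace(-0.5, 0.5, 1000)

theta2, ratio2 = lojasiewicz_ratio(2, xs)   # quadratic growth
theta4, ratio4 = lojasiewicz_ratio(4, xs)   # flatter minimum
print(theta2, ratio2.max(), theta4, ratio4.max())
```

The ratio is constant and equal to $1/p$, so a flatter minimum (larger $p$) corresponds to a larger exponent $\theta$.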

The convergence rates obtained in the following theorem are in the spirit of [12] and [5].

Theorem 15

Let $x_{0},v_{0}\in\mathbb{R}^{n}$ and $\lambda>0$ be such that $v_{0}=\nabla\phi(x_{0})$ . Let $(x,v):[0,+\infty)\rightarrow\mathbb{R}^{n}\times\mathbb{R}^{n}$ be the unique strong global solution of the dynamical system (15). Suppose that $x$ is bounded and $\phi+\psi$ is a function which is bounded from below and satisfies Definition 3 for $\varphi(s)=Cs^{1-\theta}$ , where $C>0$ and $\theta\in(0,1)$ . Then there exists $\overline{x}\in\operatorname*{crit}(\phi+\psi)$ (that is $\nabla(\phi+\psi)(\overline{x})=0$ ) such that $\lim_{t\rightarrow+\infty}x(t)=\overline{x}$ and $\lim_{t\rightarrow+\infty}v(t)=\nabla\phi(\overline{x})=-\nabla\psi(\overline{x})$ . Let $\theta$ be the Łojasiewicz exponent of $\phi+\psi$ at $\overline{x}$ , according to Definition 4. Then there exist $a_{1},b_{1},a_{2},b_{2}>0$ and $t_{0}\geq 0$ such that for every $t\geq t_{0}$ the following statements are true:

(i)

if $\theta\in(0,\frac{1}{2})$ , then $x$ and $v$ converge in finite time;

(ii)

if $\theta=\frac{1}{2}$ , then $\|x(t)-\overline{x}\|+\|v(t)-\nabla\phi(\overline{x})\|\leq a_{1}\exp(-b_{1}t)$ ;

(iii)

if $\theta\in(\frac{1}{2},1)$ , then $\|x(t)-\overline{x}\|+\|v(t)-\nabla\phi(\overline{x})\|\leq(a_{2}t+b_{2})^{-\left(\frac{1-\theta}{2\theta-1}\right)}$ .
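The rates in (ii) and (iii) can be observed on a scalar toy problem. The following sketch assumes the special case $\phi\equiv 0$, so that $v\equiv 0$ and the system in the abstract reduces to $\lambda\dot{x}(t)=-\nabla\psi(x(t))$. It compares an explicit Euler trajectory with the closed-form solutions for $\psi(x)=x^{2}/2$ (exponent $\theta=\frac{1}{2}$) and $\psi(x)=x^{4}/4$ (exponent $\theta=\frac{3}{4}$, for which $\frac{1-\theta}{2\theta-1}=\frac{1}{2}$). The choice of $\psi$, the scheme and the name `flow` are illustrative; note that $x\mapsto x^{3}$ is only locally Lipschitz, which may fall outside the standing assumptions of the paper.

```python
import numpy as np

def flow(grad, x0, lam=1.0, h=1e-3, t_end=20.0):
    """Explicit Euler for lam * x'(t) = -grad(x(t)); returns times and states."""
    n = int(round(t_end / h))
    ts = h * np.arange(n + 1)
    xs = np.empty(n + 1)
    xs[0] = x0
    for k in range(n):
        xs[k + 1] = xs[k] - (h / lam) * grad(xs[k])
    return ts, xs

ts, x_half = flow(lambda x: x, 1.0)        # psi(x) = x^2/2, theta = 1/2
_, x_34 = flow(lambda x: x ** 3, 1.0)      # psi(x) = x^4/4, theta = 3/4

exp_rate = np.exp(-ts)                     # a_1 * exp(-b_1 t) with a_1 = b_1 = 1
poly_rate = (2.0 * ts + 1.0) ** -0.5       # (a_2 t + b_2)^(-1/2) with a_2 = 2, b_2 = 1

print(np.max(np.abs(x_half - exp_rate)), np.max(np.abs(x_34 - poly_rate)))
```

With $\lambda=1$ and $x_{0}=1$ the exact solutions are $x(t)=e^{-t}$ and $x(t)=(2t+1)^{-1/2}$, so in this toy case the bounds in (ii) and (iii) hold with equality.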

Proof.

According to the proof of Theorem 13, $\dot{x},\dot{v}\in L^{1}([0,+\infty);\mathbb{R}^{n})$ and there exists $\overline{x}\in\operatorname*{crit}(\phi+\psi)$ , in other words $\nabla(\phi+\psi)(\overline{x})=0$ , such that $\lim_{t\rightarrow+\infty}x(t)=\overline{x}$ and $\lim_{t\rightarrow+\infty}v(t)=\nabla\phi(\overline{x})=-\nabla\psi(\overline{x})$ . Let $\theta$ be the Łojasiewicz exponent of $\phi+\psi$ at $\overline{x}$ , according to Definition 4.

We define $\sigma:[0,+\infty)\rightarrow[0,+\infty)$ by (see also [12])

[TABLE]

It is immediate that

[TABLE]

Indeed, this follows by noticing that for $T\geq t$

[TABLE]

and by letting afterwards $T\rightarrow+\infty$ .

Similarly, we have

[TABLE]

From (28) and (29) we derive

[TABLE]

We assume that for every $t\geq 0$ we have $(\phi+\psi)(x(t))>(\phi+\psi)(\overline{x})$ ; otherwise, as seen in the proof of Theorem 13, the conclusion follows automatically. Furthermore, by invoking again the proof of Theorem 13, there exist $\varepsilon>0$ , $t_{0}\geq 0$ and $\alpha>0$ such that for almost every $t\geq t_{0}$ (see (26))

[TABLE]

and

[TABLE]

We derive by integration for $T\geq t\geq t_{0}$

[TABLE]

[TABLE]

hence

[TABLE]

Since $\theta$ is the Łojasiewicz exponent of $\phi+\psi$ at $\overline{x}$ , we have

[TABLE]

for every $t\geq t_{0}$ . From the second relation in (15) we derive for almost every $t\in[t_{0},+\infty)$

[TABLE]

which combined with (32) yields

[TABLE]

Since

[TABLE]

we conclude that there exists $\alpha^{\prime}>0$ such that for almost every $t\in[t_{0},+\infty)$

[TABLE]

If $\theta=\frac{1}{2}$ , then

[TABLE]

for almost every $t\in[t_{0},+\infty)$ . By multiplying with $\exp(\alpha^{\prime}t)$ and integrating afterwards from $t_{0}$ to $t$ , it follows that there exist $a_{1},b_{1}>0$ such that

[TABLE]

and the conclusion of (ii) is immediate from (30).

Assume that $0<\theta<\frac{1}{2}$ . We obtain from (35)

[TABLE]

for almost every $t\in[t_{0},+\infty)$ .

By integration we obtain

[TABLE]

where $\overline{\alpha}>0$ . Thus there exists $T\geq 0$ such that

[TABLE]

which implies that $x$ and $v$ are constant on $[T,+\infty)$ , that is, statement (i).

Finally, suppose that $\frac{1}{2}<\theta<1$ . We obtain from (35)

[TABLE]

for almost every $t\in[t_{0},+\infty)$ . By integration we derive

[TABLE]

where $a_{2},b_{2}>0$ . Statement (iii) follows from (30). $\blacksquare$
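The three regimes can be read off a scalar model problem. The worked computation below is a sketch: it assumes that the differential inequality driving the proof has the form $\dot{\sigma}(t)\leq-\alpha^{\prime}\sigma(t)^{q}$ with $q=\frac{\theta}{1-\theta}$, and it solves the corresponding equality $\dot{y}=-cy^{q}$, $y(0)=y_{0}>0$, $c>0$. The symbols $y$, $c$ and $q$ are introduced here for illustration only.

```latex
\begin{align*}
q<1\ \big(\theta<\tfrac{1}{2}\big):\quad
  & y(t)^{1-q}=y_{0}^{1-q}-c(1-q)t
    \ \Longrightarrow\ y(t)=0 \text{ for } t\geq \frac{y_{0}^{1-q}}{c(1-q)},\\
q=1\ \big(\theta=\tfrac{1}{2}\big):\quad
  & y(t)=y_{0}\exp(-ct),\\
q>1\ \big(\theta>\tfrac{1}{2}\big):\quad
  & y(t)=\big(y_{0}^{1-q}+c(q-1)t\big)^{-\frac{1}{q-1}},
    \qquad \frac{1}{q-1}=\frac{1-\theta}{2\theta-1}.
\end{align*}
```

The model reproduces finite-time convergence, the exponential rate and the polynomial exponent $\frac{1-\theta}{2\theta-1}$ of Theorem 15.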

Bibliography

[1] B. Abbas, An asymptotic viscosity selection result for the regularized Newton dynamic, arXiv:1504.07793v1, 2015
[2] B. Abbas, H. Attouch, B.F. Svaiter, Newton-like dynamics and forward-backward methods for structured monotone inclusions in Hilbert spaces, Journal of Optimization Theory and Applications 161(2), 331–360, 2014
[3] F. Alvarez, H. Attouch, J. Bolte, P. Redont, A second-order gradient-like dissipative dynamical system with Hessian-driven damping. Application to optimization and mechanics, Journal de Mathématiques Pures et Appliquées (9) 81(8), 747–779, 2002
[4] H. Attouch, G. Buttazzo, G. Michaille, Variational Analysis in Sobolev and BV Spaces: Applications to PDEs and Optimization, Second Edition, MOS-SIAM Series on Optimization, Philadelphia, 2014
[5] H. Attouch, J. Bolte, On the convergence of the proximal algorithm for nonsmooth functions involving analytic features, Mathematical Programming 116(1-2) Series B, 5–16, 2009
[6] H. Attouch, J. Bolte, P. Redont, A. Soubeyran, Proximal alternating minimization and projection methods for nonconvex problems: an approach based on the Kurdyka-Łojasiewicz inequality, Mathematics of Operations Research 35(2), 438–457, 2010
[7] H. Attouch, J. Bolte, B.F. Svaiter, Convergence of descent methods for semi-algebraic and tame problems: proximal algorithms, forward-backward splitting, and regularized Gauss-Seidel methods, Mathematical Programming 137(1-2) Series A, 91–129, 2013
[8] H. Attouch, M.-O. Czarnecki, Asymptotic behavior of coupled dynamical systems with multiscale aspects, Journal of Differential Equations 248(6), 1315–1344, 2010