This paper establishes conditions under which the Lojasiewicz-Simon gradient inequality applies to submanifolds in Banach spaces, aiding the analysis of long-term behavior of constrained quasilinear parabolic equations.
Contribution
It offers new sufficient conditions for the Lojasiewicz-Simon inequality on submanifolds, including discussions on the optimality of these assumptions.
Findings
01
Provides a framework for applying the inequality to constrained PDEs
02
Identifies optimality conditions for the assumptions
03
Facilitates analysis of asymptotic behavior in quasilinear parabolic equations
Abstract
We provide sufficient conditions for the Lojasiewicz-Simon gradient inequality to hold on a submanifold of a Banach space and discuss the optimality of our assumptions. Our result provides a tool to study asymptotic properties of quasilinear parabolic equations with (nonlinear) constraints.
\displaystyle\mathcal{T}_{u}\mathcal{M}\vcentcolon=\left\{\gamma^{\prime}(0)\mid\exists\varepsilon>0,\gamma\in\mathcal{C}^{1}\big{(}(-\varepsilon,\varepsilon);V\big{)}\text{ with }\gamma(t)\in\mathcal{M}\leavevmode\nobreak\ \forall t\in(-\varepsilon,\varepsilon)\text{ and }\gamma(0)=u\right\}.
\displaystyle\mathcal{T}_{u}\mathcal{M}\vcentcolon=\left\{\gamma^{\prime}(0)\mid\exists\varepsilon>0,\gamma\in\mathcal{C}^{1}\big{(}(-\varepsilon,\varepsilon);V\big{)}\text{ with }\gamma(t)\in\mathcal{M}\leavevmode\nobreak\ \forall t\in(-\varepsilon,\varepsilon)\text{ and }\gamma(0)=u\right\}.
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Full text
\newaliascnt
lemprop
\aliascntresetthelem
\newaliascntdefnprop
\aliascntresetthedefn
\newaliascntcorprop
\aliascntresetthecor
\newaliascntremprop
\aliascntresettherem
\newaliascntthmprop
\aliascntresetthethm
\newaliascnthypprop
\aliascntresetthehyp
\newaliascntdefnlemprop
\aliascntresetthedefnlem
\newaliascntexampleprop
\aliascntresettheexample
On the Łojasiewicz–Simon gradient inequality on submanifolds
Fabian Rupp
Institute of Analysis, Ulm University, Helmholtzstraße 18, 89081 Ulm, Germany. [email protected].
Abstract
Abstract: We provide sufficient conditions for the Łojasiewicz-Simon gradient inequality to hold on a submanifold of a Banach space and discuss the optimality of our assumptions. Our result provides a tool to study asymptotic properties of quasilinear parabolic equations with (nonlinear) constraints.
Keywords: Łojasiewicz–Simon gradient inequality, constrained gradient flows, mean curvature flow with isoperimetric constraint, constrained Allen–Cahn equation.
In real algebraic geometry, the Łojasiewicz inequality is a remarkable result describing the particular behavior of an analytic function near a critical point.
Let U⊂Rn be open. If E∈Cω(U;R) and uˉ∈U satisfies ∇E(uˉ)=0, then there exist C,σ>0 and θ∈(0,21] such that for all ∥u−uˉ∥≤σ, we have
[TABLE]
Throughout this article, we write Cω(U;X) for the set of real analytic functions from an open set U of a Banach space V into another Banach space X. All vector spaces are understood to be over the field of real numbers R. The space of bounded linear operators between two normed spaces X and Y is denoted by L(X,Y) and we write X∗:=L(X,R) for the continuous dual of X.
In Rn, inequality (1) was discovered and proven by S. Łojasiewicz in his famous works on semianalytic and subanalytic sets, [27, 28]. Since then, Section 1 has been used as a celebrated tool to prove convergence results for the gradient flow of analytic energies on finite-dimensional spaces (see [29]). The pioneering work of L. Simon in [33] extended inequality (1) to certain energy functions on infinite-dimensional function spaces using Lyapunov–Schmidt reduction and, in honor of his significant contributions, the inequality is nowadays often called Łojasiewicz–Simon gradient inequality. In more recent work by Kurdyka [22], Łojasiewicz’s convergence result has been extended to a larger class of functions via the Kurdyka–Łojasiewicz inequality.
Over the last decades, gradient inequalities like (1) have been extensively studied in various situations to analyze the long time behavior of gradient flows, see for instance [12, 13, 15, 18, 32].
In [20, 21], this is also done for second order evolution equations.
Loosely speaking, whenever an energy E satisfies a Łojasiewicz–Simon gradient inequality at a critical point uˉ=limn→∞u(tn), where tn→∞ and u=u(t) is a precompact solution to the associated gradient flows
[TABLE]
we may conclude that u converges with limt→∞u(t)=uˉ. Numerical applications of this phenomenon have been considered for instance in [2, 6].
Hence, it is a question of great interest, whether a given energy function satisfies a Łojasiewicz–Simon gradient inequality. It can be shown that in the infinite-dimensional case, mere analyticity of the energy is not enough, see for instance [19, Theorem 2.1, Proposition 3.5]. On the other hand, very general conditions which are sufficient for the gradient inequality to hold are presented in [10].
For most of the applications, one usually checks that the following conditions are satisfied, see [12, 13, 14, 25].
Theorem \thethm(Consequence of [10, Corollary 3.11]).
Let V be a Banach space, U⊂V an open set, E∈Cω(U;R) and uˉ∈U a critical point of E. Suppose that
(i)
there exists a Banach space Z such that V↪Z densely,
2. (ii)
E′∈Cω(U;Z∗),
3. (iii)
the second derivative E′′(uˉ):V→Z∗ is Fredholm of index zero.
Then, there exist C,σ>0, θ∈(0,21] such that for all u∈U with ∥u−uˉ∥V≤σ, we have
[TABLE]
Remark \therem.
Note that by assumption (i) in Section 1 we have V↪Z, so Z∗ can be identified with a subset of V∗. Condition (ii) requires that for all u∈U the functional E′(u) which is in general only in V∗ is in fact in Z∗ and the map E′:U→Z∗ is analytic.
Although Section 1 describes a slightly less general situation than in [10], in most applications its conditions are relatively easy to check and suffice to prove the Łojasiewicz–Simon gradient inequality. The details on how to deduce Section 1 from [10] are given in Appendix A.
To prove a suitable version of Section 1 on a finite-dimensional manifold M is quite straightforward if M and E are analytic, by simply choosing local coordinates and applying Section 1. In [23], this is used to study gradient-like dynamical systems via the Kurdyka–Łojasiewicz inequality. The infinite-dimensional setting is more complicated.
Our main result is to extend Section 1 to a constrained energy function E∣M on a submanifold M of a Banach space V, and to refine the estimate by projecting the derivative onto the cotangent space of M. In [25], a special case has been studied and a Łojasiewicz–Simon gradient inequality is proven for the Canham–Helfrich energy on the submanifold of closed embedded surfaces with fixed area and volume, see [25, Theorem 1.4]. In the following theorem, we give very general sufficient conditions for the Łojasiewciz–Simon gradient inequality to hold on an infinite-dimensional submanifold in the abstract setting of an energy on a Banach space. In Section 5, we will consider the easier case where the ambient space is a Hilbert space. However, as we shall explain in detail in Section 1 below, in order to avoid issues with analyticity, it is sometimes necessary to work in Banach spaces, cf. also Section 7.1. Our main result is the following
Theorem \thethm.
Let V be a Banach space, U⊂V an open set, m∈N and E:U→R, G:U→Rm be analytic. Let uˉ∈U and suppose that
(i)
there exists a Banach space Y such that V↪Y densely,
2. (ii)
E′∈Cω(U;Y∗),
3. (iii)
the second derivative E′′(uˉ):V→Y∗ is Fredholm of index zero,
4. (iv)
for any u∈U, the linear operator G′(u)∈L(V,Rm) extends to G′(u)∈L(Y,Rm) and the map G′:U→L(Y,Rm), u↦G′(u) is analytic,
5. (v)
the Fréchet derivative \big{(}\overline{\mathcal{G}^{\prime}}\big{)}^{\prime}(\bar{u})\colon V\to\mathcal{L}(Y,\mathbb{R}^{m}) is compact,
6. (vi)
G(uˉ)=0* and G′(uˉ):V→Rm is surjective.*
Then, M:={u∈U∣G(u)=0} is locally an analytic submanifold of V of codimension m near uˉ.
If uˉ is a critical point of E∣M, then the restriction satisfies a refined Łojasiewicz–Simon gradient inequality at uˉ, i.e. there exist C,σ>0 and θ∈(0,21] such that for any u∈M with ∥u−uˉ∥V≤σ, we have
[TABLE]
Here, TuM∗ is the dual of the closure TuM:=TuM∥⋅∥Y⊂Y of the tangent space TuM.
Remark \therem.
The notation G′(u) is justified, since the operator G′(u):Y→Rm is the closure of A=G′(u) on the Banach space Y with D(A)=V.
Remark \therem.
(i)
Note that we could apply Section 1 in the situation of Section 1 as well, but (3) yields a sharper estimate: If Z=Y with Y as in Section 1, then for u,uˉ∈M with ∥u−uˉ∥V≤σ, we have
[TABLE]
Thus, if the assumptions of Section 1 are satisfied and C,σ,θ are as in Section 1, we have ∣E(u)−E(uˉ)∣1−θ≤C∥E′(u)∥Y∗, i.e. (3) implies (2) under the assumptions of Section 1. It hence makes sense to refer to (3) as a refined Łojasiewicz–Simon gradient inequality.
2. (ii)
From our proof, we cannot conclude that the Łojasiewicz exponentsθ in Section 1 and Section 1 coincide.
Remark \therem.
The Hilbert space case treated in Section 5 is much easier to handle than Section 1. It is also more natural since one usually studies H-gradient flows with H=Wk,2(Ω), Ω⊂Rd open, k∈Z. On the other hand, one may sometimes encounter a problem in proving analyticity of the energy. The problematic phenomenon is, that whenever a Nemytskii or supercomposition operator
[TABLE]
is analytic, the function f has to be a polynomial of degree at most ⌈qp⌉, see [5, Theorem 3.16]. A way to work around this, is to choose suitable Sobolev spaces, such that all derivatives in the energy either appear in polynomial expressions with appropriate powers or are continuous. This is exactly why we work in the Banach space W2,p(Ω) with p>d to prove the Łojasiewicz–Simon gradient inequality in Section 7.1.
This article is structured as follows. First, we recall some basic definitions and fundamental properties of analytic functions and Fredholm operators. Then we present the generalizations of basic concepts of differential geometry to submanifolds of a Banach space.
In Section 3, we establish a local graph representation for the manifold M in Section 1. It turns out that studying this chart plays a crucial role in the proof of Section 1 which we complete in Section 4. After that, we consider the Hilbert space case in Section 5 in which the inequality takes a more convenient form. We also prove an abstract convergence result for the associated gradient flow in this case. Section 6 is dedicated to discuss the necessity of the assumptions we make in Section 1. In the last section, we will then apply our abstract results to the area of graph surfaces with an isoperimetric constraint in Section 7.1, the Allen–Cahn equation in Section 7.2 and to surfaces of revolution with prescribed volume in Section 7.3.
2 Preliminaries
2.1 Analyticity
Definition \thedefn.
Let V,W be (real) Banach spaces, D⊂V be an open set. A function f:D→W is called (real) analytic at u0∈D if there exist ρ>0 and continuous R-multilinear forms an:Vn:=n−timesV×⋯×V→W for all n∈N0 such that
[TABLE]
for all ∥u−u0∥V<ρ, where an(u−u0)n:=an(u−u0,…,u−u0)∈W. The function f is (real) analytic (on D) if it is analytic at every point u0∈D.
We denote by Cω(D;W) the vector space of analytic functions from D to W.
Like in the finite-dimensional case, a composition of two analytic maps is analytic.
Let V,W,X be Banach spaces, D⊂V and E⊂W be open and f:D→W, g:E→X be analytic with f(D)⊂E. Then g∘f:D→X is analytic.
Easy examples of analytic maps are bounded multilinear maps.
Example \theexample.
Let ℓ∈N and V1,…,Vℓ,W be Banach spaces. If a:V1×⋯×Vℓ→W is multilinear and continuous, then it is analytic. This follows easily since the series in (4) consists of exactly one nonzero term and hence converges.
2.2 Fredholm operators
Definition \thedefn.
Let V,W be Banach spaces. An operator T∈L(V,W) is called a Fredholm operator if both dimkerT and codim(ImT,W)=dim(W/ImT) are finite. The number indT:=dimkerT−codim(ImT,W)
is called the Fredholm index of T.
In the following, we collect some important properties of Fredholm operators.
Proposition 2.1** ([24, XVII, Corollaries 2.6 and 2.7]).**
Let T∈L(V,W) be a Fredholm operator. Then
(i)
the image ImT⊂W is closed,
2. (ii)
for any compact operator K:V→W, the perturbed operator T+K is Fredholm with ind(T+K)=indT. This holds in particular if K has finite rank.
Let V,W and X be Banach spaces and let T∈L(V,W) and S∈L(W,X) be Fredholm operators. Then S∘T∈L(V,X) is a Fredholm operator and its index is given by ind(S∘T)=indS+indT.
2.3 Complemented subspaces
Projection operators and complemented subspaces play a crucial role in the proof of the Łojasiewicz–Simon gradient inequality in [10] and they will also be important for our result, specifically when investigating the properties of the submanifold M in Section 3.
Definition and Lemma \thedefnlem.
A closed subspace V0 of a Banach space V is called complemented in V if there exists a projection P∈L(V) with ImP=V0. Equivalently, there exists a closed subspace V1 of V with V=V0⊕V1, see [8, Section 2.4].
Whereas in a Hilbert space, every closed subspace is complemented via the orthogonal projection (cf. [8, Chapter 5.1]), this is not true for a general Banach space. In fact, if in a Banach space V, every closed subspace is complemented, then it has to be isomorphic to a Hilbert space, see [26]. Nevertheless, some subspaces are always complemented.
Let V be a Banach space and V0⊂V be a closed subspace, such that dimV0<∞ or
codim(V0,V)<∞. Then V0 is complemented in V.
2.4 Submanifolds of Banach spaces
This section is devoted to review some basic definitions in differential geometry in the setting of infinite-dimensional manifolds. Since we are only interested in the case of a submanifold of a Banach space V, the following definition based on [1, Definition 3.2.1] is sufficient for our purposes.
Definition \thedefn.
Let V be a Banach space. A subset M⊂V is called a (splitting) submanifold of V (of class Cℓ) if for all u∈M, there exists an open neighborhood U⊂V of u, a complemented subspace V0⊂V and a map α∈Cℓ(U;V) which is a diffeomorphism onto its image, such that α(U∩M)=α(U)∩V0.
If α∈Cω(U;V), we say that M is analytic.
Example \theexample.
If V is a Banach space, V0⊂V is a complemented subspace, with V=V0⊕V1, Ω0⊂V0 is an open set and ψ∈Cℓ(Ω0;V) with ψ(Ω0)⊂V1, then M:={ω+ψ(ω)∣ω∈Ω0} is a submanifold of V of class Cℓ.
Indeed, let Ω:=Ω0+V1 and write Ω∋v=ω+v1 with ω∈Ω0 and v1∈V1 and define α:Ω→V,α(ω+v1)=ω+(v1−ψ(ω))∈V0⊕V1. Then α is of class Cℓ and
[TABLE]
so α′(ω):V→V is an isomorphism. Since α is clearly bijective onto its image, we conclude that α is a Cℓ-diffeomorphism by the Inverse Function Theorem [24, XIV, Theorem 1.2]. Consequently,
[TABLE]
thus M is a submanifold in the sense of Section 2.4.
Definition \thedefn.
Let M⊂V be a submanifold of class Cℓ with ℓ≥1. The tangent space TuM of M at u∈M is defined by
[TABLE]
Like in the finite-dimensional case, TuM⊂V is a subspace. We define the codimension of M in V to be the codimension codim(TuM,V) of TuM in V.
The dual of the tangent space is called cotangent space and denoted by Tu∗M:=(TuM)∗.
Definition and Lemma \thedefnlem.
Let V be a Banach space, U⊂V be an open set, ∅=M⊂U and E∈C1(U;R). We say that uˉ is a constraint critical point of E on M or a critical point of E∣M, if for any curve \gamma\in\mathcal{C}^{1}\big{(}(-\varepsilon,\varepsilon);V\big{)} with γ(0)=uˉ and γ(t)∈M for all t∈(−ε,ε), the map t↦(E∘γ)(t) has a critical point at t=0.
If M=M⊂V is a submanifold, then uˉ∈M is a constraint critical point if and only if
[TABLE]
Proof.
This follows since for each curve γ∈C1((−ε,ε);V) with γ(0)=uˉ and γ(t)∈M for all t∈(−ε,ε), we have 0=dtdt=0(E∘γ)(t)=E′(uˉ)γ′(0).
∎
3 Local representation by a graph
In this section, we will lay the foundations for the proof of our main theorem. We will see that the level set manifold M in Section 1 admits a natural chart around uˉ representing M locally as a graph. After that, we will carefully analyze the properties of this induced chart.
For the rest of the article, we assume that V and Y are Banach spaces with V↪Y densely, thus we get an induced embedding Y∗↪V∗. Furthermore, we assume that U⊂V is an open set, m∈N and G:U→Rm is analytic. We study the nodal set of G given by M:={u∈U∣G(u)=0}.
Theorem \thethm.
Let uˉ∈M such that G′(uˉ):V→Rm is surjective. Then V=V0⊕V1 with V0=kerG′(uˉ) for a closed subspace V1⊂V. Moreover, there exist open sets Ω0⊂V0,Ω1⊂V1 with uˉ∈Ω=Ω0×Ω1⊂U and an analytic function ψ:Ω0→V with ψ(Ω0)=Ω1 such that
[TABLE]
Hence, locally around uˉ, M is an analytic submanifold of V.
Moreover, with φ:Ω0→V,φ(ω):=ω+ψ(ω) we have for any ω∈Ω0,v∈V0
[TABLE]
where ∂v1∂G(u):=G′(u)∣V1:V1→Rm for u∈U.
Proof.
Since Rm=ImG′(uˉ)≅V/V0, V0 has finite codimension in V. Moreover, since V0 is closed by continuity, it is complemented by Section 2.3, i.e. there exists V1⊂V closed with V=V0⊕V1. As a consequence thereof, ∂v1∂G(uˉ):V1→Rm is an isomorphism of Banach spaces. Thus, by the Implicit Function Theorem [35, Theorem 4.B], there exist open neighborhoods Ω0⊂V0,Ω1⊂V1 with Ω:=Ω0×Ω1⊂U and u∈Ω such that for any ω∈Ω0, there exists exactly one ψ(ω)∈Ω1 with G(ω+ψ(ω))=0. Analyticity of ψ follows since G is analytic.
By Section 2.4, we may conclude that M∩Ω is an analytic submanifold of V.
Moreover, the subset of invertible operators in L(V1,Rm) is open in the norm topology and the map Ω∋u↦G′(u)∣V1∈L(V1,Rm) is continuous. Hence, by continuity, we can assume that ∂v1∂G(u):V1→Rm is an isomorphism for all u∈Ω, passing to a smaller Ω if necessary.
Therefore, (6) and thus (5) follow by differentiating the equation 0=G(ω+ψ(ω))=G(φ(ω)) for ω∈Ω0.
∎
Remark \therem.
(i)
The relation M∩Ω={ω+ψ(ω)∣ω∈Ω0} implies that the map φ:Ω0→Ω∩M,φ(ω)=ω+ψ(ω) defines a chart for M∩Ω centered at uˉ∈M. Resembling the finite-dimensional case, we can identify ω+ψ(ω)=(ω,ψ(ω)) which means that M is locally the graph of ψ near uˉ (cf. Section 2.4).
2. (ii)
Since we only work locally, we will abuse notation and speak about the manifold M instead of M∩Ω and write TuM for the tangent space Tu(M∩Ω) at u.
The assumptions on G in Section 1 have some immediate consequences for the tangent space of M.
Proposition 3.1**.**
Suppose G:U→Rm and uˉ∈M satisfy assumptions (i), (iv) and (vi) in Section 1. Then, using the notation of Section 3, for ω∈Ω0,φ(ω)=u we have
(i)
TuM=kerG′(u)=Imφ′(ω),
2. (ii)
TuM:=TuM∥⋅∥Y=kerG′(u),
3. (iii)
codim(TuM,V)=codim(TuM,Y)=m* for all u∈Ω.*
Proof.
(i)
We first prove the inclusion TuM⊂kerG′(u). Let v∈TuM. Then, there exist ε>0 and γ∈C1((−ε,ε),V) with Imγ⊂M∩Ω,γ(0)=u and γ′(0)=v. Now, G′(u)v=dtdt=0G(γ(t))=0, since M={u∈U∣G(u)=0}.
For kerG′(u)⊂Imφ′(ω), let v∈V with G′(u)v=0 and write v=v0+v1∈V0⊕V1. Then
0=∂v0∂G(u)v0+∂v1∂G(u)v1. With φ,ψ as in Section 3 and writing u=φ(ω) we conclude using (5)
To prove Imφ′(ω)⊂TuM, let ω∈Ω0 with φ(ω)=u and let y∈Imφ′(ω). Then there exists v∈V0 with y=φ′(ω)v, hence y=φ′(ω)v=dtdt=0φ(ω+tv)∈TuM since γ(t):=φ(ω+tv) defines a curve in M with γ(0)=u and γ′(0)=φ′(ω)v.
2. (ii)
First, let y∈TuM and vn∈TuM with vn→y in Y. Using the extension property (iv) in Section 1, we get G′(u)y=limn→∞G′(u)vn=0 by (i).
Conversely, let y∈Y such that G′(u)y=0. Since ∂v1∂G(u):V1→Rm is an isomorphism by the proof of Section 3, we conclude that G′(u):V→Rm is surjective. As a consequence, Rm≅V/kerG′(u), so codimkerG′(u)=m is finite. By Section 2.3, there exists a closed subspace W=W(u) of V with V=kerG′(u)⊕W. By density of V in Y, there exists a sequence (vn)⊂V with vn→y in Y. Thus, we may write vn=vn0+wn with vn0∈kerG′(u) and wn∈W. As a consequence, G′(u)vn=G′(u)wn→G′(u)y=0 in Rm. Since G′(u)∣W:W→Rm is an isomorphism, we get wn→0 in W⊂V, hence in Y. Thus, y=limn→∞vn=limn→∞vn0 with vn0∈kerG′(u)=TuM by (i).
3. (iii)
First, since ∂v1∂G(u):V1→Rm is an isomorphism by the proof of Section 3, the operator G′(u):V→Rm is surjective. Thus Rm≅V/kerG′(u)=V/TuM by (i), so codim(TuM,V)=m. Moreover, since G′(u):V→Rm is surjective, so is the extension G′(u):Y→Rm and hence Rm≅Y/kerG′(u)=Y/TuM by (ii), so codim(TuM,Y)=m.∎
Remark \therem.
In particular, 3.1 and Section 2.3 imply that there exists a projection P(uˉ):Y→Y onto TuˉM=kerG′(u)=:V0.
As a next step, we investigate the properties of the chart φ defined in Section 3 under the assumptions on G in Section 1.
Proposition 3.2**.**
Suppose G satisfies assumptions (iv) and (vi) in Section 1. Using the notation of Section 3, we have
(i)
for all ω∈Ω0, the operator ψ′(ω):V0→V defined in (5) extends to an operator ψ′(ω)∈L(Y) such that supω∈Ω0∥ψ′(ω)∥L(Y)<∞ and Imψ′(ω)⊂V1 is finite-dimensional, replacing Ω0 with a smaller neighborhood if necessary,
2. (ii)
for any ω∈Ω0, the operator φ′(ω) extends to φ′(ω)=IdY+ψ′(ω):Y→Y,
3. (iii)
the map φ′:Ω0→L(Y),ω↦φ′(ω) is analytic,
4. (iv)
φ′(ωˉ)y=y* for all y∈V0, where ωˉ∈Ω0 satisfies φ(ωˉ)=uˉ.*
5. (v)
For all ω∈Ω0 and y∈V0, we have ∥φ′(ω)y∥Y≥21∥y∥Y, passing to a smaller neighborhood Ω0 if necessary.
Proof.
(i)
By assumption (iv) in Section 1, the operator G′(φ(ω)) extends to an operator G′(φ(ω)):Y→Rm and hence ψ′(ω) extends to an operator ψ′(ω):Y→Y via (5). Since the image of \big{(}\frac{\partial\mathcal{G}}{\partial v_{1}}(\varphi(\omega))\big{)}^{-1}\colon\mathbb{R}^{m}\to V_{1} is contained in V1, we conclude that Imψ′(ω)⊂V1⊂Y is finite-dimensional. For the norm estimate note that for any y∈Y, using V↪Y, we have
[TABLE]
for some C,C′>0, passing to a smaller Ω0 if necessary, since φ and G are analytic and so is the extension G′:U→L(Y,Rm) by assumption (iv) in Section 1.
2. (ii)
The map Ω0∋ω↦G′(φ(ω))∈L(Y,Rm) is analytic using assumption (iv) in Section 1 and the analyticity of φ. Therefore, using (6), so is the extension Ω0∋ω↦φ′(ω)∈L(Y) using Section 2.1 and Section 2.1.
4. (iv)
By 3.1 (ii) and (5), ψ′(ω)y=0 for all y∈V0. This yields the claim.
5. (v)
By (iv), ∥φ′(ωˉ)y∥Y=y for all y∈V0. By (iii), passing to a smaller Ω0 if necessary, we can assume ∥φ′(ωˉ)−φ′(ω)∥L(Y)≤21 for all w∈Ω0. Then, for any y∈V0 we can estimate
∥φ′(ω)y∥Y≥∥φ′(ωˉ)y∥Y−∥φ′(ωˉ)y−φ′(ω)y∥Y≥(1−21)∥y∥Y.
∎
4 Proof of the Łojasiewicz–Simon gradient inequality
In this section, we will establish the Łojasiewicz–Simon gradient inequality for the energy E composed with the chart we constructed in Section 3 and use this to prove our main theorem.
Theorem \thethm.
Suppose E, G and uˉ∈U satisfy assumptions (i), (ii), (iv) and (vi) of Section 1. Let φ be the chart centered at uˉ defined in Section 3. Define F:Ω0→R, F(ω):=(E∘φ)(ω). Then
(i)
F* is analytic,*
2. (ii)
for ω∈Ω0, F′(ω)∈V0∗ via F′(ω)=P(uˉ)∗∘φ′(ω)∗E′(φ(ω)), where P(uˉ)∈L(Y) is the projection onto V0=TuˉM from Section 3,
3. (iii)
the map Ω0∋ω↦F′(ω)∈V0∗ is analytic.
Proof.
(i)
This follows from Section 2.1 since E and φ are analytic.
2. (ii)
Let ω∈Ω0,v∈V0 and P(uˉ)∈L(Y) be as in Section 3. We compute
[TABLE]
using that P(uˉ)v=v since v∈V0⊂V0. Since P(uˉ) projects onto V0∗, (ii) follows.
3. (iii)
The chart φ:Ω0→U is analytic and so is E′:U→Y∗ by assumption (ii) in Section 1. Thus, so is their composition ω↦E′(φ(ω)):Ω0→Y∗ by Section 2.1. By 3.2, the extension φ′:Ω0→L(Y) is analytic, and so is taking the adjoint T↦T∗:L(Y)→L(Y∗) by Section 2.1, since it is linear and bounded. Similarly, the evaluation map L(Y∗)×Y∗→Y∗,(T,y∗)↦Ty∗ is analytic. Therefore, Ω0→V0∗,ω↦F′(ω)=P(uˉ)∘φ′(ω)∗∘E′(φ(ω)) is analytic, since the projection P(uˉ):Y→V0∗ is analytic. ∎
The following lemma justifies our approach to study F in order to prove Section 1.
Lemma \thelem.
Suppose E and G are analytic and satisfy assumptions (i), (ii), (iv) and (vi) in Section 1. Let uˉ∈U and let φ be the chart centered at uˉ defined in Section 3. Let ωˉ∈Ω0 such that φ(ωˉ)=uˉ and F=E∘φ as in Section 4. Then the following are equivalent.
(i)
F* satisfies a Łojasiewicz–Simon gradient inequality at ωˉ, i.e. there exist C,σ′>0 and θ∈(0,21] such that*
[TABLE]
2. (ii)
E∣M* satisfies a refined Łojasiewicz–Simon gradient inequality (3) near uˉ.*
Proof.
Suppose (i) holds. Let u∈M with ∥u−uˉ∥V≤σ. For σ>0 small enough, we can assume u∈Ω, u=φ(ω) for a unique ω∈Ω0 and ∥ω−ωˉ∥V≤σ′ by continuity. Then by (i), we have
[TABLE]
Now, by 3.1, we have Tφ(ω)M=Imφ′(ω) and thus φ′(ω)v∈Tφ(ω)M for v∈V0.
By continuity of the extension (see 3.2 (ii)), we get φ′(ω)y∈Tφ(ω)M for y∈V0. Using P(uˉ)y=y for y∈V0 and Section 4 (ii), we compute
[TABLE]
Now, reducing σ,σ′>0 if necessary, and using 3.2 (i), we may assume that supv∈Ω0∥φ′(v)∥L(Y)≤1+supv∈Ω0∥ψ′(v)∥L(Y)≤C′<∞. Hence, using (7) and (4), we conclude ∣E(u)−E(uˉ)∣1−θ≤CC′∥E′(u)∥TuM∗.
Conversely, suppose E∣M satisfies (3) for some C,σ>0 and θ∈(0,21]. Let ω∈Ω0 with ∥ω−ωˉ∥V≤σ′. Define u:=φ(ω) and let σ′>0 be small enough such that ∥u−uˉ∥V≤σ. Then we have
[TABLE]
Now, fix 0=w∈TuM. Then by 3.1, w=φ′(ω)v for some 0=v∈V0. We have
[TABLE]
using P(uˉ)v=v since v∈V0 and Section 4 (ii). Thus, we find
[TABLE]
using 3.2 (v) and reducing σ′>0 if necessary. Since E′(u)∈Y∗ by assumption, we may conclude that (10) remains valid if 0=w∈TuM by continuity. Combining this with (9), the claim follows.
∎
Theorem \thethm.
Suppose E,G and uˉ∈U satisfy the assumptions of Section 1 above and let F be as in Section 4. Then, for ωˉ∈Ω0 with φ(ωˉ)=uˉ, the operator F′′(ωˉ):V0→V0∗ is Fredholm of index zero.
Proof.
Let φ(ωˉ)=uˉ and v∈V0. By 3.2 (iv), we have φ′(ω)v=v. We use the chain rule, the analyticity of φ,ψ,φ′,ψ′ and E′:U→Y∗, and the analyticity and bilinearity of the evaluation map L(Y∗)×Y∗→Y∗ to compute
[TABLE]
using Section 4 (ii), 3.2 and the fact that φ′∗=IdY∗+ψ′∗ by (6).
We will now show that E:V0→V0∗ is compact. First, using (5), we compute
[TABLE]
where R,S∈L(V0,L(Y)). We conclude that
[TABLE]
Recall that for u∈Ω=φ(Ω0), (∂v1∂G(u))−1:Rm→V1↪Y. The first part of (12) is
[TABLE]
so the image of v↦(Rv)∗E′(uˉ) is contained in Im(G′(uˉ)∗⊂Y∗, which is finite-dimensional since G′(uˉ):Y→Rm has finite rank.
Furthermore, with η:=((∂v1∂G(uˉ))−1)∗E′(uˉ)∈Rm
we have
[TABLE]
We will now show that the operator v↦(Sv)∗E′(uˉ):V0→Y∗ is compact. Let vn∈V0 for n∈N with ∥vn∥V≤1. By assumption (v) in Section 1, passing to a subsequence, we can assume (G′)′(uˉ)vn→A in L(Y,Rm). Since taking the adjoint is continuous, this yields ((G′)′(uˉ)vn)∗→A∗ in L(Rm,Y∗). But this clearly implies (G′)′(uˉ)vn)∗η→A∗η in Y∗.
By the previous arguments, together with (12), (13) and (14), we conclude that the linear operator V0→Y∗,v↦(dtdt=0\leavevmodeψ′(ωˉ+tv))∗E′(uˉ) is compact.
Clearly, since ψ′(ωˉ) has finite rank, the image of v↦ψ′(ωˉ)∗E′′(uˉ)v is contained in Im(ψ′(ωˉ))∗ and thus finite-dimensional. As a consequence,
[TABLE]
is a compact operator. We will now show that F′′(ωˉ):V0→V0∗ is Fredholm with indF′′(ω)=0. By (4) and 2.1, it is enough to show that T:V↦V0∗,v↦P(uˉ)∗∘E′′(uˉ)v is Fredholm of index zero.
Note that T=P(uˉ)∗∘E′′(uˉ)∘ι. Here, ι:V0↪V is the inclusion, which
is Fredholm since kerι={0} and codim(Imι,V)=codim(V0,V)=codim(TuˉM,V)=m by 3.1. Moreover, recall from Section 3 that P(uˉ):Y→Y is the projection onto V0=TuˉM with codim(V0,Y)=m by 3.1. Thus Y=V0⊕Z with dimZ=m. Then kerP(uˉ)∗≅Z∗, so dimkerP(uˉ)∗=dimZ∗=m. Clearly, codim(ImP(uˉ)∗,V0∗)=0.
Therefore, by Section 2.2, the composition T=P(uˉ)∗∘E′′(uˉ)∘ι is Fredholm with indT=indP(uˉ)∗+indE′′(uˉ)+indι=m+indE′′(uˉ)−m=0.
∎
Now, it is not difficult to see that F satisfies a Łojasiewicz–Simon gradient inequality at a critical point by Section 1.
Theorem \thethm.
Suppose E,G and uˉ∈M satisfy the assumptions of Section 1. Let φ be the chart constructed in Section 3 with vˉ∈Ω0 such that φ(ωˉ)=uˉ. If F′(ωˉ)=0, then there exist C,σ′>0 and θ∈(0,21] such that
[TABLE]
Proof.
We verify that the assumptions in Section 1 are satisfied for V=V0, U=Ω0Z=V0, φ=uˉ and E=F. Density of V0⊂V0 is trivial.
Assumption (ii) in Section 1 is satisfied by Section 4 (iii). Assumption (iii), i.e. the Fredholm property of F′′(ωˉ):V0→V0∗, holds by Section 4. Hence, F satisfies a Łojasiewicz–Simon gradient inequality in a neighborhood of ωˉ by Section 1.
∎
Suppose E,G and uˉ∈M={u∈M∣G(u)=0} satisfy the assumptions of Section 1. Suppose uˉ is a constraint critical point in the sense of Section 2.4. By Section 3, Section 3 and 3.1, M is locally a manifold near uˉ with codimension m. Let φ:Ω0→Ω∩M be the chart from Section 3 centered at uˉ with φ(ωˉ)=uˉ. Recall from Section 2.4 that E′(uˉ)v=0 for all v∈TuˉM=Imφ′(ωˉ) by 3.1 (i).
Then, for any v∈V0, using Section 4 and P(uˉ)v=v we have
[TABLE]
Hence, by Section 4, there exist C,σ′>0 and θ∈(0,21] such that F satisfies a Łojasiewicz–Simon gradient inequality at ωˉ. By Section 4 the claim follows.
∎
5 The Hilbert space framework
In the setting where Y=Y∗=H is a Hilbert space, the assumptions in Section 1 can be characterized in a simpler way in terms of the H-gradients.
Definition and Lemma \thedefnlem.
Let V be a Banach space and let (H,⟨⋅,⋅⟩) be a Hilbert space such that V↪H densely, so H↪V∗. Suppose U⊂V is an open set and E∈C1(U;R). If E′(u)∈H under the identification of H with its image in V∗, we say that E possesses an H-gradient at u∈U and we write ∇E(u):=E′(u)∈H. This means precisely that
[TABLE]
i.e. E′(u)∈V∗=L(V,R) extends to E′(u)∈L(H,R) via (15). Thus, E′(u)=∇E(u) under the isomorphism H≅H∗ given by the Riesz–Fréchet Theorem.
Corollary \thecor.
Let V be a Hilbert space, U⊂V be an open set, m∈N and let E∈Cω(U;R),G∈Cω(U;Rm). Let uˉ∈U and suppose that
(i)
there exists a Hilbert space (H,⟨⋅,⋅⟩) with V↪H densely,
2. (ii)
E* possesses an H-gradient ∇E(u) at each u∈U and the map u↦∇E(u):U→H is analytic,*
3. (iii)
the second derivative E′′(uˉ)=(∇E)′(uˉ):V→H is Fredholm of index zero,111The equation E′′(uˉ)=(∇E)′(u) has to be understood in the sense of the identification E′(u)=∇E(u), cf. Section 5.
4. (iv)
for any u∈U, the components Gk:U→R of G possess H-gradients ∇Gk such that U∋u↦∇Gk(u)∈H is analytic for all k=1,…,m,
5. (v)
the Fréchet derivatives (∇Gk)′(uˉ):V→H are compact for all k=1,…,m,
6. (vi)
G(uˉ)=0* and the H-gradients ∇G1(uˉ),…,∇Gm(uˉ) are linearly independent.*
Then, M:={u∈U∣G(u)=0} is locally an analytic submanifold of V of codimension m near uˉ.
If uˉ is a critical point of E∣M, then the restriction satisfies a refined Łojasiewicz–Simon gradient inequality at uˉ, i.e. there exist C,σ>0 and θ∈(0,21] such that for any u∈M with ∥u−uˉ∥V≤σ, we have
[TABLE]
where P(u):H→H is the orthogonal projection onto TuM:=TuM∥⋅∥H.
Remark \therem.
Requiring V to be a Hilbert space in Section 5 is no additional assumption. Indeed, if hypothesis (iii) in Section 5 is satisfied for V merely a Banach space, E′′(uˉ):V→H is a compact perturbation of an isomorphism by [4, Theorem 7.10]. In particular, V and H are isomorphic, so V has to be a Hilbert space.
Remark \therem.
In the case m=1 in Section 5, the projection P(u)∈L(H) onto TuM is given by
P(u)y=y−∥∇G(u)∥H2⟨∇G(u),y⟩∇G(u), where ∇G:=∇G1. This yields
[TABLE]
The scalar λ(u):=∥∇G(u)∥2⟨∇G(u),∇E(u)⟩ is often referred to as the Langrange multiplier, since if the right hand side of (17) is zero, λ(u) is exactly the Lagrange multiplier for the function E subject to the constraint G(u)=0 (cf. [16, Chapter 2]).
The following shows that assumption (vi) in Section 5 is just the equivalent formulation of hypothesis (vi) in Section 1.
Lemma \thelem.
Let V be a Banach space and let (H,⟨⋅,⋅⟩) be a Hilbert space such that V↪H densely. Let U⊂V open, u∈U and suppose G∈C1(U;Rm) possesses H-gradients ∇G1(u),…,∇Gm(u) in the sense of Section 5. Then the following are equivalent.
(i)
G′(u):V→Rm* is surjective,*
2. (ii)
∇G1(u),…,∇Gm(u)* are linearly independent in H.*
Proof.
Assume (i) holds and let λ∈Rm be such that ∑k=1mλk∇Gk(u)=0 in H. Then, for any v∈V⊂H we have
0=∑k=1mλk⟨∇Gk(u),v⟩=⟨λ,G′(u)v⟩Rm by Section 5. Hence, λ∈(ImG′(u))⊥Rm={0} by (i). Conversely, suppose (ii) holds and λ∈(ImG′(u))⊥Rm. Then, we have
[TABLE]
As a consequence, ∑k=1mλk∇Gk(u)=0 in H by density of V⊂H, thus λ=0 by (ii).
∎
Assumptions (i)-(iv) of Section 1 are satisfied if we choose Y=Y∗=H under the identification of H with its image in V∗. Note that the extension of G′(u) is given by
[TABLE]
Thus, assumption (v) of Section 1 is satisfied if and only if (∇Gk)′(uˉ):V→H is compact for all k=1,…,m which is exactly assumption (v) in Section 5. By Section 5, assumption (vi) in Section 1 is also satisfied. We conclude that there exists C,σ>0 and θ∈(0,21] such that for any u∈M with ∥u−uˉ∥V≤σ, we have
[TABLE]
By 3.1, TuM=kerG′(u).
Consequently, by (18), we have
[TABLE]
Hence, if P(u)∈L(H) denotes the orthogonal projection onto TuM⊂H we can estimate the right hand side of (19) by
In the setting of Section 5, we may deduce the following abstract convergence result for the associated gradient flow.
Corollary \thecor.
Let E,G be as in Section 5 with m=1 and suppose u∈C1([0,∞);V) is a solution of the constrained gradient flow equation
[TABLE]
where λ(u) is as in (17). Assume that {u(t)∣t≥0}⊂V is compact. Then limt→∞u(t) exists in V.
Remark \therem.
The key idea in the proof of Section 5 is the following (formal) computation, based on [33]. If E∣G−1{0} satisfies a refined Łojasiewicz–Simon gradient inequality near uˉ∈{u(t)∣t≥0}, then
[TABLE]
This implies ∂tu∈L1([0,∞);H) which yields the claim (see [11, Theorem 12.2] for a detailed presentation of this argument, with weaker regularity assumptions).
6 Optimality discussion
In this section, we will discuss why the assumptions in Section 1 and Section 5 cannot be omitted.
First, we provide an example, inspired by the Hilbert space case in [19, Theorem 2.1], which implies that in any Banach space of infinite dimension, there will exist an energy which fails to satisfy the Łojasiewicz–Simon gradient inequality. The construction relies on the following nontrivial fact.
Theorem \thethm.
Let V be a Banach space of infinite dimension and let ε>0. Then there exist sequences (en)n∈N⊂V with ∥en∥=1 for all n∈N and (ϕk)k∈N⊂V∗ with ∥ϕk∥≤2(1+ε) for all k∈N such that
Let V be a Banach space of infinite dimension and ε>0. Let (en)n∈N and (ϕk)k∈N be as in Section 6. Let λ∈ℓ1(N) with λk=0 for all k∈N. Then, x=0 is a critical point of the analytic energy
[TABLE]
but E satisfies no Lojasiewicz–Simon gradient inequality around x=0.
Proof.
First, we will prove that E is analytic. Indeed, we have E(x)=Φ(x,x), where Φ(x,y):=∑k=1∞λkϕk(x)ϕk(y) for x,y∈V. Note that Φ:V×V→R is bilinear and bounded by
[TABLE]
where β:=2(1+ε).
By Section 2.1, Φ and hence E is analytic. Furthermore, we find
[TABLE]
Clearly, E′(0)=0. For k∈N and t>0 we have
[TABLE]
and E(ten)=21λnt2. Thus, if E satisfied a Łojasiewicz–Simon gradient inequality for some C,σ,θ>0, for all n∈N and 0<t≤σ we would get
[TABLE]
Dividing by ∣λn∣ and letting n→∞ yields a contradiction, since λn→0 as λ∈ℓ1(N).
∎
It is not too difficult to see that the second derivative of E in Section 6 fails to be Fredholm. Consequently, condition (iii) in Section 1 (and Section 1) is violated, whereas conditions (i)-(ii) are satisfied with Z=H (Y=H, respectively), indicating that mere analyticity of the energy is not enough.
In fact, the following result shows that it is never sufficient in infinite dimensions.
Corollary \thecor.
Let V be a Banach space, and let U⊂V be open. Then, dimV<∞ if and only if every analytic function E∈Cω(U;R) satisfies a Łojasiewicz–Simon gradient inequality at each of its critical points.
For the sake of simplicity, throughout the rest of this section we restrict ourselves to the Hilbert space case in Section 5 with V=H. That way, assumptions (i), (ii) and (iv) are automatically satisfied if the energy and the constraint are analytic. The next example shows that we can not drop the compactness assumption (v) in Section 5.
Example \theexample.
Consider the Hilbert space H=R×ℓ2(N) and let λ∈ℓ1(N). We write elements x∈H as x=(x0,x′) with x′∈ℓ2(N). The natural norm on H is given by ∥x∥H2:=∣x0∣2+∥x′∥ℓ2(N)2.
For x=(x0,x′)∈H define
[TABLE]
Then E satisfies assumptions (i)-(iii) in Section 5 with V=H. We define
[TABLE]
and consider G:H→R,G(x):=x0−ψ(x′) and M:=G−1({0}). Then, E∣M does not satisfy a refined Łojasiewicz–Simon gradient inequality at the origin, but satisfies all assumptions of Section 5 with V=H except assumption (v).
Proof.
It is easy to see that E, ψ and G are analytic. Given xˉ,y∈H, a short computation yields (∇E)′(xˉ)y=(0,2y′), so the second derivative is Fredholm with index zero and the first part of the statement is proven.
Moreover, G possesses an H-gradient ∇G(x0,x′)=(1,(2(λn−1)xn′)n∈N)∈H and the gradient map is analytic. Also note that G(0)=0 and ∇G(0)=(1,0), so assumptions (iv) and (vi) of Section 5 are satisfied. By Section 2.4, M is an analytic submanifold of H near the origin, with a single chart φ:ℓ2(N)→H, φ(x′):=(ψ(x′),x′)∈H which coincides with the chart from Section 3 in this example.
However, note that the operator T:=(∇G)′(0):H→H, (∇G)′(0)(x0,x′)=(0,(2(λn−1)xn′)n∈N) is not compact. Indeed, let xk:=(0,ek′)∈H where ek′∈ℓ2(N) is the standard k-th unit vector. A short computation yields Txk⇀0, however Txk→0, since ∥Txk∥H=2∣λk−1∣→1, so T cannot be compact. For x′∈ℓ2(N) we have
[TABLE]
Similar to Section 6, one can show that E∘φ does not satisfy a Łojasiewicz–Simon gradient inequality at the origin x′=0 by assuming the inequality holds and then testing it with x′=ek′∈ℓ2(N) for all k∈N. Section 4 then implies that E∣M cannot satisfy a refined Łojasiewicz–Simon gradient inequality at x=0 either.222At this point it is crucial that in Section 4 we did not require assumption (v) of Section 1 to be satisfied.
∎
Section 6 shows that assumption (v) in Section 5 cannot be omitted. Note that while one can easily show using Section 1 that E as in Section 6 satisfies a Łojasiewicz–Simon gradient inequality, E∣M does not satisfy the refined inequality (3). In particular, the property of satisfying a Łojasiewicz–Simon gradient inequality does in general not behave well under the restriction to a submanifold, even if we assume finite codimension.
Let us also remark that condition (vi) in Section 5 is in general necessary to guarantee that M is a manifold.
Finally, note that our main result only considers submanifolds of finite codimension. The following example shows that our main result cannot be extended to the case of infinite codimension, even for linear subspaces.
Example \theexample.
Consider the Hilbert space H:=ℓ2(N)×ℓ2(N) with
[TABLE]
Then, the energy E:H→R,E(x,x′):=∑n=1∞∣xn∣2−∣xn′∣2 satisfies assumptions (i)-(iii) in Section 5 with V=H. For the constraint function G:H→ℓ2(N), (x,x′)↦(xn−1+n−2xn′)n∈N, the set M:=G−1({0}) is a linear subspace of H and (x,x′)=(0,0) is a constrained critical point of E∣M. Moreover, G and E are analytic, ∇G(0,0)=0 and 0=(∇G)′(0,0):H→H is compact. However, E∣M does not satisfy a refined Łojasiewicz–Simon gradient inequality near the origin.
Proof.
The properties of E and G can be shown as in in Section 6. The natural chart for M is given by
[TABLE]
and we observe that (E∘φ)(y)=∑n=1∞n21∣yn∣2 cannot satisfy a Łojasiewicz–Simon gradient inequality near x=0 by Section 6. Hence, by an argument similar to Section 4, neither does E∣M.
∎
7 Applications
In this section, we will apply our result from Section 1 to different energies on Sobolev spaces with isoperimetric constraints (cf. [16, Chapter 2.1]). Like in Section 5 this can be then used to conclude convergence for precompact solutions of the associated gradient flows.
7.1 Surface area with an isoperimetric constraint
Throughout this subsection, we assume that Ω⊂Rd is a domain with C1,1-boundary. We want to study the surface area or d-dimensional Hausdorff measure of graph(u)⊂Rd+1 given by
[TABLE]
Note that while this energy is already defined if we merely require u∈W1,1(Ω), a natural space to study a L2-gradient flow would be W2,2(Ω). However, we consider u∈V:=W2,p(Ω)∩W01,p(Ω) with d<p<∞ and Y:=Lq(Ω) where p1+q1=1. The condition on p and our choice of spaces will imply analyticity (cf. Section 1). We want to study E on the set of functions which satisfy the constraint
[TABLE]
where g:R→R is an analytic function. Note that the energy as well as the constraint are well defined since W2,p(Ω) embeds into both W1,1(Ω) and C(Ω) by [17, Corollary 7.11]. Moreover, V↪Lq(Ω) densely, so we get an induced embedding Lp(Ω)=Y∗↪V∗.
We recall the following important property of Nemytskii operators.
Let F∈C(R). Then, the superposition operatorF:C(Ω)→C(Ω), F(v)=F(v) is analytic if and only if the function F is.
Lemma \thelem.
The map G:V→R is analytic with
[TABLE]
In particular, G′(u)=g′(u)∈Y∗=Lp(Ω).
Proof.
By the embedding W2,p(Ω)↪C(Ω), the map V∋u↦u∈C(Ω) is analytic. Hence, so is V∋u↦g(u)∈C(Ω) by Section 7.1. Integrating is analytic by Section 2.1, since it is linear and bounded. Using Section 2.1, this yields G∈Cω(V;R). Furthermore, (24) follows, since for u,v∈V, we have
[TABLE]
Clearly, G′(u)=g′(u)∈Y∗=Lp(Ω), since g′(u)∈C(Ω)⊂Lp(Ω). Moreover, we have G′∈Cω(V,Y∗), since g′:R→R is analytic and so is V∋u↦g′(u)∈Lp(Ω) by the embeddings V↪C(Ω) and C(Ω)↪Lp(Ω) and using Sections 2.1 and 7.1.
∎
As a next step, we compute the second derivative of G.
Since u∈V↪C(Ω), g′′(u)∈C(Ω) and hence V∋v↦g′′(u)v∈Y∗=Lp(Ω) is compact, since the embedding V∋v↦v∈Lp(Ω) is compact. It follows that G′′(u):V→Y∗ is compact.
∎
Lemma \thelem.
The map E:V→R is analytic with
[TABLE]
Moreover, E′(u)∈Y∗=Lp(Ω) for all u∈V.
This analyticity statement motivates our choice of spaces in the beginning of this section.
Proof.
We first note that the following maps are analytic.
(i)
The embedding i:V↪C1(Ω) since d>p using Section 2.1.
2. (ii)
The map C1(Ω)→C(Ω),u↦∣∇u∣2 by Section 2.1, since it is the diagonal of a bounded bilinear map.
3. (iii)
The map
C(Ω;(−1,∞))⊂C(Ω)→C(Ω),F(v)=(1+v)−α
for α>0 by Section 7.1, since the map F:(−1,∞)→R,F(x)=(1+x)−α is analytic.
4. (iv)
The map
[TABLE]
for α>0 as a composition of the maps in (i)-(iii) using Section 2.1.
5. (v)
The map C(Ω)→R,v↦∫Ωvdx by Section 2.1, since it is linear and bounded.
Since E can be written as the composition of these maps, E is analytic. For a proof of (25), consider [16, Chapter 1, 2.2 Example 5]. Note that E′(u)∈Lp(Ω), since for u∈W2,p(Ω), we have using summation convention
[TABLE]
By the embedding W2,p(Ω)↪C1(Ω) and since the denominators are bounded from below, we conclude that E′(u)∈Lp(Ω) for u∈W2,p(Ω).
∎
Lemma \thelem.
The function E′:V→Lp(Ω),u↦E′(u) is analytic.
Proof.
The following maps are analytic.
(i)
The map V→Lp(Ω),u↦∂i∂ju for any i,j∈{1,…,d} by Section 2.1.
2. (ii)
The maps u↦(1+∣∇u∣2)−21,u↦(1+∣∇u∣2)−23:V→C(Ω) by (26).
3. (iii)
Since the pointwise multiplications C(Ω)×Lp(Ω)→Lp(Ω) and C(Ω)×C(Ω)→C(Ω) are bilinear and bounded, they are analytic by Section 2.1. Hence, so is u↦E′(u) by (27) and Section 2.1.
∎
Lemma \thelem.
Let u∈V. Then the Fréchet derivative E′′(u):V→Y∗ is Fredholm of index zero.
using summation convention, where K~,K:V→Lp(Ω) only contain terms in v of order 1 or lower, whence are compact by the Rellich–Kondrachov Theorem [17, Theorem 7.26].
It is easy to see that A uniformly is elliptic, hence A:W2,p(Ω)∩W01,p(Ω)→Lp(Ω) is an isomorphism by [17, Theorem 9.15]. Therefore, E′′(u)=−A+K:V→Y∗ is Fredholm of index zero by 2.1.
∎
Let uˉ∈V,G(uˉ)=0 with g′(uˉ)≡0 be a constraint critical point of E on M={u∈V∣G(u)=0}. Then, M is locally a manifold near uˉ and satisfies a Łojasiewicz–Simon gradient inequality on M, i.e. there exist C,σ>0 and θ∈(0,21] such that for any u∈M with ∥u−uˉ∥W2,p(Ω)≤σ, we have
[TABLE]
Proof.
We verify that Section 1 is applicable with U=V=W2,p(Ω)∩W01,p(Ω) and Y=Lq(Ω), so Y∗=Lp(Ω). Analyticity of G and E has been proven in Sections 7.1 and 7.1. Clearly V=W2,p(Ω)∩W01,p(Ω)↪Y∗ densely. Moreover, E′∈Cω(V;Y∗) by Section 7.1. The Fredholm property of E′′(uˉ):V→Y∗ has been established in Section 7.1. By Section 7.1, G′ extends analytically in the sense of assumption (iv) in Section 1. Moreover, the Fréchet derivative, G′′(uˉ):V→Y∗ is compact by Section 7.1. By assumption, G′(uˉ)=g′(uˉ)≡0, hence it is surjective as an operator Y→R.
Thus, by Section 1, M is locally a manifold near uˉ and
there exist C,σ>0 and θ∈(0,21] such that for u∈M with ∥u−uˉ∥W2,p≤σ, we have
[TABLE]
It remains to conclude (28) from (29). To that end, note that by 3.1 we have TuM=kerG′(u)={w∈Lq(Ω)∣∫Ωg′(u)wdx=0}. Hence, for any λ∈R, we have
If g(x)=x−Γ for some Γ>0 in (23), the energy E∣M corresponds to the restriction of the surface area of graph(u) on the set of graphs with fixed enclosed volume Γ with the Rd×{0}-hyperplane.
2. 2.
By considering the shifted energies E~(u)=E(u+β)
and G~(u)=G(u+β) for u∈V=W2,p(Ω)∩W01,p(Ω) and fixed β∈W2,p(Ω), the result can be extended to general Dirichlet boundary data.
3. 3.
Notice that in the proof of Section 7.1, we have some freedom in the choice of λ. Our choice is justified, since then any solution u=u(t) of the equation
[TABLE]
will preserve the constraint, i.e. G(u(t))=0 for all t, provided u is smooth enough.
4. 4.
It is not clear, whether our choice of λ is optimal in the sense that it minimizes the right hand side of (7.1). However, in the case p=2, so d=1, our choice of λ yields the orthogonal projection (cf. Section 5), and hence minimizes the right hand side of (7.1).
7.2 The Allen–Cahn equation
The following reaction-diffusion equation plays an important role in mathematical physics, modeling the process of phase separation [9],
[TABLE]
Here, T,ε>0 and Ω⊂Rd is a domain with C1,1-boundary.
Equation 31 is the L2-gradient flow of the Ginzburg–Landau free energy
[TABLE]
where f(u)=−W′(u). The function W describes some potential, a common choice is W(s)=4ε1(1−s2)2, the double well potential.
In their celebrated works [30, 31], L. Modica and S. Mortola proved that as ε→0, the energy Eε in (32) Γ-converges to the perimeter of a suitable level set.
For our result, we will consider ε>0 as being fixed, and therefore, we can assume ε=1 without loss of generality. We define V:=W2,2(Ω)∩W01,2(Ω) and write E:=E1.
In this subsection, we will use Section 1 to establish a Łojasiewicz–Simon gradient inequality for the constrained energy E∣M, where
M={u∈V∣G(u)=0} and
Let d≤3, W,g∈Cω(R;R). Suppose uˉ∈M is a constrained critical point of E∣M with g′(uˉ)≡0. Then M is locally a submanifold near uˉ and E∣M satisfies a Łojasiewicz–Simon gradient inequality, i.e. there exist C,σ>0 and θ∈(0,21] such that for any u∈M with ∥u−uˉ∥W2,2(Ω)≤σ we have
[TABLE]
Proof.
We can use the Hilbert space version of our main result, Section 5, with U=V=W2,2(Ω)∩W01,2(Ω) and H=L2(Ω). Clearly, V↪H densely. Moreover, we have ∇E(u)=−Δu+W′(u). Since W′∈Cω(R;R), V↪C(Ω) as d≤3 and C(Ω)↪H, we conclude from Section 7.1 that assumption (ii) in Section 5 is satisfied. For v∈V, we have
[TABLE]
Thus, using [17, Theorem 7.26], (∇E)′(uˉ):V→H is a compact perturbation of the Dirichlet-Laplacian −Δ:V→H, which is an isomorphism by standard elliptic theory [17, Theorem 9.15]. Thus, (∇E)′(uˉ) is Fredholm of index zero by 2.1. It follows from Section 7.1 that the H-gradient of G is given by ∇G(u)=g′(u), so assumption (iv) in Section 5 is clearly satisfied. Similar to Section 7.1, the Fréchet derivative (∇G)′(uˉ)v=g′′(uˉ)v is compact. Moreover, the assumption g′(uˉ)≡0 means that assumption (vi) in Section 5 is satisfied. Thus, (33) follows from (16) and Section 5.
∎
Remark \therem.
The condition d≤3 is only needed to prove analyticity of the energy on W2,2(Ω)∩W01,2(Ω). However, like in Section 7.1 one can consider different spaces to deal with the higher dimensional cases.
7.3 Area of surfaces of revolution with prescribed volume
In this subsection, we will discuss an application of Section 1 to the area of a surface of revolution with prescribed boundary and prescribed inclosed volume. To that end, let I=[a,b]⊂R be an interval, V:=W2,2(I)∩W01,2(I), H:=L2(I) and consider
[TABLE]
This is the area of the surface of revolution S obtained by rotating the graph of 1+u around the x-axis. Note that by requiring u∈V, we impose symmetric boundary conditions 1+u(a)=1+u(b)=1. In order to ensure that S is indeed a surface, we will study E on the set U:={u∈V∣1+u>0 on I}. Note that U⊂V is open by the Sobolev embedding W2,2(I)↪C(I).
We prescribe the volume inside S by some fixed value ν∈R, i.e. we require
[TABLE]
Unlike in Section 7.1, here we can work in the Hilbert space framework, since the embedding W2,2(I)↪C1(I) ensures that the energy E is analytic on U.
Lemma \thelem.
The energies E and G are analytic on U⊂V. Moreover, for u∈U,v∈V we have
[TABLE]
Moreover, the maps U∋u↦∇E(u)∈H and U∋u↦∇G(u)∈H are analytic.
The Fréchet derivative (∇E)′(u):V→H is Fredholm of index zero.
Proof.
Similar to Section 7.1, a short computation yields for u∈U and v∈V
[TABLE]
where K:V→H only contains terms in v of order 1 or lower and is hence compact by [17, Theorem 7.26]. Note that like in Section 7.1, A is an elliptic operator in v by the embeddings V↪C1(I)↪C(I) and the requirement u∈U. Thus, E′′(u) is Fredholm of index zero by 2.1.
∎
Lemma \thelem.
The operator (∇G)′(u):V→H is compact for all u∈U.
Proof.
The statement follows with the same ideas as in Section 7.1.
∎
Consequently, similar to Sections 7.1 and 7.2 we get the following result.
Theorem \thethm.
Let M:={u∈U∣G(u)=0}. Suppose uˉ∈M is a constrained critical point of E∣M. Then M is locally a submanifold near uˉ and satisfies a Łojasiewicz–Simon gradient inequality, i.e. there exist C,σ>0 and θ∈(0,21] such that for any u∈M with ∥u−uˉ∥W2,2(I)≤σ we have
[TABLE]
where the scalar λ(u) is given by
[TABLE]
Proof.
We use Section 5. It remains to check condition (vi). Clearly, G(uˉ)=0 and we have ∇G(uˉ)=2π(1+uˉ)≡0∈H=L2(I), since uˉ∈U. Thus E∣M satisfies a Łojasiewicz–Simon gradient inequality at uˉ. By Section 5, we get
[TABLE]
Using (34) and (35) yields (36) with λ(u)=∥∇G(u)∥L2(I)2⟨∇E(u),∇G(u)⟩. The explicit formula (37) for the scalar λ can be proven using integration by parts and G(u)=0.
∎
Following the notation in [10, Section 3], let V be a Banach space, E∈C2(U;R) with U⊂V open and let φ∈U be a critical point of E. Then M:=E′∈C1(U;V∗) and L:=E′′∈C(U;L(V,V∗)). In [10], the Łojasiewicz–Simon gradient inequality is proven under the following assumptions.
The kernel V0:=kerL(φ) is complemented, i.e. there exists a projection Q∈L(V) such that ImQ=V0.
Clearly, under the assumption of Appendix A, we have V=V0⊕V1 with V1=kerQ, and similarly V∗=V0∗⊕V1∗ with V0∗=ImQ∗ and V1∗=kerQ∗. Note that this abuse of notation is justified since V1∗={v∗∈V∗∣v∗(w)=0 for all w∈V0} is isomorphic to the dual of V0 and similarly for V0∗.
We will show that Appendices A and A and the assumptions of Appendix A are satisfied for φ:=uˉ. We follow [13, Appendix A], where the result was proven for a special case. Let Z as in Section 1, then (i) in Appendix A is satisfied for W:=Z∗ since V↪Z densely.
We set X:=V,Y:=W=Z∗.
Since L(φ)=E′′(uˉ):V→Z∗ is Fredholm by assumption (iii), its kernel V0:=kerL(φ)⊂V⊂Z is finite-dimensional, thus (iii) in Appendix A holds. Let d:=dimV0<∞ and note that V0 is closed in both V and Z and thus complemented in both spaces (cf. Section 2.3). This implies that there exists V1⊂V closed such that V=V0⊕V1 and there exists a projection Q~∈L(Z) onto V0. We will extend this to obtain a particular projection on Z onto V0. Note that Q:=Q~∣V:V→V is also continuous, since as V0=ImQ~=ImQ is finite-dimensional, there exist C,C′,C′′>0 such that ∥Qv∥V≤C∥Q~v∥Z≤C′∥v∥Z≤C′′∥v∥V for all v∈V, using that V↪Z. Now, Q∈L(V) satisfies Appendix A. Denote by Q∗∈L(V∗) the adjoint of Q.
Assumption (ii) in Section 1 immediately implies that assumption (iii) in Appendix A and (ii) in Appendix A are satisfied.
In order to prove Appendix A (iv), recall that by Schwarz’s Theorem (cf. [24, XIII, Theorem 5.3]), the second derivative L(φ):V→Z∗ is symmetric, i.e. for v,w∈V we have (L(φ)v)(w)=(L(φ)w)(v). Thus, for v∈V and for any w∈V0=kerL(φ), we have (L(φ)v)(w)=0, i.e.
[TABLE]
using V∗=V0∗⊕V1∗.
As a next step, we show that the inclusion in (38) is an equality. Since indL(φ)=0, we have codim(ImL(φ),Z∗)=dimkerL(φ)=dimV0=d. Moreover, we have codim(V1∗∩Z∗,Z∗)=d. Indeed, considering V0 as a subspace of Z, by (38), we have V1∗∩Z∗=V0⊥, the annihilator of V0 in Z∗. By [8, Proposition 11.13], we have d=dimV0=codimV0⊥=codim(V1∗∩Z∗,Z∗). Hence, ImL(φ) and V1∗∩Z∗ are two subspaces with the same finite codimension d in Z∗, with one contained in the other. Therefore, they have to be equal by [7, Proposition 5 in II §7. 3.]. Thus, equality holds in (38), so Appendix A (iv) and assumption (iv) in Appendix A are satisfied.
It remains to check that Q∗ leaves W=Z∗ invariant. Let z∗∈Z∗ and v∈Z. Then
(Q∗z∗)v=z∗(Qv) is linear in v and for C=∥z∗∥Z∗∥Q~∥L(Z)≥0, we have
[TABLE]
By (39), Q∗z∗:Z→R is an element of the dual space Z∗. This yields that Appendix A (ii) and assumption (i) in Appendix A are satisfied. Hence, we may apply Appendix A to conclude that E satisfies a Łojasiewicz–Simon gradient inequality in a neighborhood of φ=uˉ.
∎
Appendix B Basic sequences in Banach spaces
The goal of this subsection is to provide a proof of Section 6. We therefore use a generalization of the notion of an orthonormal basis in a Hilbert space to the Banach space situation. We follow the presentation of [3, Chapter 1].
Definition \thedefn(Cf. [3, Definition 1.1.2 and Definition 1.1.5]).
Let X be a Banach space and (en)n∈N⊂X be a sequence. Suppose there exists a sequence (ϕk)k∈N⊂X∗ such that
(i)
ϕk(en)=δk,n* for all k,j∈N,*
2. (ii)
v=∑k=1∞ϕk(v)ek* for all v∈X.*
Then, (en)n∈N is called a Schauder basis for X with associated biorthogonal functionals(ϕk)k∈N. A sequence (en)n∈N in a Banach space X is called a basic sequence if it is a Schauder basis for span{en∣n∈N}.
The following is an immediate consequence of the Uniform Boundedness Principle.
Let (en)n∈N be a Schauder basis for a Banach space X. Then the natural projectionsSN:X→X,SNv:=∑k=1Nϕk(v)ek are uniformly bounded in L(X), i.e. we have
[TABLE]
The number K is called the basis constant of the sequence (en)n∈N. The following existence result is what we need to prove Section 6.
Theorem \thethm.
Let V be an infinite-dimensional Banach space and let ε>0. Then there exists a basic sequence (en)n∈N with basis constant K≤1+ε and ∥en∥=1 for all n∈N.
Proof.
The existence of a basic sequence (en)n∈N with basis constant less than 1+ε is exactly the statement of [3, Corollary 1.5.3]. Investigating its proof, we note that the en are chosen from S:={x∈X∣∥x∥=1}, which proves the claim.
∎
Using Appendix B, we obtain a sequence (en)n∈N with ∥en∥=1 for all n∈N which is a basic sequence, i.e. a Schauder basis for the Banach space X:=span{en∣n∈N}⊂V. By Appendix B there exists an associated biorthogonal sequence (ϕk)k∈N⊂X∗. Note that by Appendix B, the basis constant K defined in (40) satisfies K≤1+ε. For x∈X and k∈N, we have setting S0:=0∈L(X)
[TABLE]
thus ∥ϕk∥X∗≤2K≤2(1+ε). By the Hahn–Banach Theorem, there exist extensions of ϕk:X→R also denoted ϕk:V→R with ∥ϕk∥V∗=∥ϕk∥X∗≤2(1+ε).
∎
Acknowledgments
The author would like to thank Anna Dall’Acqua, Marius Müller and Adrian Spener for helpful discussions and comments.
Funding
This work was supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation)-Projektnummer: 404870139.
Bibliography35
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1[1] R. Abraham, J. E. Marsden, and T. S. Ratiu. Manifolds, tensor analysis, and applications . Springer, New York, 2004.
2[2] P.-A. Absil, R. Mahony, and B. Andrews. Convergence of the iterates of descent methods for analytic cost functions. SIAM J. Optim. , 16(2):531–547, 2005.
3[3] F. Albiac and N. J. Kalton. Topics in Banach space theory , volume 233 of Graduate Texts in Mathematics . Springer, New York, 2006.
4[4] J. Appell and M. Väth. Elemente der Funktionalanalysis . Vieweg, Wiesbaden, 2005.
5[5] J. Appell and P. P. Zabrejko. Nonlinear superposition operators , volume 95 of Cambridge Tracts in Mathematics . Cambridge University Press, Cambridge, 1990.
6[6] H. Attouch, J. Bolte, P. Redont, and A. Soubeyran. Proximal alternating minimization and projection methods for nonconvex problems: an approach based on the Kurdyka-Łojasiewicz inequality. Math. Oper. Res. , 35(2):438–457, 2010.
7[7] N. Bourbaki. Algebra I. Chapters 1–3 . Elements of Mathematics (Berlin). Springer-Verlag, Berlin, 1998.
8[8] H. Brezis. Functional Analysis, Sobolev Spaces and Partial Differential Equations . Springer, New York, 2010.