Sufficient conditions for unique global solutions in optimal control of   semilinear equations with $C^1-$nonlinearity

A. Ahmad Ali; K. Deckelnick; and M. Hinze

arXiv:1902.09639·math.OC·February 27, 2019

Sufficient conditions for unique global solutions in optimal control of semilinear equations with $C^1-$nonlinearity

A. Ahmad Ali, K. Deckelnick, and M. Hinze

PDF

Open Access

TL;DR

This paper establishes sufficient conditions ensuring that solutions to certain semilinear elliptic optimal control problems are globally optimal, extending previous results and providing explicit criteria at both continuous and discrete levels.

Contribution

It generalizes prior work by deriving explicit global optimality conditions for semilinear elliptic control problems with $C^1$ nonlinearities, including the case of $ ext{sign}(s)$ nonlinearity.

Findings

01

Derived explicit global optimality conditions for continuous problems.

02

Extended conditions to discrete problem settings.

03

Numerical tests demonstrate practical applicability.

Abstract

We consider a $C^{1} -$ semilinear elliptic optimal control problem possibly subject to control and/or state constraints. Generalizing previous work we provide a condition which guarantees that a solution of the necessary first order conditions is a global minimum. A similiar result also holds at the discrete level where the corresponding condition can be evaluated explicitly. Our investigations are motivated by G\"unter Leugering, who raised the question whether our previous results can be extended to the nonlinearity $ϕ (s) = s ∣ s ∣$ . We develop a corresponding analysis and present several numerical test examples demonstrating its usefulness in practice.

Equations133

- Δ y + ϕ (\cdot, y)

- Δ y + ϕ (\cdot, y)

y

y \mapsto ϕ (x, y) \mbox i so f c l a ss C^{1} \mbox w i t h ϕ_{y} (x, y) \geq 0 \mbox f or a l m os t a l l x \in Ω;

y \mapsto ϕ (x, y) \mbox i so f c l a ss C^{1} \mbox w i t h ϕ_{y} (x, y) \geq 0 \mbox f or a l m os t a l l x \in Ω;

\forall L \geq 0 \exists c_{L} \geq 0 ϕ_{y} (x, y) \leq c_{L} \mbox f or a l m os t a l l x \in Ω \mbox an d a l l ∣ y ∣ \leq L .

(\mathbb{P})\quad\begin{array}[]{rcl}&&\min_{u\in U_{ad}}J(u):=\frac{1}{2}\|y-y_{0}\|_{L^{2}(\Omega)}^{2}+\frac{\alpha}{2}\|u\|_{L^{2}(\Omega)}^{2}\\ &&\mbox{subject to }y=\mathcal{G}(u)\mbox{ and }y_{a}(x)\leq y(x)\leq y_{b}(x)\mbox{ for all }x\in K.\end{array}

(\mathbb{P})\quad\begin{array}[]{rcl}&&\min_{u\in U_{ad}}J(u):=\frac{1}{2}\|y-y_{0}\|_{L^{2}(\Omega)}^{2}+\frac{\alpha}{2}\|u\|_{L^{2}(\Omega)}^{2}\\ &&\mbox{subject to }y=\mathcal{G}(u)\mbox{ and }y_{a}(x)\leq y(x)\leq y_{b}(x)\mbox{ for all }x\in K.\end{array}

\int_{Ω} \nabla \overset{y}{ˉ} \cdot \nabla v + ϕ (\cdot, \overset{y}{ˉ}) v d x = \int_{Ω} \overset{u}{ˉ} v d x \forall v \in H_{0}^{1} (Ω), y_{a} \leq \overset{y}{ˉ} \leq y_{b} \mbox in K,

\int_{Ω} \nabla \overset{y}{ˉ} \cdot \nabla v + ϕ (\cdot, \overset{y}{ˉ}) v d x = \int_{Ω} \overset{u}{ˉ} v d x \forall v \in H_{0}^{1} (Ω), y_{a} \leq \overset{y}{ˉ} \leq y_{b} \mbox in K,

\int_{Ω} \overset{p}{ˉ} (- Δ v) + ϕ_{y} (\cdot, \overset{y}{ˉ}) \overset{p}{ˉ} v d x = \int_{Ω} (\overset{y}{ˉ} - y_{0}) v d x + \int_{K} v d \overset{μ}{ˉ} \forall v \in H_{0}^{1} (Ω) \cap H^{2} (Ω),

\int_{Ω} (\overset{p}{ˉ} + α \overset{u}{ˉ}) (u - \overset{u}{ˉ}) d x \geq 0 \forall u \in U_{a d},

\int_{K} (z - \overset{y}{ˉ}) d \overset{μ}{ˉ} \leq 0 \forall z \in C^{0} (K), y_{a} \leq z \leq y_{b} \mbox in K .

∣ ϕ_{y y} (x, y) ∣ \leq M (ϕ_{y} (x, y))^{\frac{1}{r}} \mbox f or a l m os t a l l x \in Ω \mbox an d a l l y \in R .

∣ ϕ_{y y} (x, y) ∣ \leq M (ϕ_{y} (x, y))^{\frac{1}{r}} \mbox f or a l m os t a l l x \in Ω \mbox an d a l l y \in R .

\displaystyle\|\bar{p}\|_{L^{q}}\leq\big{(}\frac{r-1}{2r-1}\big{)}^{\frac{1-r}{r}}M^{-1}C^{\frac{2-2r}{r}}_{q}\alpha^{\frac{\rho}{2}}q^{1/q}r^{1/r}\rho^{\rho/2}(2-\rho)^{\frac{\rho}{2}-1},

\displaystyle\|\bar{p}\|_{L^{q}}\leq\big{(}\frac{r-1}{2r-1}\big{)}^{\frac{1-r}{r}}M^{-1}C^{\frac{2-2r}{r}}_{q}\alpha^{\frac{\rho}{2}}q^{1/q}r^{1/r}\rho^{\rho/2}(2-\rho)^{\frac{\rho}{2}-1},

J (u) - J (\overset{u}{ˉ}) = \frac{1}{2} ∥ y - \overset{y}{ˉ} ∥_{L^{2} (Ω)}^{2} + \frac{α}{2} ∥ u - \overset{u}{ˉ} ∥_{L^{2} (Ω)}^{2} + α \int_{Ω} \overset{u}{ˉ} (u - \overset{u}{ˉ}) d x + \int_{Ω} (\overset{y}{ˉ} - y_{0}) (y - \overset{y}{ˉ}) d x .

J (u) - J (\overset{u}{ˉ}) = \frac{1}{2} ∥ y - \overset{y}{ˉ} ∥_{L^{2} (Ω)}^{2} + \frac{α}{2} ∥ u - \overset{u}{ˉ} ∥_{L^{2} (Ω)}^{2} + α \int_{Ω} \overset{u}{ˉ} (u - \overset{u}{ˉ}) d x + \int_{Ω} (\overset{y}{ˉ} - y_{0}) (y - \overset{y}{ˉ}) d x .

\int_{Ω} (\overset{y}{ˉ} - y_{0}) (y - \overset{y}{ˉ}) d x = - \int_{Ω} \overset{p}{ˉ} Δ (y - \overset{y}{ˉ}) d x + \int_{Ω} ϕ_{y} (\cdot, \overset{y}{ˉ}) \overset{p}{ˉ} (y - \overset{y}{ˉ}) d x - \int_{K} (y - \overset{y}{ˉ}) d \overset{μ}{ˉ}

\int_{Ω} (\overset{y}{ˉ} - y_{0}) (y - \overset{y}{ˉ}) d x = - \int_{Ω} \overset{p}{ˉ} Δ (y - \overset{y}{ˉ}) d x + \int_{Ω} ϕ_{y} (\cdot, \overset{y}{ˉ}) \overset{p}{ˉ} (y - \overset{y}{ˉ}) d x - \int_{K} (y - \overset{y}{ˉ}) d \overset{μ}{ˉ}

J (u) - J (\overset{u}{ˉ}) \geq \frac{1}{2} ∥ y - \overset{y}{ˉ} ∥_{L^{2} (Ω)}^{2} + \frac{α}{2} ∥ u - \overset{u}{ˉ} ∥_{L^{2} (Ω)}^{2} - R (u),

J (u) - J (\overset{u}{ˉ}) \geq \frac{1}{2} ∥ y - \overset{y}{ˉ} ∥_{L^{2} (Ω)}^{2} + \frac{α}{2} ∥ u - \overset{u}{ˉ} ∥_{L^{2} (Ω)}^{2} - R (u),

R(u)=\int_{\Omega}\bigl{(}\phi(\cdot,y)-\phi(\cdot,\bar{y})-\phi_{y}(\cdot,\bar{y})(y-\bar{y})\bigr{)}\bar{p}\,dx.

R(u)=\int_{\Omega}\bigl{(}\phi(\cdot,y)-\phi(\cdot,\bar{y})-\phi_{y}(\cdot,\bar{y})(y-\bar{y})\bigr{)}\bar{p}\,dx.

ϕ (x, y (x)) - ϕ (x, \overset{y}{ˉ} (x)) - ϕ_{y} (x, \overset{y}{ˉ} (x)) (y (x) - \overset{y}{ˉ} (x)) \geq 0 \mbox f or a l m os t a l l x \in Ω

ϕ (x, y (x)) - ϕ (x, \overset{y}{ˉ} (x)) - ϕ_{y} (x, \overset{y}{ˉ} (x)) (y (x) - \overset{y}{ˉ} (x)) \geq 0 \mbox f or a l m os t a l l x \in Ω

- Δ y_{b} + ϕ (\cdot, y_{b}) \geq u_{b} \mbox in Ω, y_{b} \geq 0 \mbox o n \partial Ω.

- Δ y_{b} + ϕ (\cdot, y_{b}) \geq u_{b} \mbox in Ω, y_{b} \geq 0 \mbox o n \partial Ω.

\int_{Ω} ∣\nabla y^{-} ∣^{2} d x = - \int_{Ω} ϕ (\cdot, y^{-}) y^{-} d x + \int_{Ω} u y^{-} d x \leq 0

\int_{Ω} ∣\nabla y^{-} ∣^{2} d x = - \int_{Ω} ϕ (\cdot, y^{-}) y^{-} d x + \int_{Ω} u y^{-} d x \leq 0

- Δ (y - y_{b}) + [ϕ (\cdot, y) - ϕ (\cdot, y_{b})] \leq u - u_{b} \leq 0 \mbox a . e . in Ω.

- Δ (y - y_{b}) + [ϕ (\cdot, y) - ϕ (\cdot, y_{b})] \leq u - u_{b} \leq 0 \mbox a . e . in Ω.

- Δ \overset{p}{ˉ} + ϕ_{y} (\cdot, \overset{y}{ˉ}) \overset{p}{ˉ} = \overset{y}{ˉ} - y_{0} \leq y_{b} - y_{0} \leq 0 \mbox a . e . in Ω

- Δ \overset{p}{ˉ} + ϕ_{y} (\cdot, \overset{y}{ˉ}) \overset{p}{ˉ} = \overset{y}{ˉ} - y_{0} \leq y_{b} - y_{0} \leq 0 \mbox a . e . in Ω

\Big{|}\frac{\phi_{y}(x,y_{2})-\phi_{y}(x,y_{1})}{y_{2}-y_{1}}\Big{|}\leq M\,\Bigl{(}\frac{\phi(x,y_{2})-\phi(x,y_{1})}{y_{2}-y_{1}}\Bigr{)}^{\gamma}

\Big{|}\frac{\phi_{y}(x,y_{2})-\phi_{y}(x,y_{1})}{y_{2}-y_{1}}\Big{|}\leq M\,\Bigl{(}\frac{\phi(x,y_{2})-\phi(x,y_{1})}{y_{2}-y_{1}}\Bigr{)}^{\gamma}

a^{λ} b^{μ} \leq \frac{λ ^{λ} μ ^{μ}}{( λ + μ ) ^{λ + μ}} (a + b)^{λ + μ}, a, b \geq 0, λ, μ > 0,

a^{λ} b^{μ} \leq \frac{λ ^{λ} μ ^{μ}}{( λ + μ ) ^{λ + μ}} (a + b)^{λ + μ}, a, b \geq 0, λ, μ > 0,

∥ f ∥_{L^{q}} \leq C_{q} ∥ f ∥_{L^{2}}^{1 - θ} ∥\nabla f ∥_{L^{2}}^{θ}

∥ f ∥_{L^{q}} \leq C_{q} ∥ f ∥_{L^{2}}^{1 - θ} ∥\nabla f ∥_{L^{2}}^{θ}

\bar{p}\in L^{q}(\Omega)\left\{\begin{array}[]{ll}\mbox{ for every }1\leq q<\infty&\mbox{ if }d=2;\\ \mbox{ for every }1\leq q<3&\mbox{ if }d=3.\end{array}\right.

\bar{p}\in L^{q}(\Omega)\left\{\begin{array}[]{ll}\mbox{ for every }1\leq q<\infty&\mbox{ if }d=2;\\ \mbox{ for every }1\leq q<3&\mbox{ if }d=3.\end{array}\right.

\overset{p}{ˉ} \in L^{\infty} (Ω) \mbox i f K = \emptyset \mbox or K = \overset{ˉ}{Ω} \mbox w i t h y_{a}, y_{b} \in W^{2, \infty} (Ω) .

\overset{p}{ˉ} \in L^{\infty} (Ω) \mbox i f K = \emptyset \mbox or K = \overset{ˉ}{Ω} \mbox w i t h y_{a}, y_{b} \in W^{2, \infty} (Ω) .

\frac{1}{1 - γ} < q < \infty \mbox i f d = 2; \frac{3}{2 ( 1 - γ )} \leq q < 3 \mbox i f d = 3

\frac{1}{1 - γ} < q < \infty \mbox i f d = 2; \frac{3}{2 ( 1 - γ )} \leq q < 3 \mbox i f d = 3

\displaystyle\eta(\alpha,q,d):=\Bigl{(}\frac{1-\gamma}{2-\gamma}\Bigr{)}^{\gamma-1}M^{-1}C_{t}^{2(\gamma-1)}\alpha^{\frac{\rho}{2}}(\frac{d}{2q})^{-\frac{d}{2q}}\gamma^{-\gamma}(2-\rho)^{\frac{\rho}{2}-1}\rho^{\frac{\rho}{2}},

\displaystyle\eta(\alpha,q,d):=\Bigl{(}\frac{1-\gamma}{2-\gamma}\Bigr{)}^{\gamma-1}M^{-1}C_{t}^{2(\gamma-1)}\alpha^{\frac{\rho}{2}}(\frac{d}{2q})^{-\frac{d}{2q}}\gamma^{-\gamma}(2-\rho)^{\frac{\rho}{2}-1}\rho^{\frac{\rho}{2}},

∥ \overset{p}{ˉ} ∥_{L^{q}} \leq η (α, q, d)

∥ \overset{p}{ˉ} ∥_{L^{q}} \leq η (α, q, d)

R (u) = Ω \int \overset{p}{ˉ} (y - \overset{y}{ˉ}) 0 \int 1 [ϕ_{y} (\cdot, \overset{y}{ˉ} + t (y - \overset{y}{ˉ})) - ϕ_{y} (\cdot, \overset{y}{ˉ})] d t d x .

R (u) = Ω \int \overset{p}{ˉ} (y - \overset{y}{ˉ}) 0 \int 1 [ϕ_{y} (\cdot, \overset{y}{ˉ} + t (y - \overset{y}{ˉ})) - ϕ_{y} (\cdot, \overset{y}{ˉ})] d t d x .

\displaystyle\Big{|}\int_{0}^{1}[\phi_{y}(\cdot,y_{1}+t(y_{2}-y_{1}))-\phi_{y}(\cdot,y_{1})]dt\Big{|}

\displaystyle\Big{|}\int_{0}^{1}[\phi_{y}(\cdot,y_{1}+t(y_{2}-y_{1}))-\phi_{y}(\cdot,y_{1})]dt\Big{|}

ϕ_{ϵ} (y) := \int_{R} ζ_{ϵ} (z) ϕ (y - z) d z, y \in R,

ϕ_{ϵ} (y) := \int_{R} ζ_{ϵ} (z) ϕ (y - z) d z, y \in R,

ζ_{ϵ} \geq 0, \mbox s u pp ζ_{ϵ} \subset [- ϵ, ϵ], \mbox an d \int_{R} ζ_{ϵ} (z) d z = 1.

ζ_{ϵ} \geq 0, \mbox s u pp ζ_{ϵ} \subset [- ϵ, ϵ], \mbox an d \int_{R} ζ_{ϵ} (z) d z = 1.

ϕ_{ϵ}^{''} (y) = h \to 0 lim \int_{R} ζ_{ϵ} (z) \frac{ϕ ^{'} ( y + h - z ) - ϕ ^{'} ( y - z )}{h} d z

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Numerical Methods in Computational Mathematics · Computational Fluid Dynamics and Aerodynamics · Numerical methods for differential equations

Full text

Sufficient conditions for unique global solutions in optimal control of semilinear equations with $C^{1}-$ nonlinearity

Ahmad Ahmad Ali111Schwerpunkt Optimierung und Approximation, Universität Hamburg, Bundesstraße 55, 20146 Hamburg, Germany., Klaus Deckelnick222Institut für Analysis und Numerik, Otto–von–Guericke–Universität Magdeburg, Universitätsplatz 2, 39106 Magdeburg, Germany & Michael Hinze333Schwerpunkt Optimierung und Approximation, Universität Hamburg, Bundesstraße 55, 20146 Hamburg, Germany.

Abstract

We consider a semilinear elliptic optimal control problem possibly subject to control and/or state constraints. Generalizing previous work in [2] we provide a condition which guarantees that a solution of the necessary first order conditions is a global minimum. A similiar result also holds at the discrete level where the corresponding condition can be evaluated explicitly. Our investigations are motivated by Günter Leugering, who raised the question whether the problem class considered in [2] can be extended to the nonlinearity $\phi(s)=s|s|$ . We develop a corresponding analysis and present several numerical test examples demonstrating its usefulness in practice.

Dedicated to Günter Leugering on the occasion of his 65th birthday.

1 Introduction and problem setting

Let $\Omega\subset\mathbb{R}^{d}\,(d=2,3)$ be a bounded, convex polygonal/polyhedral domain, in which we consider the semilinear elliptic PDE

[TABLE]

We assume that $\phi:\bar{\Omega}\times\mathbb{R}\rightarrow\mathbb{R}$ is a Carathéodory function with $\phi(x,0)=0$ a.e. in $\Omega$ and that

[TABLE]

Under the above conditions it can be shown that for every $u\in L^{2}(\Omega)$ the boundary value problem (1.1), (1.2) has a unique solution $y=:\mathcal{G}(u)\in H^{2}(\Omega)\cap H^{1}_{0}(\Omega)$ . Next, let us introduce $U_{ad}:=\{v\in L^{2}(\Omega):u_{a}\leq v(x)\leq u_{b}\mbox{ a.e. in }\Omega\}$ , where $u_{a},u_{b}\in\mathbb{R}$ with $-\infty\leq u_{a}\leq u_{b}\leq\infty$ . For given $y_{0}\in L^{2}(\Omega),\,\alpha>0$ we then consider the optimal control problem

[TABLE]

Here, $y_{a},y_{b}\in C^{0}(\bar{\Omega})$ satisfy $y_{a}(x)<y_{b}(x)$ for all $x\in K$ , where $K\subset\bar{\Omega}$ is compact and either $K\subset\Omega$ or $K=\bar{\Omega}$ . In the latter case we suppose in addition that $y_{a}(x)<0<y_{b}(x),x\in\partial\Omega$ . It is well–known that $(\mathbb{P})$ has a solution provided that a feasible point exists (compare [5]). Under some constraint qualification, such as the linearized Slater condition, a local solution $\bar{u}\in U_{ad}$ of $(\mathbb{P})$ then satisfies the following necessary first order conditions, see [5, Theorem 5.2]: There exist $\bar{p}\in L^{2}(\Omega)$ and a regular Borel measure $\bar{\mu}\in\mathcal{M}(K)$ such that

[TABLE]

In view of the nonlinearity of the state equation problem $(\mathbb{P})$ is in general nonconvex and hence there may be several solutions of the conditions (1.5)–(1.8). The problem we are interested in is whether it is possible to establish sufficient conditions which guarantee that a solution of (1.5)–(1.8) is actually a global minimum of $(\mathbb{P})$ . A first result in this direction was obtained by the authors in [2] and holds for a class of nonlinearities which satisfy a certain growth condition:

Theorem 1.1.

([2, Theorem 3.2]) Let $d=2$ ; suppose that $y\mapsto\phi(x,y)$ belongs to $C^{2}$ for almost all $x\in\Omega$ and that there exist $r>1$ and $M\geq 0$ such that

[TABLE]

Assume that $(\bar{u},\bar{y},\bar{p},\bar{\mu})$ solves (1.5)–(1.8) and that

[TABLE]

where $q:=\tfrac{3r-2}{r-1},\,\rho:=\frac{r+q}{rq}$ and $C_{q}$ denotes the constant in (2.6) below. Then $\bar{u}$ is a global minimum for Problem $(\mathbb{P})$ . If the above inequality is strict, then $\bar{u}$ is the unique global minimum.

Assumption (1.9) is satisfied for $\phi_{q}(y):=|y|^{q-2}y$ provided that $q>3$ if we choose $r=\frac{q-2}{q-3}$ . Günter Leugering recently raised the question whether our theory can be extended to include the case $q=3$ . The corresponding nonlinearity $\phi_{3}(y)=|y|y$ appears for example in the mathematical modeling of gas flow through pipes with PDEs [16, (5.1)], so that an extension of Theorem 1.1 to this case could be helpful in understanding the optimal control of pipe networks. As $\phi_{3}$ is no longer $C^{2}$ it does not fit directly into the theory above. However it turns out that instead the analysis can be built on the fact that $\phi_{3,y}$ satisfies a global Lipschitz condition.

The purpose of this paper is to generalize Theorem 1.1 in several directions. To begin, we shall replace (1.9) by a condition that can be formulated for $C^{1}$ –nonlinearities $\phi$ and is satisfied by the functions $\phi_{q}$ for every $q\geq 3$ thus including the case suggested by Günter Leugering, see (2.4). A second generalization concerns the choice of the norm $\|\bar{p}\|_{L^{q}}$ in condition (1.10). Even though the integration index $q=\frac{3r-2}{r-1}$ is quite natural (solve $r=\frac{q-2}{q-3}$ for $q$ ), it is nevertheless possible to formulate a corresponding result not just for one index but for $q$ belonging to a suitable interval, see (2.9), thus giving additional flexibility in its application. Our arguments are natural extensions of the analysis presented in [2] and will also cover the case $d=3$ left out in Theorem 1.1.

There is a lot of literature available considering the problem $(\mathbb{P})$ . For a broad overview, we refer the reader to the references of the respective citations. In [5] this problem is studied for boundary controls. The regularity of optimal controls of $(\mathbb{P})$ and their associated multipliers is investigated in [12] and [11]. Sufficient second order conditions are discussed in e.g. [9, 7, 8] when the set $K$ contains finitely/infinitely many points. For the role of those conditions in PDE constrained optimization see e.g. [13].

The finite element discretization of problem $(\mathbb{P})$ in rather general settings is studied in [4, 10, 19]. Convergence rates for sets $K$ containing only finitely many points are established in [23] for finite dimensional controls, and in [6] for control functions. Only in [27, 3] an error analysis is provided for general pointwise state constraints in $K$ . Error analysis for linear-quadratic control problems can be found in e.g. [11], [14, 15] and [24]. Improved error estimates for the state in the case of weakly active state constraints are provided in [28]. A detailed discussion of discretization concepts and error analysis in PDE-constrained control problems can be found in [20, 21] and [17, Chapter 3].

The organization of the paper is as follows: in § 2 we shall develop the optimality conditions outlined above. In addition to the criteria based on an $L^{q}$ –norm of $\bar{p}$ we shall also include a result that uses a sign of $\bar{p}$ . The variational discretization of $(\mathbb{P})$ is considered in § 3 and is based on a finite element approximation of (1.1), (1.2) that uses numerical integration for the nonlinear term. We obtain corresponding optimality criteria for discrete stationary points and apply these conditions in a series of numerical tests in § 4 including the nonlinearity $\phi(y)=y|y|$ .

2 Optimality conditions for $(\mathbb{P})$

In what follows we assume that $(\bar{u},\bar{y},\bar{p},\bar{\mu})$ is a solution of (1.5)–(1.8). Let $u\in U_{ad}$ be a feasible control, $y=\mathcal{G}(u)$ the associated state such that $y_{a}\leq y\leq y_{b}$ in $K$ . A straightforward calculation shows that

[TABLE]

Combining (1.6) for $v:=y-\bar{y}$ with (1.8) and (1.1) we deduce that

[TABLE]

Inserting this relation into (2.1) and recalling (1.7) we finally obtain

[TABLE]

where

[TABLE]

2.1 Conditions involving a sign of $\bar{p}$

A natural first idea to deduce global optimality from (2.2) consists in identifying situations in which $R(u)\leq 0$ for all $u\in U_{ad}$ . We have the following result:

Theorem 2.1.

Suppose that there exists an interval $I\subset\mathbb{R}$ such that $y\mapsto\phi(x,y)$ is convex (concave) on $I$ for almost all $x\in\Omega$ . Furthermore, assume that for every $u\in U_{ad}$ the solution $y=\mathcal{G}(u)$ with $y_{a}\leq y\leq y_{b}$ in $K$ satisfies $y(x)\in I$ for all $x\in\Omega$ . If $\bar{p}\leq 0\;(\bar{p}\geq 0)$ a.e. on $\Omega$ , then $\bar{u}$ is the unique global minimum of $(\mathbb{P})$ .

Proof.

Suppose that $y\mapsto\phi(x,y)$ is convex. Then our assumptions imply that

[TABLE]

which yields that $R(u)\leq 0$ since $\bar{p}\leq 0$ a.e. in $\Omega$ . Hence $J(u)>J(\bar{u})$ for $u\neq\bar{u}$ by (2.2).

In general we cannot expect the adjoint variable $\bar{p}$ to have a sign without additional conditions on the data of the problem. The following result is similar in spirit to a sufficient condition involving a suitable bound on $y_{0}$ obtained in [25, Theorem 5.4] and [22, Section 5.2] for the optimal control of the obstacle problem.

Lemma 2.2.

Suppose that $K=\emptyset$ and that $u_{a}=0,u_{b}<\infty$ . Let $y_{b}\in H^{2}(\Omega)$ satisfy

[TABLE]

Then $0\leq\mathcal{G}(u)\leq y_{b}$ in $\bar{\Omega}$ for every $u\in U_{ad}$ . Also, if $y_{0}\geq y_{b}$ a.e. in $\Omega$ , then $\bar{p}\leq 0$ in $\Omega$ .

Proof.

Let $u\in U_{ad}$ and set $y=\mathcal{G}(u)$ . If we test (1.5) with $v=y^{-}$ we have

[TABLE]

using (1.3), the fact that $\phi(\cdot,0)=0$ as well as $u\geq 0$ . We infer that $y^{-}\equiv 0$ and hence $y\geq 0$ in $\bar{\Omega}$ . Next, $y-y_{b}$ satisfies

[TABLE]

Testing with $(y-y_{b})^{+}$ then gives $y\leq y_{b}$ in $\bar{\Omega}$ . Finally, since $K=\emptyset$ , the adjoint state satisfies

[TABLE]

since $\bar{y}\leq y_{b}$ by what we have already shown. We infer that $\bar{p}\leq 0$ in a similar way as above.

Example 2.3.

Let $a\in L^{\infty}(\Omega)$ with $a\geq 0$ a.e. in $\Omega$ . Then the functions $\phi(x,y)=e^{a(x)y}-1$ and $\phi(x,y)=a(x)|y|^{q-2}y\;(q\geq 3)$ are convex on $\mathbb{R}$ and $[0,\infty)$ respectively. Hence if $K=\emptyset$ and $u_{a},u_{b}$ and $y_{0}$ are chosen as in Lemma 2.2, then Theorem 2.1 and Lemma 2.2 imply that a solution of the necessary first order conditions will be the unique global minimum of $(\mathbb{P})$ .

2.2 Conditions involving a bound on $\|\bar{p}\|_{L^{q}}$

As mentioned above it will in general not be possible to establish a sign on the adjoint variable $\bar{p}$ , so that one is left with trying to bound $|R(u)|$ in terms of $\frac{1}{2}\|y-\bar{y}\|_{L^{2}(\Omega)}^{2}+\frac{\alpha}{2}\|u-\bar{u}\|_{L^{2}(\Omega)}^{2}$ . In what follows we shall assume that there exists $\gamma\in[0,1)$ and $M\geq 0$ such that

[TABLE]

for almost all $x\in\Omega$ and for all $y_{1},y_{2}\in\mathbb{R},y_{1}\neq y_{2}$ . Note that (2.4) holds with $\gamma=0$ if $y\mapsto\phi_{y}(x,y)$ is globally Lipschitz uniformly in $x\in\Omega$ . Furthermore, it is not difficult to verify that (2.4) is satisfied with $\gamma=\frac{1}{r}$ provided that (1.9) holds.

Example 2.4.

Let $\phi(x,y)=a(x)|y|^{q-2}y$ , where $q\geq 3$ and $a\in L^{\infty}(\Omega)$ with $a(x)\geq 0$ a.e. in $\Omega$ . Then, $\phi$ satisfies (2.4) with $\gamma=\tfrac{q-3}{q-2}$ and $M=(q-2)(q-1)^{\frac{1}{q-2}}\|a\|^{\frac{1}{q-2}}_{L^{\infty}(\Omega)}$ .

In what follows we shall make use of the elementary inequality (see e.g. [2, Lemma 7.1])

[TABLE]

as well as of the Gagliardo–Nirenberg interpolation inequality

[TABLE]

where $\theta=d(\frac{1}{2}-\frac{1}{q})$ and $2\leq q<\infty$ if $d=2$ and $2\leq q\leq 6$ if $d=3$ . Explicit values for the constant $C_{q}$ in (2.6) can e.g. be found in [26] and [29], see also [2, Theorem 7.3].

Before we state our main result we mention that it is well–known that $\bar{p}\in W^{1,s}_{0}(\Omega)$ for all $s\in[1,\frac{d}{d-1})$ . In particular we infer with the help of a standard embedding result that

[TABLE]

Furthermore, we have that

[TABLE]

In order to see (2.8) we note that $\bar{p}\in H^{2}(\Omega)\hookrightarrow L^{\infty}(\Omega)$ by elliptic regularity theory if $K=\emptyset$ . On the other hand, if $K=\bar{\Omega}$ with $y_{a},y_{b}\in W^{2,\infty}(\Omega)$ we may apply Theorem 3.1 and Section 4.2 in [11] to obtain that $\bar{p}\in L^{\infty}(\Omega)$ .

Theorem 2.5.

Assume that $\phi$ satisfies (2.4) and let $(\bar{u},\bar{y},\bar{p},\bar{\mu})\in U_{ad}\times(H^{2}(\Omega)\cap H^{1}_{0}(\Omega))\times L^{2}(\Omega)\times\mathcal{M}(K)$ be a solution of (1.5)–(1.8). Furthermore, choose $q>1$ such that

[TABLE]

and define for $t:=\frac{2q(1-\gamma)}{q(1-\gamma)-1}$ and $\rho:=\frac{d}{2q}+\gamma$ the quantity

[TABLE]

where $C_{t}$ is the constant in (2.6). If the inequality

[TABLE]

is satisfied, then $\bar{u}$ is a global minimum for Problem $(\mathbb{P})$ . If the inequality (2.11) is strict, then $\bar{u}$ is the unique global minimum. The assertions hold for $\frac{3}{2(1-\gamma)}\leq q<\infty$ and $d=3$ provided that $K=\emptyset$ or $K=\bar{\Omega}$ with $y_{a},y_{b}\in W^{2,\infty}(\Omega)$ .

Proof.

To begin, note that (2.7) and (2.8) imply that $\bar{p}\in L^{q}(\Omega)$ for the cases that we consider. Our starting point is again (2.2) in which we write the remainder term as

[TABLE]

We claim that for all $y_{1},y_{2}\in\mathbb{R},y_{1}\neq y_{2}$ we have

[TABLE]

where $L_{\gamma}=M\bigl{(}\frac{1-\gamma}{2-\gamma}\bigr{)}^{1-\gamma}$ . To see this, let us suppress temporarily the dependence on $x$ and introduce

[TABLE]

where $(\zeta_{\epsilon})_{0<\epsilon<1}\subset C^{\infty}_{0}(\mathbb{R})$ is a sequence of mollifiers satisfying

[TABLE]

Since $\phi_{\epsilon}^{\prime}(y)=\int_{\mathbb{R}}\zeta_{\epsilon}(z)\phi^{\prime}(y-z)\,dz$ we have that

[TABLE]

so that we obtain with the help of (2.4) and Hölder’s inequality

[TABLE]

We may therefore apply Lemma 7.2 in [2] for $\gamma\in(0,1)$ to deduce that

[TABLE]

but the above estimate easily extends to the case $\gamma=0$ . The bound (2.2) now follows by sending $\epsilon\rightarrow 0$ . If we insert (2.2) into (2.12) we find that

[TABLE]

where we have used Hölder’s inequality with exponents $q,r=\frac{1}{\gamma}$ and $s=\frac{q}{q(1-\gamma)-1}$ . Note that

[TABLE]

in view of our assumptions on $q$ . We may therefore use (2.6) in order to estimate $\|y-\bar{y}\|_{L^{t}}$ and obtain with

[TABLE]

that

[TABLE]

Applying (2.5) with $\lambda=\frac{d}{2q}$ and $\mu=\gamma$ and recalling that $\rho=\frac{d}{2q}+\gamma$ we may continue

[TABLE]

If we take the difference of the PDEs satisfied by $\bar{y}$ and $y$ and test it with $y-\bar{y}$ we easily deduce that

[TABLE]

which yields

[TABLE]

Using once more (2.5), this time with $\lambda=1-\frac{\rho}{2},\mu=\frac{\rho}{2}$ we finally deduce that

[TABLE]

If we use this estimate in (2.2) and recall (2.10) as well as $L_{\gamma}=M\bigl{(}\frac{1-\gamma}{2-\gamma}\bigr{)}^{1-\gamma}$ we infer that $J(u)-J(\bar{u})\geq 0$ provided that (2.11) holds, so that $\bar{u}$ is a global solution of problem $(\mathbb{P})$ . If the inequality in (2.11) is strict, then $\bar{u}$ is the unique global minimum of problem $(\mathbb{P})$ .

Remark 2.6.

Suppose that $d=2$ and that $\phi$ satisfies (1.9) for some $r>1,M\geq 0$ , so that (2.4) holds with $\gamma=\frac{1}{r}$ . If we set $q:=\frac{3r-2}{r-1}$ , then $q$ satisfies (2.9) while $t=q$ and $\rho=\frac{1}{q}+\frac{1}{r}=\frac{r+q}{rq}$ , so that Theorem 1.1 is a special case of Theorem 2.5.

3 Variational discretization

In this section we consider the case $d=2$ and let $\mathcal{T}_{h}$ be an admissible triangulation of $\Omega\subset\mathbb{R}^{2}$ . We introduce the following spaces of linear finite elements:

[TABLE]

The Lagrange interpolation operator $I_{h}$ is defined by

[TABLE]

where $x_{1},\ldots,x_{n}$ denote the nodes in the triangulation $\mathcal{T}_{h}$ and $\{\phi_{1},\ldots,\phi_{n}\}$ is the set of basis functions of the space $X_{h}$ which satisfy $\phi_{i}(x_{j})=\delta_{ij}$ . We discretize (1.1), (1.2) using numerical integration for the nonlinear part: for a given $u\in L^{2}(\Omega)$ , find $y_{h}\in X_{h0}$ such that

[TABLE]

Using the monotonicity of $y\mapsto\phi(\cdot,y)$ and the Brouwer fixed-point theorem one can show that (3.1) admits a unique solution $y_{h}=:\mathcal{G}_{h}(u)\in X_{h0}$ . The variational discretization (see [18]) of Problem $(\mathbb{P})$ then reads:

[TABLE]

where $\mathcal{N}_{h}:=\{x_{j}\,|\,x_{j}\mbox{ is a node of }T\in\mathcal{T}_{h},\mbox{ such that }T\cap K\neq\emptyset\}$ . It can be shown that $(\mathbb{P}_{h})$ has a solution, provided that a feasible point exists. In practice, candidates for solutions are calculated by solving the system of necessary first order conditions which reads: find $\bar{u}_{h}\in U_{ad},\bar{y}_{h}\in X_{h0},\bar{p}_{h}\in X_{h0},\bar{\mu}_{j}\in\mathbb{R},x_{j}\in\mathcal{N}_{h}$ such that $y_{a}(x_{j})\leq y_{h}(x_{j})\leq y_{b}(x_{j}),x_{j}\in\mathcal{N}_{h}$ and

[TABLE]

In order to formulate the analogue of Theorem 2.5 we introduce the following $h$ –dependent norm on $X_{h}$ :

[TABLE]

Theorem 3.1.

Suppose that $\phi$ and $q>1$ satisfy the conditions (2.4) and (2.9) respectively and let $\bar{u}_{h}\in U_{ad}$ , $\bar{y}_{h}\in X_{h0}$ , $\bar{p}_{h}\in X_{h0}$ , $(\bar{\mu}_{j})_{x_{j}\in\mathcal{N}_{h}}$ be a solution of (3.2)–(3.5). If

[TABLE]

then $\bar{u}_{h}$ is a global minimum for Problem $(\mathbb{P}_{h})$ . If the inequality (3.6) is strict, then $\bar{u}_{h}$ is the unique global minimum.

Proof.

Just as in the continuous case we obtain for $u\in U_{ad}$ with $y_{h}=\mathcal{G}_{h}(u)$

[TABLE]

where

[TABLE]

If we use (2.2) then we obtain as above with the help of Hölder’s inequality

[TABLE]

where $s=\frac{q}{q(1-\gamma)-1}$ . Applying Lemma 5.1 in the Appendix we derive

[TABLE]

which is the analogue of (2.2). The rest of the proof now follows in the same way as in Theorem 2.5, where we use (3.1) instead of the PDEs.

We shall investigate condition (3.6) for different choices of $\phi$ and $q$ in the numerics section. From the numerical analysis point of view it is also possible to examine the convergence of a sequence of solutions $(\bar{u}_{h},\bar{y}_{h},\bar{p}_{h},(\bar{\mu}_{j})_{x_{j}\in\mathcal{N}_{h}})_{0<h<h_{0}}$ of (3.2)–(3.5) that satisfy (3.6) uniformly in $h$ . Based on Theorem 1.1, convergence in $L^{2}(\Omega)$ of $(\bar{u}_{h})_{0<h<h_{0}}$ to a solution $\bar{u}$ of $(\mathbb{P})$ has been obtained in [2, Theorem 4.2], while an error estimate is proved in [1, 3]. We expect that these results carry over to the generalized framework considered in this paper. In this context we also refer to [27] as a further contribution to the error analysis for optimal control of semilinear equations with pointwise bounds on the state. Contrary to our approach this work is based on second order sufficient optimality conditions for a local solution of the control problem and requires in particular a $C^{2}$ –nonlinearity $\phi$ .

4 Numerical experiments

In this section we conduct several numerical experiments related to Theorem 3.1. We consider $(\mathbb{P})$ with different choices for the nonlinearity $\phi$ . For each choice we fix $\Omega:=(0,1)\times(0,1)$ , while for the desired state $y_{0}$ we consider the following two scenarios:

A1: (Reachable desired state) $y_{0}(x):=2\sin(2\pi x_{1})\sin(2\pi x_{2})$ .

A2: (Not reachable desired state) $y_{0}(x):=60+160(x_{1}(x_{1}-1)+x_{2}(x_{2}-1))$ .

For the control and state bounds we consider these three cases:

Case 1: (Unconstrained problem) $u_{b}=-u_{a}=\infty$ , $K=\emptyset$ .

Case 2: (Control constrained problem) $u_{b}=-u_{a}=5$ , $K=\emptyset$ .

Case 3: (State constrained problem) $u_{b}=-u_{a}=\infty$ , $K=\bar{\Omega},y_{b}\equiv-y_{a}\equiv 1$ .

For $\alpha$ we report numerical results for the values $\alpha=10^{i}$ , $i=-6,-5,\ldots,3$ . The domain $\Omega$ is partitioned using a uniform triangulation with mesh size $h=2^{-5}\sqrt{2}$ , and the discrete counterpart of the problem is as in Section 3. The resulting discrete optimality system (3.2)–(3.5) is solved using the semismooth Newton method.

Example 4.1.

We consider $\phi(y):=y|y|$ . Then, $\gamma=0$ with $M=2$ . Taking $q=2$ , the condition reads

[TABLE]

with

[TABLE]

The results are reported in Figure 1. We see that in the light of Theorem 3.1, the unique global solution of the considered control problem has been computed for all given values of $\alpha$ , except for case 2 when $\alpha\leq 10^{-3}$ . There, no conclusion can be derived. However, with the coefficient $a(x):=\frac{1}{8}$ we obtain a global unique solution for the whole considered parameter range, see Fig. 2.

Example 4.2.

We consider $\phi(y):=y^{3}$ . Then, $\gamma=0.5$ with $M=2\sqrt{3}$ . Taking $q=3$ , the condition reads

[TABLE]

with

[TABLE]

The choice of $q=3$ is motivated by fact that among the possible choices of the Gagliardo-Nirenberg constant the value of $C_{6}$ is among the smallest possible ones, see [2, Figure 4]. The integrals involving $\phi$ , and the norm $\|\bar{p}_{h}\|_{L^{3}(\Omega)}$ are computed exactly. The results are reported in Figure 3. We for comparison also include the results for $q=4$ which correspond to the findings of [2, Example 2]. As one can see this choice in some situations delivers larger uniqueness intervalls for $\alpha$ . Overall, uniqueness of the global solution can be deduced for certain ranges of the parameter $\alpha$ , where it is more likely in the case of a reachable desired state $y_{0}$ .

Example 4.3.

We consider $\phi(y):=y^{5}$ . Then, $\gamma=3/4$ with $M=4\times(5)^{1/4}$ . Taking $q=6$ , the condition reads

[TABLE]

with

[TABLE]

The choice of $q=6$ is motivated as in the previous example. This then is the situation of [2, Example 3]. For comparison we also include the results obtained with quadrature based on the estimate (3.6). As one can see the differences in both approaches (exact integration versus quadrature) is negligible. The results are reported in Figure 4.

5 Appendix

Lemma 5.1.

Let $d=2$ and $2\leq q<\infty$ . Then

[TABLE]

Proof.

Let us denote by $\hat{T}\subset\mathbb{R}^{2}$ the unit simplex with vertices $\hat{a}_{0}=(0,0),\hat{a}_{1}=(1,0)$ and $\hat{a}_{2}=(0,1)$ . Using a scaling argument it is sufficient to show that

[TABLE]

where $\hat{I}_{h}f=\sum_{j=0}^{2}f(\hat{a}_{j})\hat{\phi}_{j}$ and $\hat{\phi}_{j}(\hat{a}_{i})=\delta_{ij}$ . In order to see the first inequality in (5.1) we observe that

[TABLE]

in view of the convexity of $t\mapsto|t|^{q}$ and the properties of $\hat{\phi}_{j},j=0,1,2$ . Let us next consider the remaining estimate and first focus on the case $q=2$ . A straightforward calculation shows that

[TABLE]

which implies that

[TABLE]

Let us introduce the measure $\mu:=\sum_{j=0}^{2}m_{j}\delta_{\hat{a}_{j}}$ with $m_{j}=\int_{\hat{T}}\hat{\phi}_{j}d\hat{x}=\frac{1}{6},j=0,1,2$ . Clearly,

[TABLE]

Now, (5.2) yields that $\|p\|_{L^{2}(\mu)}\leq 2\|p\|_{L^{2}(d\hat{x})}$ , while $\|p\|_{L^{\infty}(\mu)}\leq\|p\|_{L^{\infty}(d\hat{x})}$ , so that the Riesz–Thorin convexity theorem implies that

[TABLE]

which is (5.1).

Bibliography29

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A. Ahmad Ali, Optimal Control of Semilinear Elliptic PD Es with State Constraints - Numerical Analysis and Implementation , Ph D thesis, Dissertation, Hamburg, Universität Hamburg, 2017.
2[2] A. Ahmad Ali, K. Deckelnick and M. Hinze, Global minima for semilinear optimal control problems, Computational Optimization and Applications , 65 (2016), 261–288.
3[3] A. Ahmad Ali, K. Deckelnick and M. Hinze, Error analysis for global minima of semilinear optimal control problems, Mathematical Control and related Fields (MCRF) 8 (2018).
4[4] N. Arada, E. Casas and F. Tröltzsch, Error estimates for the numerical approximation of a semilinear elliptic control problem, Computational Optimization and Applications , 23 (2002), 201–229.
5[5] E. Casas, Boundary control of semilinear elliptic equations with pointwise state constraints, SIAM Journal on Control and Optimization , 31 (1993), 993–1006.
6[6] E. Casas, Error estimates for the numerical approximation of semilinear elliptic control problems with finitely many state constraints, ESAIM: Control, Optimisation and Calculus of Variations , 8 (2002), 345–374.
7[7] E. Casas, Necessary and sufficient optimality conditions for elliptic control problems with finitely many pointwise state constraints, ESAIM: Control, Optimisation and Calculus of Variations , 14 (2008), 575–589.
8[8] E. Casas, J. C. De Los Reyes and F. Tröltzsch, Sufficient second-order optimality conditions for semilinear control problems with pointwise state constraints, SIAM Journal on Optimization , 19 (2008), 616–643.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Sufficient conditions for unique global solutions in optimal control of semilinear equations with C1−C^{1}-C1−nonlinearity

Abstract

1 Introduction and problem setting

Theorem 1.1**.**

2 Optimality conditions for (P)(\mathbb{P})(P)

2.1 Conditions involving a sign of pˉ\bar{p}pˉ​

Theorem 2.1**.**

Proof.

Lemma 2.2**.**

Proof.

Example 2.3**.**

2.2 Conditions involving a bound on ∥pˉ∥Lq\|\bar{p}\|_{L^{q}}∥pˉ​∥Lq​

Example 2.4**.**

Theorem 2.5**.**

Proof.

Remark 2.6**.**

3 Variational discretization

Theorem 3.1**.**

Proof.

4 Numerical experiments

Example 4.1**.**

Example 4.2**.**

Example 4.3**.**

5 Appendix

Lemma 5.1**.**

Proof.

Sufficient conditions for unique global solutions in optimal control of semilinear equations with $C^{1}-$ nonlinearity

Theorem 1.1.

2 Optimality conditions for $(\mathbb{P})$

2.1 Conditions involving a sign of $\bar{p}$

Theorem 2.1.

Lemma 2.2.

Example 2.3.

2.2 Conditions involving a bound on $\|\bar{p}\|_{L^{q}}$

Example 2.4.

Theorem 2.5.

Remark 2.6.

Theorem 3.1.

Example 4.1.

Example 4.2.

Example 4.3.

Lemma 5.1.