Optimal control of elliptic equations with positive measures

Christian Clason; Anton Schiela

arXiv:1702.07528·math.OC·February 27, 2017

TL;DR

This paper develops a framework for solving optimal control problems involving elliptic equations with positive measure controls, establishing existence, deriving optimality conditions, and proposing a numerical solution method.

Contribution

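It shows that unilateral control constraints, together with a strict positivity assumption on a pre-adjoint state equation, guarantee existence of optimal controls in the space of Radon measures. It derives optimality conditions via Fenchel duality, using a non-standard perturbation approach when the control-to-observation mapping is not continuous, and presents a conforming discretization combined with a semismooth Newton method for computation.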

Findings

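01 Existence of optimal controls in the space of Radon measures under unilateral constraints and a strict positivity assumption on the pre-adjoint state equation.

02 Derivation of optimality conditions via Fenchel duality.

03 Numerical method combining a conforming discretization of the measure space with a semismooth Newton algorithm.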

Abstract

Optimal control problems without control costs in general do not possess solutions due to the lack of coercivity. However, unilateral constraints, together with the assumption of existence of strictly positive solutions of a pre-adjoint state equation, are sufficient to obtain existence of optimal solutions in the space of Radon measures. Optimality conditions for these generalized minimizers can be obtained using Fenchel duality, which requires a non-standard perturbation approach if the control-to-observation mapping is not continuous (e.g., for Neumann boundary control in three dimensions). Combining a conforming discretization of the measure space with a semismooth Newton method allows the numerical solution of the optimal control problem.

Equations

$$\inf_{y,u}\ \frac{1}{2}\|Ey-y_{d}\|_{L^{2}({\omega_{o}})}^{2}\quad\text{s.\,t.}\quad Ay-Bu=0,\quad u\geq 0,$$

$$a(y,p):=\int_{\Omega}\Big[\sum_{i,j=1}^{d}a_{ij}(x)\,y_{x_{i}}p_{x_{j}}+c(x)\,y\,p\Big]\,dx+\int_{\partial\Omega}r(x)\,y\,p\,ds.$$

$$\sum_{i,j=1}^{d}a_{ij}(x)\,\xi_{i}\xi_{j}\geq a_{0}|\xi|^{2}\quad\text{for all }\xi\in\mathbb{R}^{d}\text{ and almost all }x\in\Omega.$$

$$a(y,y)\geq c_{1}\|y\|_{H^{1}(\Omega)}^{2}\quad\text{for all }y\in H^{1}(\Omega).$$

$$a(\cdot,\cdot):W^{1,q^{\prime}}(\Omega)\times W^{1,q}(\Omega)\to\mathbb{R}.$$

$$\operatorname{dom}\prescript{*\!}{}{\!A}:=\{p\in H^{1}(\Omega):\exists\,c_{p}\in\mathbb{R}\text{ with }a(y,p)\leq c_{p}\|y\|_{W^{1,q^{\prime}}(\Omega)}\ \forall y\in H^{1}(\Omega)\}.$$

$$\overline{a}(y,p):=\lim_{n\to\infty}a(y_{n},p)\quad\text{for all }(y,p)\in W^{1,q^{\prime}}(\Omega)\times\operatorname{dom}\prescript{*\!}{}{\!A},$$

$$A:W^{1,q^{\prime}}(\Omega)\supset\operatorname{dom}A\to\mathcal{M}(\overline{\Omega}),$$

$$\operatorname{dom}A:=\{y\in W^{1,q^{\prime}}(\Omega):\exists\,c_{y}\in\mathbb{R}\text{ with }(\prescript{*\!}{}{\!A}p)(y)=\overline{a}(y,p)\leq c_{y}\|p\|_{C(\overline{\Omega})}\ \forall p\in\operatorname{dom}\prescript{*\!}{}{\!A}\}.$$

$$\prescript{*\!}{}{B}:C(\overline{\Omega})\to C({\overline{\omega}_{c}}),\qquad(\prescript{*\!}{}{B}v)(x)=v(x)\quad\forall x\in{\overline{\omega}_{c}},$$

$$B:\mathcal{M}({\overline{\omega}_{c}})\to\mathcal{M}(\overline{\Omega})$$

$$E:W^{1,q^{\prime}}(\Omega)\supset\operatorname{dom}E\to L^{2}({\omega_{o}}),$$

$$E_{H^{1}}:=E|_{H^{1}}:(H^{1}(\Omega),\|\cdot\|_{H^{1}})\to L^{2}({\omega_{o}}),$$

$$\prescript{*\!}{}{E}:L^{2}({\omega_{o}})\supset\operatorname{dom}\prescript{*\!}{}{E}\to W^{1,q^{\prime}}(\Omega)^{*},$$

$$\prescript{*\!}{}{S}:L^{2}({\omega_{o}})\supset\operatorname{dom}\prescript{*\!}{}{S}\to C({\overline{\omega}_{c}}),\qquad h\mapsto\prescript{*\!}{}{B}\prescript{*\!}{}{\!A}^{-1}\prescript{*\!}{}{E}h,$$

$$S:\mathcal{M}({\overline{\omega}_{c}})\supset\operatorname{dom}S\to L^{2}({\omega_{o}}).$$

$$\operatorname{dom}EA^{-1}B:=\{u\in\mathcal{M}({\overline{\omega}_{c}}):A^{-1}Bu\in\operatorname{dom}E\}=\operatorname{dom}S\supset L^{2}({\overline{\omega}_{c}}).$$

$$\langle u,\prescript{*\!}{}{S}h\rangle_{\mathcal{M}({\overline{\omega}_{c}}),C({\overline{\omega}_{c}})}=\langle A^{-1}Bu,\prescript{*\!}{}{E}h\rangle_{W^{1,q^{\prime}}(\Omega),W^{1,q^{\prime}}(\Omega)^{*}}\quad\text{for all }h\in\operatorname{dom}\prescript{*\!}{}{S},\ u\in\mathcal{M}({\overline{\omega}_{c}}).$$

$$\langle Su,h\rangle_{L^{2}({\omega_{o}})}=\langle u,\prescript{*\!}{}{S}h\rangle_{\mathcal{M},C}\quad\text{for all }u\in\operatorname{dom}S,\ h\in\operatorname{dom}\prescript{*\!}{}{S},$$

$$S_{H^{1}}:=E_{H^{1}}A_{H^{1}}^{-1}B_{H^{1}}:L^{2}({\overline{\omega}_{c}})\to L^{2}({\omega_{o}})\quad\text{and}\quad\prescript{*\!}{}{S}_{H^{1}}:=S_{H^{1}}^{*}:L^{2}({\omega_{o}})\to L^{2}({\overline{\omega}_{c}}).$$

$$\min_{u\in\mathcal{M}({\overline{\omega}_{c}})}\ \frac{1}{2}\|Su-y_{d}\|_{L^{2}({\omega_{o}})}^{2}+\delta_{\mathcal{M}({\overline{\omega}_{c}})^{+}}(u),$$

$$\mathcal{M}({\overline{\omega}_{c}})^{+}:=\{u\in\mathcal{M}({\overline{\omega}_{c}}):\langle u,\varphi\rangle_{\mathcal{M},C}\geq 0\text{ for all }\varphi\in C({\overline{\omega}_{c}}),\ \varphi\geq 0\}.$$

$$(\prescript{*\!}{}{S}h)(x)\geq\varepsilon>0\quad\text{for all }x\in{\overline{\omega}_{c}}.$$

$$\varepsilon\|u_{n}\|_{\mathcal{M}({\overline{\omega}_{c}})}\leq\|Su_{n}\|_{L^{2}({\omega_{o}})}\|h\|_{L^{2}({\omega_{o}})}\leq C,$$

$$a(y,p)=\langle h,Ey\rangle_{L^{2}({\omega_{o}})}\quad\text{for all }y\in\operatorname{dom}E$$

$$a(y,p)\geq 0\quad\text{for all }y\in H^{1}(\Omega),\ y\geq 0\quad\Rightarrow\quad p\geq 0.$$

$$a(y,p)\geq 0\quad\text{for all }y\in H_{0}^{1}(\Omega),\ y\geq 0.$$

Code & Models

Repositories

clason/positivecontrol (official)

Full text

Optimal control of elliptic equations with positive measures

Christian Clason, Faculty of Mathematics, University Duisburg-Essen, 45117 Essen, Germany

Anton Schiela, Institute of Mathematics, University of Bayreuth, 95440 Bayreuth, Germany

(September 7, 2015)

1 Introduction

This work is concerned with the following optimal control problem, stated formally as

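$$\inf_{y,u}\ \frac{1}{2}\|Ey-y_{d}\|_{L^{2}({\omega_{o}})}^{2}\quad\text{s.\,t.}\quad Ay-Bu=0,\quad u\geq 0,\tag{1}$$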

where $A$ is a second-order elliptic differential operator and $y_{d}$ is a given target. Furthermore, ${\omega_{o}}\subset\overline{\Omega}\subset\mathbb{R}^{d}$ is the observation domain with corresponding restriction operator $E$, and the control is defined on a control domain ${\overline{\omega}_{c}}\subset\overline{\Omega}$ with corresponding extension operator $B$. (This setting includes boundary control and observation; for details we refer to Section 2.)

Problem (1) differs from standard control-constrained optimal control problems by the fact that no control cost term, e.g., of the form $\alpha\|u\|_{U}^{2}$ or $\|u\|_{U}$ with $\alpha>0$ and a suitable Banach space $U$, appears in the functional. This term is usually necessary to guarantee existence of an optimal solution $(\bar{y},\bar{u})$, since it provides coercivity of the objective functional in the appropriate topology. Consequently, one of the major issues in this work is the existence of minimizers of this problem. As we will show, the non-negativity together with the tracking term is sufficient (under an appropriate assumption on the operator $A$) to obtain coercivity with respect to $u$, albeit only in the space of measures. Intuitively, boundedness of $y=A^{-1}Bu$ in $L^{2}$ implies boundedness of $Bu$ only in $H^{-2}$, which is all one can expect in general without control constraints. It is thus surprising that in many cases optimal controls exist in the more regular space $\mathcal{M}$ of Radon measures if merely unilateral constraints are present. This makes it possible to formulate, analyze, and numerically solve the limit problem as $\alpha\to 0$ in the above-mentioned standard problems with unilateral constraints, which is the main motivation of this work.

Once existence of optimal controls is established, first-order optimality conditions can be derived via Fenchel duality. This is relatively straightforward in those cases where the control-to-observation mapping $u\mapsto Ey$ is continuous as a mapping $\mathcal{M}({\overline{\omega}_{c}})\to L^{2}({\omega_{o}})$. However, due to the low regularity of the control, this assumption is not satisfied for all relevant applications (e.g., Neumann control in three dimensions; similar difficulties are to be expected for parabolic problems). These cases require special care since they involve unbounded operators. A second motivation of this work is therefore to extend the Fenchel duality theorem to this setting.

Let us remark on some related problems. Recently, a class of elliptic problems has attracted interest in which control costs of the form $\alpha\|u\|_{L^{1}}$ are used and which possess generalized solutions $u\in\mathcal{M}$; see [Clason:2010a, Clason:2011a, Clason:2012, Casas:2013]. In particular, we rely on the first three works for the numerical computation of our optimal measure space controls using a semismooth Newton method and a conforming finite element discretization of $\mathcal{M}$. Often such functionals are still augmented by an additional $L^{2}$-type control cost as well as bilateral control constraints, and the limit $\beta\to 0$ is considered; see, e.g., [Stadler:2007a, Wachsmuth:2009]. A second related problem class is that of so-called bang-bang problems [Hinze:2012], where no control costs are present, but the control constraints are bilateral, so that optimal solutions exist in $L^{\infty}$. Finally, due to the presence of measure-valued controls, we will have to define the operator $A$ in such a way that $Ay=\mu$ has a unique solution for each $\mu\in\mathcal{M}$. This requires an extension of the usual variational setting in $H^{1}$. In this respect, our paper draws from results in the literature; see [Schiela:2010] and the references therein. It also provides a link to the study of state-constrained problems [Casas85], where measure-valued right-hand sides appear in first-order optimality conditions.

This work is organized as follows. Section 2 discusses well-posedness of the state equation for measure-valued right-hand sides. In Section 3, we give a rigorous statement of Problem (1) and show that under a strict positivity assumption on the adjoint control-to-observation mapping, a minimizer to (1) exists in the space of Radon measures; we discuss the validity of this assumption in the context of second-order elliptic equations in Section 3.1. Section 3.2 gives some examples as well as a counterexample that shows the necessity of our assumption. Optimality conditions for these minimizers are derived in Section 4 based on a Fenchel duality theorem for an unbounded operator. In Section 5, we remark on the relation of Problem (1) to the corresponding problems including additional $L^{2}$ or measure-space control costs. The numerical solution based on a variational discretization and a semismooth Newton method is discussed in Section 6. Finally, numerical examples are presented in Section 7.

2 State equation

We first discuss well-posedness of the control-to-observation mapping $u\mapsto Ey$ . Since $u$ is only a Radon measure and $E$ need not be continuous, this requires some technicalities. In particular, due to the presence of the non-reflexive spaces $C$ and $\mathcal{M}$ it will be useful to start with defining the pre-adjoint operators of $A$ and $B$ .

Elliptic differential operator $A$

Consider a bounded domain (i.e., an open connected subset) $\Omega\subset\mathbb{R}^{d}$ with Lipschitz boundary $\partial\Omega$ , so that the trace operator $H^{1}(\Omega)\to L^{2}(\partial\Omega)$ is well-defined. Let $a(\cdot,\cdot):H^{1}(\Omega)\times H^{1}(\Omega)\to\mathbb{R}$ be a continuous and elliptic bilinear form, defined by

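$$a(y,p):=\int_{\Omega}\Big[\sum_{i,j=1}^{d}a_{ij}(x)\,y_{x_{i}}p_{x_{j}}+c(x)\,y\,p\Big]\,dx+\int_{\partial\Omega}r(x)\,y\,p\,ds,\tag{2}$$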

where subsequently we assume that the coefficients are symmetric (i.e., $a_{ij}=a_{ji}$ ) and bounded on $\Omega$ , and that $c$ and $r$ are non-negative bounded functions in $\Omega$ and $\partial\Omega$ , respectively. Furthermore, assume that there exists $a_{0}>0$ such that

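$$\sum_{i,j=1}^{d}a_{ij}(x)\,\xi_{i}\xi_{j}\geq a_{0}|\xi|^{2}\quad\text{for all }\xi\in\mathbb{R}^{d}\text{ and almost all }x\in\Omega.\tag{3}$$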

We assume further that not both $c$ and $r$ are identically zero. As usual, it follows by the Poincaré inequality that $a$ is coercive, i.e., there exists $c_{1}>0$ such that

$$a(y,y)\geq c_{1}\|y\|_{H^{1}(\Omega)}^{2}\quad\text{for all }y\in H^{1}(\Omega).\tag{4}$$

Alternatively, we could impose Dirichlet boundary conditions on (part of) $\partial\Omega$ to obtain coercivity. However, in the following discussion we stick to the case $H^{1}(\Omega)$ , mainly for simplicity of presentation.

It then follows from the Lax–Milgram theorem that for each $\ell\in H^{1}(\Omega)^{*}$ , there is a unique $y\in H^{1}(\Omega)$ , such that $a(y,p)=\ell(p)$ for all $p\in H^{1}(\Omega)$ . In this way, the well-known isomorphism $A_{H^{1}}:H^{1}(\Omega)\to H^{1}(\Omega)^{*}$ is constructed via $(A_{H^{1}}y)(p):=a(y,p)$ .
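As a concrete illustration, consider the following model case. It is a sketch under simplifying assumptions that are not taken from the original text: $a_{ij}=\delta_{ij}$, $c\equiv 1$, and $r\equiv 0$. The bilinear form then reduces to the $H^{1}(\Omega)$ inner product, so the ellipticity and coercivity constants can be read off directly:

```latex
% Assumed model case (illustration only): a_{ij} = \delta_{ij}, c \equiv 1, r \equiv 0
\begin{align*}
a(y,p) &= \int_\Omega \nabla y\cdot\nabla p + y\,p \,dx = (y,p)_{H^1(\Omega)},\\
\sum_{i,j=1}^d a_{ij}(x)\,\xi_i\xi_j &= |\xi|^2
  \qquad\text{(ellipticity with } a_0 = 1\text{)},\\
a(y,y) &= \|y\|_{H^1(\Omega)}^2
  \qquad\text{(coercivity with } c_1 = 1\text{)}.
\end{align*}
```

In this special case the Lax–Milgram theorem reduces to the Riesz representation theorem in $H^{1}(\Omega)$, and $a(y,p)=\ell(p)$ is the weak form of $-\Delta y+y=\ell$ with homogeneous Neumann boundary conditions.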

Extension to measure-valued right-hand sides

Our next aim is to define a version of this operator that covers elliptic PDEs with measure-valued right-hand sides. For $d\geq 2$, this does not fit into the classical variational framework. Following the method of Stampacchia [Stampacchia:1965a], we will therefore first construct an unbounded pre-dual operator $\prescript{*\!}{}{A}$ with domain contained in $C(\overline{\Omega})$, and then consider its adjoint $A:=(\prescript{*\!}{}{A})^{*}$, whose co-domain is then, by definition, the dual of $C(\overline{\Omega})$, which can be identified by the Riesz representation theorem with the space of Radon measures $\mathcal{M}(\overline{\Omega})$. The following construction is similar to the one given in [Schiela:2010]; our main reference concerning unbounded operators is [Goldberg:2006].
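To see why the classical framework is too small, consider a point measure $\delta_{x_{0}}$ with $x_{0}\in\overline{\Omega}$ (an illustrative example that is not part of the original text). For $d\geq 2$, functions in $H^{1}(\Omega)$ need not be continuous, so point evaluation is not a bounded functional on $H^{1}(\Omega)$. On continuous functions, in contrast, it is bounded with norm one:

```latex
% Point evaluation as a Radon measure acting on continuous functions
\begin{align*}
\langle \delta_{x_0}, p\rangle_{\mathcal{M},C} &= p(x_0), &
|\langle \delta_{x_0}, p\rangle_{\mathcal{M},C}| &\le \|p\|_{C(\overline\Omega)}
\quad\text{for all } p\in C(\overline\Omega).
\end{align*}
```

This suggests taking test functions from a space that embeds into $C(\overline{\Omega})$, which is the role played by the pre-dual operator in the construction.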

Consider an index $q>d$ (the spatial dimension), so that $W^{1,q}(\Omega)\hookrightarrow C(\overline{\Omega})$ , and its dual index $q^{\prime}$ which satisfies $q^{-1}+q^{\prime-1}=1$ . By Hölder’s inequality applied to the derivatives, $a(\cdot,\cdot)$ is still well-defined and continuous as a bilinear form

$$a(\cdot,\cdot):W^{1,q^{\prime}}(\Omega)\times W^{1,q}(\Omega)\to\mathbb{R}.\tag{5}$$
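The continuity of $a$ on $W^{1,q^{\prime}}(\Omega)\times W^{1,q}(\Omega)$ can be sketched term by term. The following estimates are a reconstruction of the standard argument, not quoted from the original; they use the boundedness of the coefficients and the embedding $W^{1,q}(\Omega)\hookrightarrow C(\overline{\Omega})$ for $q>d$:

```latex
% Principal part: Hoelder's inequality with 1/q + 1/q' = 1
% Lower-order part: p is bounded because W^{1,q} embeds into C for q > d
\begin{align*}
\Big|\int_\Omega a_{ij}\, y_{x_i}\, p_{x_j}\,dx\Big|
 &\le \|a_{ij}\|_{L^\infty(\Omega)}\,\|y_{x_i}\|_{L^{q'}(\Omega)}\,\|p_{x_j}\|_{L^{q}(\Omega)},\\
\Big|\int_\Omega c\, y\, p\,dx\Big|
 &\le \|c\|_{L^\infty(\Omega)}\,\|y\|_{L^{1}(\Omega)}\,\|p\|_{C(\overline\Omega)}
 \le \|c\|_{L^\infty(\Omega)}\,|\Omega|^{1/q}\,\|y\|_{L^{q'}(\Omega)}\,\|p\|_{C(\overline\Omega)}.
\end{align*}
```

The boundary term can be treated like the lower-order term, using that the trace of $y\in W^{1,q^{\prime}}(\Omega)$ lies in $L^{1}(\partial\Omega)$ on a Lipschitz domain and that $p$ is bounded on $\partial\Omega$.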

Let us define a domain $\operatorname{dom}\prescript{*\!}{}{\!A}\subset H^{1}(\Omega)$ (often called “maximal domain of definition”) and a bijective mapping $\prescript{*\!}{}{\!A}:\operatorname{dom}\prescript{*\!}{}{\!A}\to W^{1,q^{\prime}}(\Omega)^{*}$ in the following way:

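$$\operatorname{dom}\prescript{*\!}{}{\!A}:=\{p\in H^{1}(\Omega):\exists\,c_{p}\in\mathbb{R}\text{ with }a(y,p)\leq c_{p}\|y\|_{W^{1,q^{\prime}}(\Omega)}\ \forall y\in H^{1}(\Omega)\}.\tag{6}$$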

Let us stress that here (and on similar occasions) the bound $c_{p}$ may depend on $p$ but not on $y$.

By (5), we conclude that $H^{1}(\Omega)\supset\operatorname{dom}\prescript{*\!}{}{\!A}\supset W^{1,q}(\Omega)$, and under relatively mild assumptions on the smoothness of the coefficients and on the domain, regularity theory even yields $\operatorname{dom}\prescript{*\!}{}{\!A}=W^{1,q}(\Omega)\hookrightarrow C(\overline{\Omega})$ if $q$ is sufficiently close to $d$; see, e.g., [Troianiello:1987a, Theorem 3.16]. This is called the case of “maximal regularity”. In fact, for $d=2$, it is always possible to find an appropriate $q$. In this case we can define $\prescript{*\!}{}{A}$ as follows:

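$$(\prescript{*\!}{}{\!A}p)(y):=a(y,p)\quad\text{for all }y\in W^{1,q^{\prime}}(\Omega),\ p\in\operatorname{dom}\prescript{*\!}{}{\!A}=W^{1,q}(\Omega).\tag{7}$$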

Otherwise, if $\operatorname{dom}\prescript{*\!}{}{\!A}$ is a proper superset of $W^{1,q}(\Omega)$, the bilinear form $a(y,p)$ is no longer defined for all $y\in W^{1,q^{\prime}}(\Omega)$ and $p\in\operatorname{dom}\prescript{*\!}{}{\!A}$ due to lack of integrability of the principal part. However, by the definition of $\operatorname{dom}\prescript{*\!}{}{\!A}$ in (6), we can extend $a(\cdot,\cdot)$ to a bilinear form $\overline{a}(\cdot,\cdot):W^{1,q^{\prime}}(\Omega)\times\operatorname{dom}\prescript{*\!}{}{\!A}\to\mathbb{R}$ via the unique continuous extension

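$$\overline{a}(y,p):=\lim_{n\to\infty}a(y_{n},p)\quad\text{for all }(y,p)\in W^{1,q^{\prime}}(\Omega)\times\operatorname{dom}\prescript{*\!}{}{\!A},\tag{8}$$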

where $\{y_{n}\}_{n\in{\mathbb{N}}}$ is a sequence in $H^{1}(\Omega)$ such that $y_{n}\to y$ in $W^{1,q^{\prime}}(\Omega)$ . By density of $H^{1}(\Omega)$ in $W^{1,q^{\prime}}(\Omega)$ , such a sequence always exists, and by definition of $\operatorname{dom}\prescript{*\!}{}{A}$ in (6), the limit of $a(y_{n},p)$ always exists and depends only on the limit $y$ .
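The existence and uniqueness of this limit can be spelled out as follows (a short sketch of the standard argument; the second approximating sequence $\tilde{y}_{n}$ is a name introduced here for illustration). Applying the bound in (6) to both $y$ and $-y$ gives $|a(y,p)|\leq c_{p}\|y\|_{W^{1,q^{\prime}}(\Omega)}$ for all $y\in H^{1}(\Omega)$, and hence:

```latex
% (i) a(y_n, p) is a Cauchy sequence in R, so the limit exists
% (ii) two approximating sequences of the same y give the same limit
\begin{align*}
|a(y_n,p)-a(y_m,p)| = |a(y_n-y_m,p)|
 &\le c_p\,\|y_n-y_m\|_{W^{1,q'}(\Omega)} \to 0
 \quad\text{as } n,m\to\infty,\\
|a(y_n,p)-a(\tilde y_n,p)| = |a(y_n-\tilde y_n,p)|
 &\le c_p\,\|y_n-\tilde y_n\|_{W^{1,q'}(\Omega)} \to 0
 \quad\text{as } n\to\infty.
\end{align*}
```

Passing to the limit in the same bound also shows $|\overline{a}(y,p)|\leq c_{p}\|y\|_{W^{1,q^{\prime}}(\Omega)}$, so $\overline{a}(\cdot,p)$ is a continuous linear functional on $W^{1,q^{\prime}}(\Omega)$.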

Under very mild assumptions, it is still possible to show $\operatorname{dom}\prescript{*\!}{}{\!A}\subset C(\overline{\Omega})$ (see, e.g., [Rehberg:2009, Theorem 3.3, Corollary 3.5, Corollary 3.6]), so that we obtain:

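$$\prescript{*\!}{}{\!A}:C(\overline{\Omega})\supset\operatorname{dom}\prescript{*\!}{}{\!A}\to W^{1,q^{\prime}}(\Omega)^{*},\qquad(\prescript{*\!}{}{\!A}p)(y):=\overline{a}(y,p)\quad\text{for all }y\in W^{1,q^{\prime}}(\Omega).\tag{9}$$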

In both cases $\prescript{*\!}{}{\!A}$ is a bijective, closed, unbounded operator (cf. [Schiela:2010]) and thus has a continuous inverse $\prescript{*\!}{}{\!A}^{-1}$ by the open mapping theorem for closed operators; see, e.g., [Goldberg:2006, II.1.8]. In what follows only this more general setting is required, keeping in mind, however, that $\prescript{*\!}{}{\!A}$ (and thus also its adjoint, defined next) corresponds to $\overline{a}(\cdot,\cdot)$, which only coincides with $a(\cdot,\cdot)$ if $\operatorname{dom}\prescript{*\!}{}{\!A}=W^{1,q}(\Omega)$, cf. [Schiela:2010].

Since $\operatorname{dom}\prescript{*\!}{}{\!A}\supset W^{1,q}(\Omega)$ is dense in $C(\overline{\Omega})$ , the Banach space adjoint (also called conjugate) $A:=(\prescript{*\!}{}{\!A})^{*}$ of $\prescript{*\!}{}{\!A}$ is well-defined as a linear operator (cf., e.g., [Goldberg:2006, Def. II.2.2])

$$A:W^{1,q^{\prime}}(\Omega)\supset\operatorname{dom}A\to\mathcal{M}(\overline{\Omega}),\tag{10}$$

where $\operatorname{dom}A$ is canonically defined as

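$$\operatorname{dom}A:=\{y\in W^{1,q^{\prime}}(\Omega):\exists\,c_{y}\in\mathbb{R}\text{ with }(\prescript{*\!}{}{\!A}p)(y)=\overline{a}(y,p)\leq c_{y}\|p\|_{C(\overline{\Omega})}\ \forall p\in\operatorname{dom}\prescript{*\!}{}{\!A}\}.\tag{11}$$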

Then for any $y\in\operatorname{dom}A$ , the mapping $p\mapsto\overline{a}(y,p)$ defines a continuous linear functional on the dense subspace $\operatorname{dom}\prescript{*\!}{}{\!A}\subset C(\overline{\Omega})$ . It can thus be extended uniquely to a continuous functional $Ay$ on $C(\overline{\Omega})$ satisfying $(Ay)(p)=\overline{a}(y,p)$ for all $p\in\operatorname{dom}\prescript{*\!}{}{\!A}$ . By the Riesz representation theorem, $Ay$ can be identified with an element of $\mathcal{M}(\overline{\Omega})$ . We stress that this is the standard construction of the Banach space adjoint of an unbounded, densely defined operator. By [Goldberg:2006, Theorem II.2.6, Theorem II.4.4], the operator $A$ is also closed and continuously invertible, because $\prescript{*\!}{}{A}$ is.

We even obtain the following compactness property:

Lemma 2.1 ([Schiela:2010, Lemma 2.15]).

Consider a sequence $\{\mu_{n}\}_{n\in{\mathbb{N}}}$ that converges weakly- $*$ in $\mathcal{M}(\overline{\Omega})$ to $\mu$ . Then the sequence $\{A^{-1}\mu_{n}\}_{n\in{\mathbb{N}}}$ converges strongly in $W^{1,q^{\prime}}(\Omega)$ to $A^{-1}\mu$ .

Control operator $B$

Next, consider a compact set ${\overline{\omega}_{c}}\subset\overline{\Omega}$ such that there exists a continuous trace or embedding operator $\prescript{*\!}{}{B}_{H^{1}}:H^{1}(\Omega)\to L^{2}({\overline{\omega}_{c}})$. Here $L^{2}({\overline{\omega}_{c}})$ is defined with respect to an appropriate positive and bounded measure $\nu$ on ${\overline{\omega}_{c}}$; e.g., ${\overline{\omega}_{c}}=\overline{\Omega}$ with the Lebesgue measure for distributed control, and ${\overline{\omega}_{c}}=\partial\Omega$ with the boundary measure for boundary control. Technically, we will require in the following that $\nu({\overline{\omega}_{c}}\cap O)>0$ for any open subset $O\subset\mathbb{R}^{d}$ such that ${\overline{\omega}_{c}}\cap O$ is non-empty. This guarantees applicability of the weak-$*$ sequential density result from the appendix.

We introduce the linear and continuous restriction operator

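$$\prescript{*\!}{}{B}:C(\overline{\Omega})\to C({\overline{\omega}_{c}}),\qquad(\prescript{*\!}{}{B}v)(x)=v(x)\quad\forall x\in{\overline{\omega}_{c}},\tag{12}$$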

which coincides with the above mentioned restriction operator $\prescript{*\!}{}{B}_{H^{1}}$ on $C(\overline{\Omega})\cap H^{1}(\Omega)$ , this space being dense in both $C(\overline{\Omega})$ and $H^{1}(\Omega)$ .

Its adjoint $B:=(\prescript{*\!}{}{B})^{*}$ can be interpreted (via the Riesz representation theorem) as a mapping

$$B:\mathcal{M}({\overline{\omega}_{c}})\to\mathcal{M}(\overline{\Omega})\tag{13}$$

acting as the extension by zero of a measure on ${\overline{\omega}_{c}}$ to a measure on $\overline{\Omega}$. On $L^{2}({\overline{\omega}_{c}})$ it coincides with the operator $B_{H^{1}}:=(\prescript{*\!}{}{B}_{H^{1}})^{*}:L^{2}({\overline{\omega}_{c}})\to H^{1}(\Omega)^{*}$. Moreover, by the density result from the appendix, the space $L^{2}({\overline{\omega}_{c}})$ is weakly-$*$ sequentially dense in $\mathcal{M}({\overline{\omega}_{c}})$.
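The action of $B$ can be written out explicitly from the definition of the adjoint. The following is a sketch in the notation used above; the Borel set $M$ is a symbol introduced here for illustration:

```latex
% B is the adjoint of the restriction operator, i.e., extension by zero
\begin{align*}
\langle Bu, v\rangle_{\mathcal{M}(\overline\Omega),C(\overline\Omega)}
 &= \langle u, \prescript{*\!}{}{B}v\rangle_{\mathcal{M}(\overline\omega_c),C(\overline\omega_c)}
  = \int_{\overline\omega_c} v\,du
 \quad\text{for all } v\in C(\overline\Omega),\\
(Bu)(M) &= u(M\cap\overline\omega_c)
 \quad\text{for all Borel sets } M\subset\overline\Omega.
\end{align*}
```

For instance, if $u=\delta_{x_{0}}$ with $x_{0}\in{\overline{\omega}_{c}}$, then $Bu$ is the same point measure regarded as a measure on $\overline{\Omega}$.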

Observation operator $E$

For the operator $E$, which will be defined on reflexive spaces, it is most convenient to start with the primal operator. Let ${\omega_{o}}\subset\overline{\Omega}$ be equipped with a suitable measure, and assume that there exists a closed (possibly unbounded) operator

$$E:W^{1,q^{\prime}}(\Omega)\supset\operatorname{dom}E\to L^{2}({\omega_{o}}),\tag{14}$$

where $\operatorname{dom}E\supset H^{1}(\Omega)$ is dense in $W^{1,q^{\prime}}(\Omega)$ . By this assumption, the restriction of $E$ to $H^{1}(\Omega)$ , i.e.,

$$E_{H^{1}}:=E|_{H^{1}}:(H^{1}(\Omega),\|\cdot\|_{H^{1}})\to L^{2}({\omega_{o}}),\tag{15}$$

is defined on all of $H^{1}(\Omega)$ . It is readily verified that $E_{H^{1}}$ is closed as well. Thus, by the closed graph theorem (see, e.g., [Goldberg:2006, II.1.9]), $E_{H^{1}}$ is even a continuous operator.

In many cases $E$ is continuous for suitable $q^{\prime}$ , and $\operatorname{dom}E=W^{1,q^{\prime}}(\Omega)$ holds, but there are also important cases where $E$ lacks continuity. Typical examples (e.g., embedding or trace operators) are discussed in detail below.

By reflexivity, we can define its adjoint $\prescript{*\!}{}{E}:=E^{*}$ as a closed operator

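$$\prescript{*\!}{}{E}:L^{2}({\omega_{o}})\supset\operatorname{dom}\prescript{*\!}{}{E}\to W^{1,q^{\prime}}(\Omega)^{*},\tag{16}$$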

since in this case $(\prescript{*\!}{}{E})^{*}=E^{**}=E$ . Like all adjoints of closed operators in reflexive spaces, $\prescript{*\!}{}{E}$ has a dense domain; see, e.g., [Goldberg:2006, Theorem II.2.14]. Comparison with $\prescript{*\!}{}{E_{H^{1}}}:=E^{*}_{H^{1}}$ yields that $\prescript{*\!}{}{E_{H^{1}}}h=\prescript{*\!}{}{E}h$ for every $h$ for which the latter is defined, i.e., for $h\in\operatorname{dom}\prescript{*\!}{}{E}$ . Thus, the continuous operator $\prescript{*\!}{}{E}_{H^{1}}$ can be considered as the unique continuous extension of $\prescript{*\!}{}{E}$ after the co-domain space has been extended from $W^{1,q^{\prime}}(\Omega)^{*}$ to $H^{1}(\Omega)^{*}$ (and renormed).

Control-to-observation mapping $S$

Finally, we define

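$$\prescript{*\!}{}{S}:L^{2}({\omega_{o}})\supset\operatorname{dom}\prescript{*\!}{}{S}\to C({\overline{\omega}_{c}}),\qquad h\mapsto\prescript{*\!}{}{B}\prescript{*\!}{}{\!A}^{-1}\prescript{*\!}{}{E}h,\tag{17}$$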

where $\operatorname{dom}\prescript{*\!}{}{S}:=\operatorname{dom}\prescript{*\!}{}{E}$ is dense in $L^{2}({\omega_{o}})$ by our above assumptions. This mapping is well-defined, since $\prescript{*\!}{}{B}\prescript{*\!}{}{\!A}^{-1}:W^{1,q^{\prime}}(\Omega)^{*}\to C({\overline{\omega}_{c}})$ is a continuous operator, defined on all of $W^{1,q^{\prime}}(\Omega)^{*}$ . Since the adjoint of a densely defined (unbounded) linear operator is closed, see, e.g., [Goldberg:2006, Theorem II.2.6], $S:=(\prescript{*\!}{}{S})^{*}$ is a closed operator

$$S:\mathcal{M}({\overline{\omega}_{c}})\supset\operatorname{dom}S\to L^{2}({\omega_{o}}).\tag{18}$$

Since $E$ may be unbounded, the following assertion is not obvious.

Lemma 2.2.

It holds that

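$$\operatorname{dom}EA^{-1}B:=\{u\in\mathcal{M}({\overline{\omega}_{c}}):A^{-1}Bu\in\operatorname{dom}E\}=\operatorname{dom}S\supset L^{2}({\overline{\omega}_{c}})\tag{19}$$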

and $S=EA^{-1}B$ . Furthermore, $S$ is weakly- $*$ closed, i.e., if $u_{n}\rightharpoonup^{*}u$ in $\mathcal{M}({\overline{\omega}_{c}})$ and $h_{n}\rightharpoonup h$ in $L^{2}({\omega_{o}})$ with $Su_{n}=h_{n}$ , then $Su=h$ .

Proof 2.3.

By purely algebraic arguments we have for $u\in\operatorname{dom}S\cap\operatorname{dom}EA^{-1}B$ that $Su=EA^{-1}Bu$ since then both sides of the equality are well-defined. Thus, we have to prove the equality of their domains, using the definition of $\operatorname{dom}EA^{-1}B$ in (19). By continuity of $\prescript{*\!}{}{B}\prescript{*\!}{}{\!A}^{-1}$ we conclude

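$$\langle u,\prescript{*\!}{}{S}h\rangle_{\mathcal{M}({\overline{\omega}_{c}}),C({\overline{\omega}_{c}})}=\langle A^{-1}Bu,\prescript{*\!}{}{E}h\rangle_{W^{1,q^{\prime}}(\Omega),W^{1,q^{\prime}}(\Omega)^{*}}\quad\text{for all }h\in\operatorname{dom}\prescript{*\!}{}{S},\ u\in\mathcal{M}({\overline{\omega}_{c}}).\tag{20}$$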

By definition of domains of adjoints, $u\in\operatorname{dom}S$ iff $\langle u,\prescript{*\!}{}{S}h\rangle_{\mathcal{M}({\overline{\omega}_{c}}),C({\overline{\omega}_{c}})}\leq c_{u}\|h\|_{L^{2}({\omega_{o}})}$ , and $A^{-1}Bu\in\operatorname{dom}{E}$ iff $\langle A^{-1}Bu,\prescript{*\!}{}{E}h\rangle_{W^{1,q^{\prime}}(\Omega),W^{1,q^{\prime}}(\Omega)^{*}}\leq c_{A^{-1}Bu}\|h\|_{L^{2}({\omega_{o}})}$ . By (20), $c_{u}=c_{A^{-1}Bu}$ , and hence the domains coincide.

The last inclusion in (19) follows from the fact that for $u\in L^{2}({\overline{\omega}_{c}})$ , we have $A^{-1}Bu\in H^{1}(\Omega)\subset\operatorname{dom}E$ . This in turn is a consequence of $Bu\in H^{1}(\Omega)^{*}$ , so that $A^{-1}Bu$ coincides with the variational solution of the state equation.

By Lemma 2.1, weak- $*$ convergence of $u_{n}$ implies strong convergence of $A^{-1}Bu_{n}$ in $W^{1,q^{\prime}}(\Omega)$ . Since $E$ is closed, it is also weakly closed (since its graph is a convex closed set, thus weakly closed). Hence, $A^{-1}Bu_{n}\to A^{-1}Bu$ and $h_{n}\rightharpoonup h$ with $Su_{n}=h_{n}$ imply $Su=EA^{-1}Bu=h$ .

We remark for later reference that by definition of adjoints, we have that

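$$\langle Su,h\rangle_{L^{2}({\omega_{o}})}=\langle u,\prescript{*\!}{}{S}h\rangle_{\mathcal{M},C}\quad\text{for all }u\in\operatorname{dom}S,\ h\in\operatorname{dom}\prescript{*\!}{}{S},\tag{21}$$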

where here and in the following, we have omitted the domains from the spaces appearing in duality pairings if they are clear from the context. Also, by definition of $\operatorname{dom}S$ , for $u\notin\operatorname{dom}S$ there exists a bounded sequence $h_{n}$ in $\operatorname{dom}\prescript{*\!}{}{S}$ such that $\langle u,\prescript{*\!}{}{S}h_{n}\rangle_{\mathcal{M},C}\to\infty$ .

Finally, we remark that $\operatorname{dom}S$ is weak-$*$ sequentially dense in $\mathcal{M}({\overline{\omega}_{c}})$. This follows via $\operatorname{dom}S\supset L^{2}({\overline{\omega}_{c}})$, using the density result from the appendix, which states that $L^{2}({\overline{\omega}_{c}})$ is weakly-$*$ sequentially dense in $\mathcal{M}({\overline{\omega}_{c}})$. In particular, $\langle u,\varphi\rangle_{\mathcal{M},C}=0$ for all $u\in\operatorname{dom}S$ implies $\langle u,\varphi\rangle_{\mathcal{M},C}=0$ for all $u\in\mathcal{M}({\overline{\omega}_{c}})$ and thus $\varphi=0$ as an element of $C({\overline{\omega}_{c}})$.

Using $B_{H^{1}}$ and $E_{H^{1}}$ , we complement the measure-space operators $S$ and $\prescript{*\!}{}{S}$ by their “standard” counterparts, i.e., the continuous mappings

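$$S_{H^{1}}:=E_{H^{1}}A_{H^{1}}^{-1}B_{H^{1}}:L^{2}({\overline{\omega}_{c}})\to L^{2}({\omega_{o}})\quad\text{and}\quad\prescript{*\!}{}{S}_{H^{1}}:=S_{H^{1}}^{*}:L^{2}({\omega_{o}})\to L^{2}({\overline{\omega}_{c}}).\tag{22}$$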

The operator $S_{H^{1}}$ is a restriction of $S$ and coincides with it on $L^{2}({\overline{\omega}_{c}})$ . In contrast, $\prescript{*\!}{}{S}_{H^{1}}$ is an extension of $\prescript{*\!}{}{S}$ and is defined on all of $L^{2}({\omega_{o}})$ and not only on $\operatorname{dom}\prescript{*\!}{}{S}$ . This is possible because $\prescript{*\!}{}{S}_{H^{1}}$ has a larger co-domain $L^{2}({\overline{\omega}_{c}})\supset C({\overline{\omega}_{c}})$ .

3 Existence of minimizers

Using the control-to-observation operator, we can state Problem (1) in reduced form as

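$$\min_{u\in\mathcal{M}({\overline{\omega}_{c}})}\ \frac{1}{2}\|Su-y_{d}\|_{L^{2}({\omega_{o}})}^{2}+\delta_{\mathcal{M}({\overline{\omega}_{c}})^{+}}(u),\tag{P}$$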

where $\delta_{\mathcal{M}({\overline{\omega}_{c}})^{+}}$ denotes the indicator function of the positive cone in $\mathcal{M}({\overline{\omega}_{c}})$ , i.e.,

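$$\mathcal{M}({\overline{\omega}_{c}})^{+}:=\{u\in\mathcal{M}({\overline{\omega}_{c}}):\langle u,\varphi\rangle_{\mathcal{M},C}\geq 0\text{ for all }\varphi\in C({\overline{\omega}_{c}}),\ \varphi\geq 0\}.$$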

We now address existence of minimizers to (P), which requires an assumption on the control-to-observation operator which we call a pre-dual Slater condition. Since this operator is defined via duality, it will be seen that it is natural to formulate this assumption in terms of the pre-adjoint $\prescript{*\!}{}{S}$ .

Assumption 3.1 (Pre-dual Slater condition).

There exists a function $h\in\operatorname{dom}\prescript{*\!}{}{S}\subset L^{2}({\omega_{o}})$ such that $\prescript{*\!}{}{S}h\in C({\overline{\omega}_{c}})$ is strictly positive, i.e., there is $\varepsilon>0$ such that

$$(\prescript{*\!}{}{S}h)(x)\geq\varepsilon>0\quad\text{for all }x\in{\overline{\omega}_{c}}.$$

Since $\prescript{*\!}{}{S}=\prescript{*\!}{}{B}\prescript{*\!}{}{\!A}^{-1}\prescript{*\!}{}{E}$, Assumption 3.1 claims the existence of a function $h\in L^{2}({\omega_{o}})$ such that the solution $p$ of the equation $\prescript{*\!}{}{\!A}p=\prescript{*\!}{}{E}h$ is a continuous function and satisfies $\prescript{*\!}{}{B}p\geq\varepsilon>0$. We are thus looking for solutions of elliptic equations that are strictly positive (on parts of the domain).
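A simple situation in which the assumption can be checked by hand is the following. It is a sketch under assumptions made here for illustration only: pure Neumann conditions ($r\equiv 0$), a constant coefficient $c\equiv c_{0}>0$, distributed observation ${\omega_{o}}=\Omega$ with $Ey=y$, and the choice $h\equiv 1$. Then the constant function $p\equiv 1/c_{0}$ solves the pre-adjoint equation, because its gradient vanishes:

```latex
% Assumed setting (illustration only): r = 0, c = c_0 > 0 constant,
% omega_o = Omega, E y = y, h = 1, candidate p = 1/c_0
\begin{align*}
a(y,p) &= \int_\Omega \sum_{i,j=1}^d a_{ij}\, y_{x_i}\, p_{x_j}\,dx
          + \int_\Omega c_0\, y\, p\,dx
        = \int_\Omega y\,dx
        = \langle h, Ey\rangle_{L^2(\Omega)},\\
(\prescript{*\!}{}{B}p)(x) &= \frac{1}{c_0} =: \varepsilon > 0
 \quad\text{for all } x\in\overline\omega_c.
\end{align*}
```

Here $h\in\operatorname{dom}\prescript{*\!}{}{E}$ because $|\int_{\Omega}y\,dx|\leq|\Omega|^{1/q}\|y\|_{L^{q^{\prime}}(\Omega)}$ by Hölder's inequality, and $p\in W^{1,q}(\Omega)\subset\operatorname{dom}\prescript{*\!}{}{\!A}$. In this setting the condition therefore holds for any choice of control domain ${\overline{\omega}_{c}}\subset\overline{\Omega}$.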

Using this assumption, we can show that a minimizing sequence is bounded in a sufficiently strong topology.

Lemma 3.2.

If Assumption 3.1 holds, then any minimizing sequence $\{u_{n}\}_{n\in{\mathbb{N}}}\subset\mathcal{M}({\overline{\omega}_{c}})$ for (P) is bounded in $\mathcal{M}({\overline{\omega}_{c}})$ with $\{Su_{n}\}_{n\in{\mathbb{N}}}$ bounded in $L^{2}({\omega_{o}})$.

Proof 3.3.

First, note that the non-negativity constraint and coercivity of the tracking term imply, respectively, that $u_{n}\geq 0$ for all $n\in{\mathbb{N}}$ and that $\{Su_{n}\}_{n\in{\mathbb{N}}}$ is bounded in $L^{2}({\omega_{o}})$ (and in particular, that $\{u_{n}\}_{n\in{\mathbb{N}}}\subset\operatorname{dom}S$). Using Assumption 3.1 and identifying $\varepsilon>0$ with the constant function $\varepsilon\mathds{1}(x)\in C({\overline{\omega}_{c}})$, we thus deduce from the definition of the total variation norm of a non-negative measure that

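$$\varepsilon\|u_{n}\|_{\mathcal{M}({\overline{\omega}_{c}})}=\langle u_{n},\varepsilon\rangle_{\mathcal{M},C}\leq\langle u_{n},\prescript{*\!}{}{S}h\rangle_{\mathcal{M},C}=\langle Su_{n},h\rangle_{L^{2}({\omega_{o}})}\leq\|Su_{n}\|_{L^{2}({\omega_{o}})}\|h\|_{L^{2}({\omega_{o}})}\leq C,$$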

and hence the claimed boundedness follows.

With this, we obtain existence of a minimizer by Tonelli’s direct method.

Theorem 3.4.

Under the above assumptions, there exists a minimizer $\bar{u}\in\mathcal{M}({\overline{\omega}_{c}})$ of (P) such that $S\bar{u}\in L^{2}({\omega_{o}})$ . If $S$ is injective, $\bar{u}$ is unique.

Proof 3.5.

Let $\{u_{n}\}_{n\in{\mathbb{N}}}\subset\mathcal{M}({\overline{\omega}_{c}})$ be a minimizing sequence for (P), which is bounded in $\mathcal{M}({\overline{\omega}_{c}})$ by Lemma 3.2. Since $C({\overline{\omega}_{c}})$ is separable, the Banach–Alaoglu theorem yields existence of a subsequence converging weakly- $*$ to some $\bar{u}\in\mathcal{M}({\overline{\omega}_{c}})$ . By boundedness of $Su_{n}$ , we may then extract another subsequence such that $Su_{n}$ converges weakly to some $z\in L^{2}({\omega_{o}})$ . By Lemma 2.2 we obtain $z=S\bar{u}$ . From weak- $*$ sequential closedness of the non-negative cone in $\mathcal{M}$ , we deduce that $\bar{u}$ is feasible and thus a minimizer of (P). Finally, strict convexity of the tracking term implies that any pair of minimizers $u_{1},u_{2}$ satisfies $Su_{1}=Su_{2}$ and hence, if $S$ is injective, $u_{1}=u_{2}$ .
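The last step of the proof can be made explicit with the parallelogram identity. In the following sketch, $J(u):=\frac{1}{2}\|Su-y_{d}\|_{L^{2}({\omega_{o}})}^{2}$ and $m$ denotes the minimal value; both names are introduced here for illustration. For two minimizers $u_{1},u_{2}$, the midpoint is again non-negative and lies in $\operatorname{dom}S$, and:

```latex
% Parallelogram identity applied to a = S u_1 - y_d and b = S u_2 - y_d
\begin{align*}
J\Big(\frac{u_1+u_2}{2}\Big)
 &= \frac12 J(u_1) + \frac12 J(u_2)
    - \frac18\,\|Su_1 - Su_2\|_{L^2(\omega_o)}^2\\
 &= m - \frac18\,\|Su_1 - Su_2\|_{L^2(\omega_o)}^2 .
\end{align*}
```

Since the left-hand side cannot be smaller than $m$, it follows that $Su_{1}=Su_{2}$, and injectivity of $S$ then gives $u_{1}=u_{2}$.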

3.1 Verification of the pre-dual Slater condition

We now discuss situations in which 3.1 can be verified. Recall that we have to show for some $h\in\operatorname{dom}\prescript{*\!}{}{E}$ the existence of a solution $p\in\operatorname{dom}\prescript{*\!}{}{\!A}$ to the equation

[TABLE]

such that $\prescript{*\!}{}{B}p$ is strictly positive on ${\overline{\omega}_{c}}$ . Although it is well known that elliptic PDEs have non-negative solutions for non-negative right-hand sides and boundary data, existence of a strictly positive solution is not a trivial matter and of course not satisfied in general (consider the homogeneous Dirichlet problem and ${\overline{\omega}_{c}}=\overline{\Omega}$ ). Moreover, the literature – although quite exhaustive for the Dirichlet problem – is much scarcer in the case of Neumann, Robin or even mixed boundary conditions.

We first remark that under the stated assumptions, $a(\cdot,\cdot)$ given by (2) is uniformly elliptic and hence defines a positive operator, i.e., for all $p\in H^{1}(\Omega)$ ,

[TABLE]

This already implies strict positivity on compact subsets of $\Omega$ .

Lemma 3.6.

Let $\Omega\subset\mathbb{R}^{d}$ be a domain. Assume that $p\in H^{1}(\Omega)\cap C(\overline{\Omega})$ with $p\geq 0$ satisfies $p\not\equiv 0$ and

[TABLE]

If $K\subset\Omega$ is compact, there is a $\delta>0$ such that $p\geq\delta$ on $K$ , and in particular, $p>0$ on $\Omega$ .

Note the discrepancy between $p\in H^{1}(\Omega)$ and $y\in H^{1}_{0}(\Omega)$ ; we choose this setting because it matches the setting in [GilTru1977, Chapter 8], from which we cite a crucial result: the Harnack inequality. Unfortunately, a Harnack inequality for the setting $y\in H^{1}(\Omega)$ (covering Robin, Neumann, or mixed boundary conditions explicitly) is hard to find in the literature.

Proof 3.7.

The result is a consequence of the weak Harnack inequality (cf. [GilTru1977, Theorem 8.18]), which holds for non-negative supersolutions of $a(p,\cdot)=0$ . Let $x\in\Omega$ be given and denote by $B_{r}(x)$ a ball around $x$ of radius $r$ . If $B_{4R}(x)\subset\Omega$ , then there exists a $C>0$ such that

[TABLE]

With this result, we will show that either $p\equiv 0$ or $p>0$ on $\Omega$ for any supersolution $p\geq 0$ . Since $\Omega$ is a domain, and thus open and connected, we merely have to assert that $\Omega_{0}:=\{x\in\Omega:p(x)=0\}$ is open and closed, because then either $\Omega_{0}=\Omega$ (i.e., $p\equiv 0$ ) or $\Omega_{0}=\emptyset$ (i.e. $p>0$ ). Indeed, by continuity of $p$ , $\Omega_{0}$ is (relatively) closed in $\Omega$ and by (29), every $x\in\Omega_{0}$ is contained in a ball $B_{2R}(x)\subset\Omega_{0}$ as long as $B_{4R}(x)\subset\Omega$ . Hence, $\Omega_{0}$ is open. Thus, if $p\not\equiv 0$ on $\Omega$ , we have $\Omega_{0}=\emptyset$ and so $p>0$ on $\Omega$ .

Finally, if $K\subset\Omega$ is compact, then the continuous function $p>0$ attains its minimum over $K$ at some $\underline{x}\in K$ , i.e., $p(x)\geq\delta:=p(\underline{x})>0$ for all $x\in K$ .

In what follows we denote $L^{s}(\overline{\Omega}):=L^{s}(\Omega)\times L^{s}(\partial\Omega)$ , where the first factor is equipped with the Lebesgue measure, and the second with the boundary measure; we denote the corresponding product measure by $d\overline{\nu}:=dx\times ds$ . If $M$ is any subset of $\overline{\Omega}$ , the space $L^{s}(M)$ is taken relatively to $L^{s}(\overline{\Omega})$ .

Lemma 3.6 already yields a first result. In the following, $\chi_{M}$ denotes the characteristic function of $M$ , which is identically $1$ on $M\subset\overline{\Omega}$ and $0$ on $\overline{\Omega}\setminus M$ .

Corollary 3.8.

If $\,{\overline{\omega}_{c}}$ is a compact subset of $\Omega$ and ${\omega_{o}}\subset\overline{\Omega}$ has positive measure (i.e., $\overline{\nu}({\omega_{o}})>0$ ), then 3.1 is satisfied.

Proof 3.9.

Set $h:=\chi_{{\omega_{o}}}>0$ in (24). Since $h\in L^{\infty}(\Omega\times\partial\Omega)\subset W^{1,q^{\prime}}(\Omega)^{*}$ , we have $\prescript{*\!}{}{\!A}^{-1}h\in C(\overline{\Omega})$ and thus $h\in\operatorname{dom}\prescript{*\!}{}{S}\subset L^{2}({\omega_{o}})$ . Hence, Lemma 3.6 can be applied and yields the desired result.

Next, we want to cover the general case ${\overline{\omega}_{c}}\subseteq\overline{\Omega}$ .

Lemma 3.10.

Assume that $p\in H^{1}(\Omega)$ satisfies $p\not\equiv 0$ as well as

[TABLE]

and assume moreover that there is $\delta>0$ such that for $(c,r)\in L^{\infty}(\Omega)\times L^{\infty}(\partial\Omega)$ it holds that

[TABLE]

Then $p\geq\varepsilon:=\min\left\{\delta,\|r\|^{-1}_{L^{\infty}(\partial\Omega)},\|c\|^{-1}_{L^{\infty}(\Omega)}\right\}$ .

Proof 3.11.

We insert $y:=p^{-}:=\min\{p,\varepsilon\}-\varepsilon\leq 0$ , which is in $H^{1}(\Omega)$ , into (2) and show that $p^{-}=0$ and thus $p\geq\varepsilon$ . Observe that $p\leq\varepsilon$ implies $p=p^{-}+\varepsilon$ and that $p>\varepsilon$ implies $p^{-}=0$ and $p^{-}_{x_{i}}=0$ for $i=1,\dots,d$ . With this we compute:

[TABLE]

and obtain

[TABLE]

Since $p\geq\delta\geq\varepsilon$ implies that $p^{-}=0$ , the last two integrals vanish by our assumption on $c$ and $r$ . Moreover, since $1-\varepsilon\,c\geq 1-\varepsilon\,\|c\|_{L^{\infty}(\Omega)}\geq 0$ and $1-\varepsilon\,r\geq 1-\varepsilon\,\|r\|_{L^{\infty}(\partial\Omega)}\geq 0$ , the first two integrals are non-positive (recall that $p^{-}\leq 0$ ). It follows that $a(p^{-},p^{-})=0$ , implying $p^{-}=0$ .

From this we can deduce the following sufficient criterion for the pre-dual Slater condition.

Proposition 3.12.

If $r=0$ on $\partial\Omega\setminus{\omega_{o}}$ , then 3.1 is fulfilled for any compact ${\overline{\omega}_{c}}\subset\overline{\Omega}$ .

Proof 3.13.

We show that the solution $p$ of (30) is strictly positive. By Lemma 3.6, we already know that $p>0$ on $\Omega$ . For $\delta>0$ , let $\Omega_{\delta}:=\{x\in\Omega:p(x)\leq\delta\}$ . Note that $|\Omega_{\delta}|\to 0$ as $\delta\to 0$ since $p>0$ on $\Omega$ .

Define $a_{\delta}(\cdot,\cdot)$ like $a(\cdot,\cdot)$ but with $c$ replaced by $c_{\delta}:=(1-\chi_{\Omega_{\delta}})c$ , and $p_{\delta}$ as the solution of

[TABLE]

Then $p_{\delta}\geq 0$ and

[TABLE]

Hence, $p_{\delta}\geq p$ , and thus $p_{\delta}(x)<\delta$ implies that $p(x)<\delta$ and thus $c_{\delta}(x)=0$ . Hence, Lemma 3.10 yields (after choosing $\delta\leq\min\left\{\|c\|^{-1}_{L^{\infty}(\Omega)},\|r\|^{-1}_{L^{\infty}(\partial\Omega)}\right\}$ ) that $p_{\delta}\geq\delta$ .

Furthermore,

[TABLE]

and for any $1\leq s<\infty$ ,

[TABLE]

so that by [Stampacchia:1965a, Théorème 4.1], there exists a $C>0$ such that for any $s>d$ ,

[TABLE]

Since $|\Omega_{\delta}|\to 0$ for $\delta\to 0$ , we can choose $\delta$ sufficiently small such that for adequately chosen $s\in(d,\infty)$ , we have

[TABLE]

Hence, we can estimate

[TABLE]

i.e., $\|p_{\delta}\|_{L^{\infty}(\Omega_{\delta})}\leq\frac{4}{3}\delta$ . We conclude that $\|p-p_{\delta}\|_{L^{\infty}(\Omega)}\leq\frac{1}{4}\frac{4}{3}\delta=\frac{1}{3}\delta$ , and therefore

[TABLE]

as claimed.

3.2 Examples

To illuminate our abstract framework further, let us discuss in the following a couple of examples. All of them have in common the generic definition of

[TABLE]

where $q^{\prime}\leq 2$ is chosen appropriately as stated in the beginning of Section 2. However, the examples will cover different definitions of $E$ and $B$ and the corresponding spaces, i.e., different types of control and observation.

Distributed control for a Neumann problem

As a first example, consider a homogeneous Neumann problem with distributed control (i.e., $r=0$ and ${\overline{\omega}_{c}}=\overline{\Omega}$ ), such that

[TABLE]

is the control operator with pre-adjoint $\prescript{*\!}{}{B}=\mathrm{Id}:C(\overline{\Omega})\to C(\overline{\Omega})$ .

Let us first consider boundary observation, i.e., ${\omega_{o}}=\partial\Omega$ . We start with recalling that there exists a continuous trace operator

[TABLE]

for suitably chosen $s$ depending on $q^{\prime}$ and the spatial dimension $d$ of $\Omega$ . In particular, for $q^{\prime}=2$ we may always choose $s=2$ . In the general case, we may define

[TABLE]

(which implies $\operatorname{dom}E\supset H^{1}(\Omega)$ if $q^{\prime}\leq 2$ ), and then

[TABLE]

as the restriction of $\tau_{q^{\prime}}$ to $\operatorname{dom}E$ . Since the norm of the co-domain space has been strengthened, $E$ is in general not continuous anymore. It is, however, a closed operator: Assume that $y_{n}\to y$ in $W^{1,q^{\prime}}(\Omega)$ and $Ey_{n}\to h$ in $L^{2}(\partial\Omega)$ . By continuity of $\tau_{q^{\prime}}$ , we conclude that $Ey_{n}\to\tau_{q^{\prime}}y$ in $L^{s}(\partial\Omega)$ ; but from $Ey_{n}\to h$ in $L^{2}(\partial\Omega)$ we deduce that $\tau_{q^{\prime}}y=h\in L^{2}(\partial\Omega)$ and thus $y\in\operatorname{dom}E$ and $Ey=\tau_{q^{\prime}}y=h$ .

We summarize that $E$ satisfies all our assumptions, and note that for $d=2$ we may choose $q^{\prime}$ sufficiently close to $2$ such that $E:=\tau_{q^{\prime}}:W^{1,q^{\prime}}(\Omega)\to L^{2}(\partial\Omega)$ is well-defined as a continuous operator. However, the same is impossible for $d=3$ , so that we have to work with unbounded $E$ in this case.

For the case of observation on the whole domain (i.e., ${\omega_{o}}=\Omega$ ) and $d\leq 3$ , we may simply define $E:W^{1,q^{\prime}}(\Omega)\to L^{2}(\Omega)$ as the Sobolev embedding which exists for suitably chosen $q^{\prime}$ . In the “exotic” case $d>3$ , a similar effect as for boundary control with $d=3$ appears, and $E$ has to be defined as an unbounded operator.

By Proposition 3.12 and by our assumption $r=0$ , we see that we can choose ${\omega_{o}}\subset\overline{\Omega}$ arbitrarily as long as it has positive measure with respect to the measure $d\overline{\nu}$ on $\overline{\Omega}$ .

Robin or Neumann boundary control

In this case, our control operator is defined as the extension by zero

[TABLE]

i.e., $\prescript{*\!}{}{B}:C(\overline{\Omega})\to C(\partial\Omega)$ denotes the trace operator from $\overline{\Omega}$ to ${\overline{\omega}_{c}}=\partial\Omega$ . Again, we take $\prescript{*\!}{}{E}$ as the identity. To verify the pre-dual Slater condition, we then need to find $h\in L^{2}(\Omega)$ , such that the solution $p\in W^{1,q}(\Omega)$ of the problem

[TABLE]

has a strictly positive boundary trace, i.e., $\prescript{*\!}{}{B}p\geq\varepsilon>0$ . According to Proposition 3.12 this can be achieved for Neumann boundary conditions if ${\omega_{o}}$ is arbitrary (of non-zero measure), and for Robin boundary conditions if ${\omega_{o}}\supset\partial\Omega$ .

Distributed control for a Dirichlet problem

We close this section with a simple example for which 3.1 is violated. Consider the problem

[TABLE]

Due to the homogeneous Dirichlet boundary conditions and by continuity, there cannot be any solutions of the pre-dual problem which are larger than some $\varepsilon>0$ on the whole domain, which coincides with the control domain. So 3.1 is clearly violated.

To show that the conclusions of Theorem 3.4 do not hold either, let us take for $n\geq 2$ the sequence of measures $u_{n}=n\delta_{1/n}$ , which is contained in $\mathcal{M}([0,1])$ but unbounded.

Lemma 3.14.

The weak solution $y_{n}\in H^{1}_{0}(0,1)$ of $y^{\prime\prime}=n\delta_{1/n}$ is given by

[TABLE]

Proof 3.15.

We have to find $y_{n}$ such that $\int_{\Omega}y_{n}^{\prime}p^{\prime}\,dx=n\,p(1/n)$ for all $p\in H^{1}_{0}((0,1))$ and $y_{n}(0)=y_{n}(1)=0$ . By the Lax–Milgram theorem, we know that this solution is unique; moreover, the special form of the right-hand side leads us to the ansatz $y^{\prime}_{n}=\alpha$ on $[0,1/n]$ and $y^{\prime}_{n}=\beta$ on $[1/n,1]$ . Using the homogeneous boundary conditions, we find that $y_{n}=\alpha x$ on $[0,1/n]$ and $y_{n}=\beta(x-1)$ on $[1/n,1]$ . Since $y_{n}$ has to be continuous at $x=1/n$ , we conclude that $\alpha\frac{1}{n}=\beta\left(\frac{1}{n}-1\right)$ .

Using the weak formulation and the fundamental theorem of calculus, we then obtain

[TABLE]

which implies that $\alpha-\beta=n$ . Solving these two equations for $\alpha$ and $\beta$ yields our claim.
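The ansatz can be cross-checked numerically. Continuity at $x=1/n$ together with $\alpha-\beta=n$ gives $\alpha=n-1$ and $\beta=-1$ , i.e., $y_{n}(x)=(n-1)x$ on $[0,1/n]$ and $y_{n}(x)=1-x$ on $[1/n,1]$ . The following minimal sketch (not part of the original analysis) solves the weak formulation with piecewise linear finite elements on a uniform mesh that contains $1/n$ as a node and compares the result with this candidate; the function names and the mesh parameter `cells_per_kink` are hypothetical choices made for illustration.

```python
def closed_form(x, n):
    """Candidate from the ansatz: slope n - 1 left of 1/n, slope -1 right of it."""
    return (n - 1) * x if x <= 1.0 / n else 1.0 - x


def p1_solution(n, cells_per_kink=8):
    """Solve int_0^1 y' p' dx = n p(1/n), y(0) = y(1) = 0, with P1 finite elements.

    The mesh has N = n * cells_per_kink equal cells, so x = 1/n is the node with
    index cells_per_kink and the Dirac load hits exactly one hat function.
    """
    N = n * cells_per_kink
    h = 1.0 / N
    m = N - 1                            # number of interior nodes
    diag = [2.0 / h] * m                 # stiffness matrix (1/h) * tridiag(-1, 2, -1)
    off = -1.0 / h
    rhs = [0.0] * m
    rhs[cells_per_kink - 1] = float(n)   # <n delta_{1/n}, hat_j> = n at the kink node

    # Thomas algorithm: forward elimination, then back substitution
    for i in range(1, m):
        w = off / diag[i - 1]
        diag[i] -= w * off
        rhs[i] -= w * rhs[i - 1]
    y = [0.0] * m
    y[-1] = rhs[-1] / diag[-1]
    for i in range(m - 2, -1, -1):
        y[i] = (rhs[i] - off * y[i + 1]) / diag[i]

    nodes = [j * h for j in range(N + 1)]
    return nodes, [0.0] + y + [0.0]


def max_nodal_error(n):
    """Largest nodal deviation between the discrete solution and the candidate."""
    nodes, y = p1_solution(n)
    return max(abs(yj - closed_form(xj, n)) for xj, yj in zip(nodes, y))
```

Since the candidate is itself piecewise linear on this mesh, the discrete solution should reproduce it up to rounding errors, so `max_nodal_error(n)` is expected to be at the level of machine precision.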

Proposition 3.16.

Problem (49) does not possess an optimal solution in $\mathcal{M}([0,1])$ .

Proof 3.17.

From Lemma 3.14 we conclude that $y_{n}\to 1-x$ in $L^{2}((0,1))$ . Hence, $\{(y_{n},u_{n})\}_{n\in{\mathbb{N}}}$ is a minimizing sequence, since each pair is feasible and $J(y_{n})\to 0\leq J(y)$ for all $y$ . However, the limit $J=0$ cannot be attained, because the only possible candidate $y(x)=1-x$ does not satisfy the boundary conditions.
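The two competing effects in this argument can be made quantitative with a small numerical sketch. It assumes, as the proof suggests, that the target is $y_{d}(x)=1-x$ and that $J(y)=\frac{1}{2}\|y-y_{d}\|_{L^{2}((0,1))}^{2}$ , and it uses the explicit form $y_{n}(x)=(n-1)x$ on $[0,1/n]$ , $y_{n}(x)=1-x$ on $[1/n,1]$ obtained from the ansatz in the proof of Lemma 3.14. The function names are hypothetical.

```python
def y_n(x, n):
    """State for u_n = n * delta_{1/n}: (n - 1) x left of 1/n, 1 - x right of it."""
    return (n - 1) * x if x <= 1.0 / n else 1.0 - x


def tracking_value(n, cells=20000):
    """Midpoint-rule approximation of 0.5 * ||y_n - y_d||^2 with y_d(x) = 1 - x (assumed)."""
    h = 1.0 / cells
    total = 0.0
    for k in range(cells):
        x = (k + 0.5) * h
        total += (y_n(x, n) - (1.0 - x)) ** 2 * h
    return 0.5 * total


def control_norm(n):
    """Total variation norm of the non-negative measure u_n = n * delta_{1/n}."""
    return float(n)
```

Under these assumptions the integral can also be evaluated by hand: $y_{n}-y_{d}=nx-1$ on $[0,1/n]$ and vanishes elsewhere, so $J(y_{n})=\frac{1}{6n}\to 0$ while $\|u_{n}\|_{\mathcal{M}([0,1])}=n\to\infty$ .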

If we instead consider

[TABLE]

for some $\delta>0$ , then the control domain $[\delta,1-\delta]$ is a compact subset of $(0,1)$ . So by Lemma 3.6 we can verify 3.1 and thus apply Theorem 3.4 to assert existence of an optimal control in $\mathcal{M}([0,1])$ . This reasoning works in general for distributed control on a compact subset ${\overline{\omega}_{c}}$ of the domain $\Omega$ .

4 Optimality conditions

We apply Fenchel duality to derive optimality conditions for minimizers of (P). For the reader’s convenience, we recall duality theory, e.g., from [Ekeland:1999a, Chapter II.4]. For a functional $\mathcal{F}:W\to\overline{\mathbb{R}}{}:=\mathbb{R}\cup\{\infty\}$ defined on a Banach space $W$ , let $\mathcal{F}^{*}:W^{*}\to\overline{\mathbb{R}}{}$ denote the Fenchel conjugate of $\mathcal{F}$ given for $w^{*}\in W^{*}$ by

$$\mathcal{F}^{*}(w^{*})=\sup_{w\in W}\,\langle w^{*},w\rangle_{W^{*},W}-\mathcal{F}(w).$$

Furthermore, let

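$$\partial\mathcal{F}(w):=\left\{w^{*}\in W^{*}:\langle w^{*},\tilde{w}-w\rangle_{W^{*},W}\leq\mathcal{F}(\tilde{w})-\mathcal{F}(w)\ \text{ for all }\tilde{w}\in W\right\}$$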

denote the subdifferential of the convex function $\mathcal{F}$ at $w$ , which reduces to the Gâteaux-derivative $\mathcal{F}^{\prime}(w)$ if it exists. These definitions immediately yield the Fenchel–Young inequality

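$$\langle w^{*},w\rangle_{W^{*},W}\leq\mathcal{F}(w)+\mathcal{F}^{*}(w^{*})\quad\text{for all }w\in W,\ w^{*}\in W^{*},\tag{55}$$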

where equality holds if and only if $w^{*}\in\partial\mathcal{F}(w)$ .
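As a small illustration of these definitions, the Fenchel conjugate of a function on $\mathbb{R}$ can be approximated by maximizing over a finite grid. The sketch below does this for $\mathcal{F}(w)=\frac{1}{2}w^{2}$ (whose conjugate is $\frac{1}{2}(w^{*})^{2}$ ) and for the indicator function of $[0,\infty)$ ; the grid and all function names are hypothetical, and the grid-based supremum is only meaningful where the true supremum is attained inside the grid.

```python
GRID = [k / 1000.0 for k in range(-10000, 10001)]   # uniform grid on [-10, 10]


def conjugate(f, w_star, grid=GRID):
    """Approximate F*(w*) = sup_w  w* w - F(w) by a maximum over a finite grid."""
    return max(w_star * w - f(w) for w in grid)


def quadratic(w):
    """F(w) = 0.5 w^2, with derivative F'(w) = w."""
    return 0.5 * w * w


def indicator_nonneg(w):
    """Indicator function of [0, inf): 0 on the set, +inf outside."""
    return 0.0 if w >= 0.0 else float("inf")


def fenchel_young_gap(f, w, w_star, grid=GRID):
    """F(w) + F*(w*) - w* w, which is non-negative by the Fenchel-Young inequality."""
    return f(w) + conjugate(f, w_star, grid) - w_star * w
```

For the quadratic, the gap vanishes exactly when $w^{*}=\mathcal{F}^{\prime}(w)=w$ , in line with the characterization of equality via the subdifferential; for the indicator function, the conjugate at a non-positive argument is zero.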

The Fenchel duality theorem states that if $\mathcal{F}:W\to\overline{\mathbb{R}}{}$ and $\mathcal{G}:Z\to\overline{\mathbb{R}}{}$ are proper, convex, and lower semicontinuous functionals on the Banach spaces $W$ and $Z$ , $\Lambda:W\to Z$ is a continuous linear operator, and there exists a $w_{0}\in W$ such that $\mathcal{F}(w_{0})<\infty$ , $\mathcal{G}(\Lambda w_{0})<\infty$ , and $\mathcal{G}$ is continuous at $\Lambda w_{0}$ (a generalized Slater condition), then

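$$\inf_{w\in W}\,\mathcal{F}(w)+\mathcal{G}(\Lambda w)=\sup_{z^{*}\in Z^{*}}\,-\mathcal{F}^{*}(\Lambda^{*}z^{*})-\mathcal{G}^{*}(-z^{*}),\tag{56}$$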

and the right-hand side of (56) – the dual problem – has at least one solution. Furthermore, the equality in (56) is attained at $(\bar{w},\bar{z}^{*})\in W\times Z^{*}$ if and only if

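$$\Lambda^{*}\bar{z}^{*}\in\partial\mathcal{F}(\bar{w}),\qquad-\bar{z}^{*}\in\partial\mathcal{G}(\Lambda\bar{w})\tag{57}$$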

holds; see, e.g., [Ekeland:1999a, Remark III.4.2].
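A scalar toy problem illustrates the statement. Take $W=Z=\mathbb{R}$ , $\Lambda=\mathrm{Id}$ , $\mathcal{F}(w)=\frac{1}{2}(w-a)^{2}$ and $\mathcal{G}$ the indicator function of $[0,\infty)$ , which is continuous at, e.g., $w_{0}=1$ . With the sign convention that the dual problem reads $\sup_{z^{*}}-\mathcal{F}^{*}(\Lambda^{*}z^{*})-\mathcal{G}^{*}(-z^{*})$ , a direct computation gives $\mathcal{F}^{*}(z)=\frac{1}{2}z^{2}+az$ and $\mathcal{G}^{*}(-z)=0$ for $z\geq 0$ (and $+\infty$ otherwise). The following sketch, with hypothetical function names and a hypothetical grid, compares the two optimal values.

```python
GRID = [k / 1000.0 for k in range(-10000, 10001)]   # uniform grid on [-10, 10]


def primal_value(a, grid=GRID):
    """inf_w 0.5 (w - a)^2 + delta_{[0, inf)}(w), approximated on the grid."""
    return min(0.5 * (w - a) ** 2 for w in grid if w >= 0.0)


def dual_value(a, grid=GRID):
    """sup_z -F*(z) - G*(-z) with F*(z) = 0.5 z^2 + a z and G*(-z) = 0 for z >= 0."""
    return max(-(0.5 * z * z + a * z) for z in grid if z >= 0.0)


def primal_dual_pair(a):
    """Closed-form solutions: w = max(a, 0) and z = F'(w) = w - a = max(-a, 0)."""
    w_bar = max(a, 0.0)
    return w_bar, w_bar - a
```

Both values equal $\frac{1}{2}\min\{a,0\}^{2}$ , so the duality gap is zero, and the pair returned by `primal_dual_pair` satisfies $\bar{z}\geq 0$ , $\bar{w}\geq 0$ and $\bar{z}\,\bar{w}=0$ , which is the scalar form of the complementarity encoded in the extremality relations.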

We wish to apply the Fenchel duality theorem to (P), where $\Lambda$ would take the role of the control-to-observation mapping $S$ . Since $\mathcal{M}$ is non-reflexive, the dual problem would be posed in $\mathcal{M}^{*}$ , which is difficult to characterize. We therefore follow a pre-dual approach as in [Clason:2010a, Clason:2011a], where we introduce the optimization problem

[TABLE]

(obtained by formal application of Fenchel duality) and show that its Fenchel dual coincides with problem (P).

Remark 4.1.

Before delving into a deeper analysis, let us point out that the pre-dual problem ( $\prescript{*\!}{}{}$ P) is essentially a state-constrained optimal control problem with control $h\in\operatorname{dom}\prescript{*\!}{}{S}\subset L^{2}({\omega_{o}})$ and state $p:=\prescript{*\!}{}{S}h\in C(\overline{\Omega})$ , i.e.,

[TABLE]

However, it has the slightly unusual characteristics that the state does not appear in the objective and that the inequality constraint is imposed on a subdomain.

A further complication arises if $\operatorname{dom}\prescript{*\!}{}{S}$ is a proper subset of $L^{2}({\omega_{o}})$ . This case corresponds to a state-constrained problem where the control-to-state mapping does not map into the space of continuous functions. Such problems have been analysed in [Schiela:2009]. The analysis performed in this section may offer an alternative approach to this class of problems.

Problem ( $\prescript{*\!}{}{}$ P) is strictly convex and admits a feasible point by 3.1 and thus is non-trivial, i.e., admits a finite infimum. If $\operatorname{dom}\prescript{*\!}{}{S}$ is not closed, we cannot expect ( $\prescript{*\!}{}{}$ P) to have a minimizer. However, any minimizing sequence is bounded in $L^{2}({\omega_{o}})$ and thus has a weak cluster point $\bar{h}\in L^{2}({\omega_{o}})$ . In fact, by strict convexity of the term $\|h+y_{d}\|_{L^{2}({\omega_{o}})}^{2}$ , any minimizing sequence converges even strongly to the unique limit $\bar{h}$ . While $\bar{h}$ is possibly not contained in $\operatorname{dom}\prescript{*\!}{}{S}$ – and hence $\prescript{*\!}{}{S}\bar{h}$ is not defined – we can express the limit using a suitable extension of $\prescript{*\!}{}{S}$ which we will define below.

Although the Fenchel duality theorem is not directly applicable since $\prescript{*\!}{}{S}$ may be an unbounded operator, a modification of the arguments in [Ekeland:1999a] shows that the statement still holds. In our argumentation, we can make use of the fact that we have already established existence of solutions of the dual problem in Theorem 3.4. For the sake of completeness, we give here the full proof, where we closely follow [Ekeland:1999a, Chapter II.4]. Let us define for problem ( $\prescript{*\!}{}{}$ P) the perturbation function $\Phi:L^{2}({\omega_{o}})\times C({\overline{\omega}_{c}})\to\overline{\mathbb{R}}$ by

[TABLE]

Clearly, $\Phi(h,v)$ is convex but – by the last term – not lower semicontinuous with respect to $h$ unless $\operatorname{dom}\prescript{*\!}{}{S}=L^{2}({\omega_{o}})$ . Furthermore, $\inf_{h}\Phi(h,0)$ coincides with ( $\prescript{*\!}{}{}$ P) and hence is finite.

Consider now the Fenchel conjugate $\Phi^{*}:L^{2}({\omega_{o}})\times\mathcal{M}({\overline{\omega}_{c}})\to\overline{\mathbb{R}}$ of $\Phi$ with respect to $(h,v)$ .

Lemma 4.2.

The dual problem

[TABLE]

coincides with problem (P). Furthermore, if 3.1 is satisfied, the supremum is attained at $\bar{v}^{*}=\bar{u}$ .

Proof 4.3.

By definition, the Fenchel conjugate at $h^{*}=0$ is given by

[TABLE]

Using that $\operatorname{dom}\prescript{*\!}{}{S}$ is dense in $L^{2}({\omega_{o}})$ and introducing for $h\in\operatorname{dom}\prescript{*\!}{}{S}$ the function $p:=\prescript{*\!}{}{S}h-v\in C({\overline{\omega}_{c}})$ then yields for the case that $v^{*}\in\operatorname{dom}S$ :

[TABLE]

If, in contrast, $v^{*}\notin\operatorname{dom}S$ , there exists a sequence $\{h_{n}\}_{n\in{\mathbb{N}}}\subset\operatorname{dom}\prescript{*\!}{}{S}$ , bounded in $L^{2}({\omega_{o}})$ , such that $\langle v^{*},\prescript{*\!}{}{S}h_{n}\rangle_{\mathcal{M},C}\to\infty$ . Hence the first term in the first line is unbounded, while the others are bounded, and thus $\Phi^{*}(0,v^{*})=\infty$ . We therefore assume that $v^{*}\in\operatorname{dom}S$ and maximize separately with respect to $p$ and $h$ . Considering the first term, we have that $\langle v^{*},p\rangle_{\mathcal{M},C}<0$ for some $p\geq 0$ implies that $\Phi^{*}(0,v^{*})=\infty$ . Otherwise, the supremum is attained at $p=0$ and is $0$ . For the second term, we use that the functional is differentiable with respect to $h$ to deduce that the supremum is attained at $h=Sv^{*}-y_{d}$ . Together, we obtain

[TABLE]

Writing $u:=v^{*}$ , we see that the dual problem (60) is precisely our original problem (P), which by Theorem 3.4 has a solution $\bar{u}\in\operatorname{dom}S\subset\mathcal{M}({\overline{\omega}_{c}})$ .

To derive optimality conditions, we first show that the duality gap between ( $\prescript{*\!}{}{}$ P) and (P) is zero.

Proposition 4.4.

We have that

[TABLE]

Proof 4.5.

The claim follows from [Ekeland:1999a, Proposition III.2.1] if Problem ( $\prescript{*\!}{}{}$ P) is normal, i.e., the mapping $v\mapsto\inf_{h}\Phi(h,v)$ is lower semicontinuous at $0$ . To verify this, it suffices to show that for each feasible point $h_{v}\in\operatorname{dom}\Phi(\cdot,v)$ , we can find a nearby feasible point $h_{0}\in\operatorname{dom}\Phi(\cdot,0)$ with $\Phi(h_{v},v)$ close to $\Phi(h_{0},0)$ . This can be achieved by adding a small multiple of the function $h$ from 3.1, since $\prescript{*\!}{}{S}h$ is strictly positive and the perturbations are measured in the $C({\overline{\omega}_{c}})$ -norm.

Thus, for given $\varepsilon>0$ we can find $\delta>0$ such that with $\|v\|_{C({\overline{\omega}_{c}})}<\delta$ , $h_{0}:=h_{v}+\varepsilon h$ is feasible for the original problem, as long as $h_{v}$ is feasible for the perturbed problem. Moreover, it is easy to see that $\Phi(h_{0},0)-\Phi(h_{v},v)\leq\tau(\varepsilon)$ with $\tau\to 0$ as $\varepsilon\to 0$ . Taking infima, this implies that

[TABLE]

which in turn yields the desired lower semicontinuity and thus (64).

To derive optimality conditions from the equality (64), we continue as in [Ekeland:1999a, § III, equation (4.22)]. We first derive a limiting form of the optimality conditions.

Proposition 4.6.

Let $\{h_{n}\}_{n\in{\mathbb{N}}}\subset\operatorname{dom}\prescript{*\!}{}{S}\subset L^{2}({\omega_{o}})$ be a minimizing sequence for Problem ( $\prescript{*\!}{}{}$ P) with $h_{n}\to\bar{h}\in L^{2}({\omega_{o}})$ , and let $\bar{u}\in\mathcal{M}({\overline{\omega}_{c}})$ be the solution to Problem (60). Then,

[TABLE]

Proof 4.7.

By definition of $\Phi^{*}$ , Proposition 4.4 implies that if $\{h_{n}\}_{n\in{\mathbb{N}}}$ is a minimizing sequence of $\Phi(\cdot,0)$ and $\bar{u}$ is a minimizer of $\Phi^{*}(0,\cdot)$ , we have

[TABLE]

We now use continuity of $\|\cdot\|_{L^{2}({\omega_{o}})}$ with respect to $h_{n}\to\bar{h}$ (recall that this limit exists due to the strict convexity of the first term in ( $\prescript{*\!}{}{}$ P)), which yields

[TABLE]

Next, we observe that, since $\bar{u}\in\operatorname{dom}S$ and thus $S\bar{u}\in L^{2}({\omega_{o}})^{*}$ , we have the convergence

[TABLE]

Hence, continuing our last computation, we obtain

[TABLE]

We now argue that both brackets are non-negative. For the first bracket, we use the fact that the third term is the Fenchel conjugate of the sum of the first two terms to apply the Fenchel–Young inequality (55). For the second bracket, feasibility of elements of a minimizing sequence (after passing to a subsequence if necessary) implies that $\prescript{*\!}{}{S}h_{n}\geq 0$ and $\bar{u}\geq 0$ and hence that the first two terms vanish. By definition of non-negativity of measures, positivity of $\bar{u}$ and $\prescript{*\!}{}{S}h_{n}$ implies that $\langle\bar{u},\prescript{*\!}{}{S}h_{n}\rangle_{\mathcal{M},C}\geq 0$ for all $n\in{\mathbb{N}}$ and hence that the third term is non-negative as well. Therefore, each bracket has to vanish separately. The first one immediately yields equality in (55) and hence that

[TABLE]

i.e., the first relation of (66). From the second bracket, we directly obtain the remaining relations (i.e., the second line) of (66).

We now wish to pass to the limit $n\to\infty$ in (66), which is impeded by the fact that the operators $S$ and $\prescript{*\!}{}{S}$ are defined in the non-standard setting needed for measure-valued control. Recall that $\prescript{*\!}{}{S}$ – which appears in $\langle\bar{u},\prescript{*\!}{}{S}h_{n}\rangle_{\mathcal{M},C}$ – is a restriction of its classical counterpart $\prescript{*\!}{}{S}_{H^{1}}:L^{2}({\omega_{o}})\to L^{2}({\overline{\omega}_{c}})$ . Hence, while $\prescript{*\!}{}{S}\bar{h}$ may not be well-defined, $\prescript{*\!}{}{S}_{H^{1}}\bar{h}$ is well-defined since $\bar{h}\in L^{2}({\omega_{o}})$ . Moreover, from $\bar{u}\in\operatorname{dom}S$ we can deduce not only that $\bar{u}\in\mathcal{M}({\overline{\omega}_{c}})$ but also that $S\bar{u}\in L^{2}({\omega_{o}})$ .

We thus make use of $\prescript{*\!}{}{S}_{H^{1}}$ to define a new bilinear form

[TABLE]

that can be used as a replacement of the term $\langle\bar{u},\prescript{*\!}{}{S}h_{n}\rangle_{\mathcal{M},C}$ in (66) but is well-defined also for the limit $\bar{h}$ . Let $u\in\operatorname{dom}S$ and $\lambda\in\operatorname{ran}\prescript{*\!}{}{S}_{H^{1}}$ with $h\in L^{2}({\omega_{o}})$ such that $\lambda=\prescript{*\!}{}{S}_{H^{1}}h$ , then set

[TABLE]

With this definition, we obtain the following first-order necessary optimality conditions.

Theorem 4.8.

Let $\bar{u}\in\mathcal{M}({\overline{\omega}_{c}})$ be a minimizer of Problem (1). Then there exist $\bar{y}\in W^{1,q^{\prime}}(\Omega)$ , $\bar{p}\in H^{1}(\Omega)$ and $\bar{\lambda}\in\operatorname{ran}\prescript{*\!}{}{S}_{H^{1}}\subset L^{2}({\overline{\omega}_{c}})$ satisfying

[TABLE]

Proof 4.9.

First, we note that $\langle u,\lambda\rangle_{\operatorname{dom}S,\operatorname{ran}\prescript{*\!}{}{S}_{H^{1}}}$ is well-defined because $u\in\operatorname{dom}S$ implies $Su\in L^{2}({\omega_{o}})$ , and because $h\in L^{2}({\omega_{o}})=\operatorname{dom}\prescript{*\!}{}{S}_{H^{1}}$ . We now argue that this bilinear form can indeed be used in (66). For $h\in\operatorname{dom}\prescript{*\!}{}{S}$ , we have $\lambda=\prescript{*\!}{}{S}_{H^{1}}h=\prescript{*\!}{}{S}h\in C({\overline{\omega}_{c}})$ and thus

[TABLE]

Furthermore, if $u\in\operatorname{dom}S$ and the sequence $\{h_{n}\}_{n\in{\mathbb{N}}}\subset\operatorname{dom}\prescript{*\!}{}{S}$ converges to $h$ in $L^{2}({\omega_{o}})$ , then

[TABLE]

Thus, the limit $\lim_{n\to\infty}\langle\bar{u},\prescript{*\!}{}{S}h_{n}\rangle_{\mathcal{M},C}$ in (66) can be replaced by $\langle\bar{u},\prescript{*\!}{}{S}_{H^{1}}\bar{h}\rangle_{\operatorname{dom}S,\operatorname{ran}\prescript{*\!}{}{S}_{H^{1}}}$ as claimed.

Introducing the state $\bar{y}:=S\bar{u}=A^{-1}B\bar{u}$ , an adjoint state $\bar{p}:=\prescript{*\!}{}{\!A}_{H^{1}}^{-1}\prescript{*\!}{}{E}\bar{h}=\prescript{*\!}{}{\!A}_{H^{1}}^{-1}\prescript{*\!}{}{E}(S\bar{u}-y_{d})\in H^{1}(\Omega)$ and a Lagrangian multiplier $\bar{\lambda}:=\prescript{*\!}{}{B}\bar{p}=\prescript{*\!}{}{S}_{H^{1}}\bar{h}\in\operatorname{ran}\prescript{*\!}{}{S}_{H^{1}}$ now yields (OS).

If $E$ is continuous, we can directly pass to the limit in the second relation of (66) and obtain a Lagrange multiplier $\bar{\lambda}=\prescript{*\!}{}{S}\bar{h}\in C({\overline{\omega}_{c}})$ .

Corollary 4.10.

Assume that $E$ is continuous, and let $\bar{u}\in\mathcal{M}({\overline{\omega}_{c}})$ be a minimizer of Problem (1). Then there exist $\bar{y}\in\operatorname{dom}A$ , $\bar{p}\in\operatorname{dom}\prescript{*\!}{}{\!A}\subset H^{1}(\Omega)\cap C(\overline{\Omega})$ , and $\bar{\lambda}\in C({\overline{\omega}_{c}})$ satisfying

[TABLE]

In this case, the optimality conditions can also be obtained by direct application of the Fenchel duality theorem to problem ( $\prescript{*\!}{}{}$ P), where the last three relations of (76) are the complementarity conditions of the second relation of (57), which here read $-\bar{u}\in\partial\delta_{C^{+}}(\bar{\lambda})$ .
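A finite-dimensional analogue may help to read the complementarity conditions. In the sketch below, a matrix `S` stands in for the control-to-observation mapping, the multiplier is taken as $\lambda=S^{T}(Su-y_{d})$ (mirroring $\bar{\lambda}=\prescript{*\!}{}{S}\bar{h}$ with $\bar{h}=S\bar{u}-y_{d}$ ), and the non-negatively constrained least-squares problem is solved by a plain projected gradient iteration. The matrix, the data, the step size and all function names are hypothetical choices; this is not the semismooth Newton method proposed in the paper.

```python
def matvec(M, v):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(mij * vj for mij, vj in zip(row, v)) for row in M]


def transpose(M):
    return [list(col) for col in zip(*M)]


def nonneg_least_squares(S, y_d, step, iters=500):
    """Projected gradient for min 0.5 |S u - y_d|^2 subject to u >= 0."""
    St = transpose(S)
    u = [0.0] * len(S[0])
    for _ in range(iters):
        residual = [a - b for a, b in zip(matvec(S, u), y_d)]
        grad = matvec(St, residual)
        # gradient step followed by projection onto the non-negative cone
        u = [max(ui - step * gi, 0.0) for ui, gi in zip(u, grad)]
    return u


def multiplier(S, y_d, u):
    """lambda = S^T (S u - y_d), the discrete counterpart of the multiplier."""
    residual = [a - b for a, b in zip(matvec(S, u), y_d)]
    return matvec(transpose(S), residual)


S = [[1.0, 1.0], [0.0, 1.0]]
y_d = [1.0, -1.0]
u_bar = nonneg_least_squares(S, y_d, step=0.3)   # step below 2 / ||S^T S||
lam_bar = multiplier(S, y_d, u_bar)
```

For this data the iteration should approach $\bar{u}=(1,0)$ with $\bar{\lambda}=(0,1)$ , so that $\bar{u}\geq 0$ , $\bar{\lambda}\geq 0$ and $\bar{u}\cdot\bar{\lambda}=0$ : the control vanishes where the multiplier is strictly positive.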

5 Connection to problems with control costs

In this section, we show that problem (P) can be interpreted as the limit problem for vanishing $L^{2}$ or measure-space control costs.

5.1 $\scriptstyle L^{2}$ control costs

We first connect the measure-space problem (P) with the classical control-constrained linear quadratic problem

[TABLE]

which for every $\alpha>0$ is known to admit a minimizer $u_{\alpha}\in L^{2}({\overline{\omega}_{c}})$ ; see, e.g., [TroBook, Theorem 2.14]. Arguing as in the proof of Theorem 3.4, it can be shown that $u_{\alpha}$ converges weakly- $*$ to some $\hat{u}$ in $\mathcal{M}({\overline{\omega}_{c}})$ as $\alpha\to 0$ (up to a subsequence if $S$ is not injective). It is, however, not obvious that the limit $\hat{u}$ coincides with the global minimizer $\bar{u}$ from Theorem 3.4. The validity of this assertion hinges on the question whether there is a sequence $\{u_{n}\}_{n\in{\mathbb{N}}}\subset L^{2}({\overline{\omega}_{c}})^{+}$ such that $u_{n}\rightharpoonup^{*}\bar{u}$ and $Su_{n}\rightharpoonup S\bar{u}$ in $L^{2}({\omega_{o}})$ , i.e., whether optimal control and optimal observation can be approximated simultaneously by a sequence of positive functions.

Due to LABEL:thm:wsd, this is certainly the case if $E$ is continuous, since then $u_{n}\rightharpoonup^{*}\bar{u}$ implies $Su_{n}\to S\bar{u}$ by Lemma 2.1.

Theorem 5.1.

Assume that $E$ is continuous, $S$ is injective, and ${\overline{\omega}_{c}}$ is equipped with a measure $\nu$ such that $\nu({\overline{\omega}_{c}}\cap O)>0$ for every open set $O\subset\mathbb{R}^{d}$ , such that ${\overline{\omega}_{c}}\cap O$ is non-empty. Then

[TABLE]

Proof 5.2.

By LABEL:thm:wsd, there exists a sequence $\{v_{n}\}_{n\in{\mathbb{N}}}\subset L^{2}({\overline{\omega}_{c}})^{+}$ such that $v_{n}\rightharpoonup^{*}\bar{u}$ . Since $E$ is continuous, this implies via Lemma 2.1 that $Sv_{n}\to S\bar{u}$ strongly and thus that $\|Sv_{n}-y_{d}\|_{L^{2}({\omega_{o}})}\to\|S\bar{u}-y_{d}\|_{L^{2}({\omega_{o}})}$ . Denoting by $J_{\alpha}$ the functional in (Pα) and by $J$ the functional in (P), we conclude that for each $\varepsilon>0$ there are $v_{n}$ and $\alpha_{n}$ such that

[TABLE]

Hence, $\{u_{\alpha_{n}}\}_{n\in{\mathbb{N}}}$ is a minimizing sequence for $J$ , which satisfies – like any minimizing sequence – the properties stated in the proof of Theorem 3.4. This yields our assertions.

On the other hand, if $E$ and thus $S$ is unbounded, the graph norm on $\operatorname{dom}S$ , defined by $\|u\|_{S}:=\|u\|_{\mathcal{M}({\overline{\omega}_{c}})}+\|Su\|_{L^{2}({\omega_{o}})}$ , is strictly stronger than $\|u\|_{\mathcal{M}({\overline{\omega}_{c}})}$ . Thus, there may be sequences in $L^{2}({\overline{\omega}_{c}})$ that converge weakly- $*$ in $(\mathcal{M}({\overline{\omega}_{c}}),\|u\|_{\mathcal{M}({\overline{\omega}_{c}})})$ but are unbounded in $(\operatorname{dom}S,\|u\|_{S})$ and thus cannot converge weakly- $*$ with respect to this norm. Hence if $S$ is unbounded, the weak- $*$ sequential closure of $L^{2}({\overline{\omega}_{c}})$ may be a proper subset of $\operatorname{dom}S$ , and thus we cannot expect in general that our global minimizer $\bar{u}$ can be approximated by a minimizing sequence in $L^{2}({\overline{\omega}_{c}})$ .

Although the necessary optimality conditions for Problem (Pα) are standard (see, e.g., [TroBook, Theorem 2.22]), it is instructive to derive them using the convex analysis framework employed for (P). Since Problem (Pα) is posed in the Hilbert space $L^{2}({\overline{\omega}_{c}})$ and we have assumed $E$ to be continuous, we can apply the Fenchel duality theorem directly, where we denote by $\mathcal{F}^{*}$ the tracking term and by $\mathcal{G}_{\alpha}^{*}$ the two remaining terms in (Pα). To derive an explicit characterization of the second relation of (57), we set $\lambda_{\alpha}:=S^{*}h_{\alpha}\in L^{2}({\overline{\omega}_{c}})$ and use the fact that due to the Hilbert space setting, $\mathcal{G}_{\alpha}$ coincides with the Moreau envelope of $\delta_{L^{2}({\overline{\omega}_{c}})^{+}}$ , i.e.,

[TABLE]

see, e.g., [Bauschke, Proposition 13.12]. Hence, $\partial\mathcal{G}_{\alpha}$ coincides with the Yosida regularization of $\partial\delta_{L^{2}({\overline{\omega}_{c}})^{+}}$ , i.e.,

[TABLE]

since the proximal mapping of an indicator function of a convex set $C$ is given by the metric projection onto $C$ ; see, e.g., [Bauschke, Proposition 12.29]. After some algebraic manipulations, we thus obtain the optimality system

[TABLE]

where $\max$ is to be understood pointwise almost everywhere in ${\overline{\omega}_{c}}$ . Note that the system (OSα) coincides with the well-known projection formulation of the optimality condition for the control-constrained linear-quadratic problem (Pα); see, e.g., [TroBook, Theorem 2.28].
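The convex-analysis ingredients used here can be checked pointwise in one dimension. The sketch below takes the indicator function of $[0,\infty)$ and a generic parameter `gamma` (a hypothetical name; the precise scaling in terms of $\alpha$ is the one in the displayed formulas and is not reproduced here). It implements the proximal mapping as the projection $\max\{v,0\}$ , the Moreau envelope, and the Yosida regularization $(v-\operatorname{prox}(v))/\gamma$ , and compares the envelope with a brute-force minimization over a grid.

```python
GRID = [k / 1000.0 for k in range(-10000, 10001)]   # uniform grid on [-10, 10]


def prox_indicator_nonneg(v):
    """Proximal map of the indicator of [0, inf): the metric projection max(v, 0)."""
    return max(v, 0.0)


def moreau_envelope(v, gamma):
    """min_u delta_{[0, inf)}(u) + (u - v)^2 / (2 gamma) = min(v, 0)^2 / (2 gamma)."""
    return min(v, 0.0) ** 2 / (2.0 * gamma)


def yosida(v, gamma):
    """Yosida regularization (v - prox(v)) / gamma = min(v, 0) / gamma."""
    return (v - prox_indicator_nonneg(v)) / gamma


def brute_force_envelope(v, gamma, grid=GRID):
    """Moreau envelope by direct minimization over the feasible grid points."""
    return min((u - v) ** 2 / (2.0 * gamma) for u in grid if u >= 0.0)
```

In this scalar setting the Yosida regularization is the derivative of the Moreau envelope, which can be confirmed with a central difference quotient, and it vanishes wherever $v\geq 0$ .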

5.2 Measure-space control costs

We now connect problem (P) with the non-negative “sparse control problem”

[TABLE]

considered in [Clason:2011a]. Existence of an optimal control $u_{\beta}\in\mathcal{M}({\overline{\omega}_{c}})^{+}$ can be shown as in Theorem 3.4, using the fact that a minimizing sequence is necessarily bounded in $\mathcal{M}({\overline{\omega}_{c}})$ by virtue of the additional (weak- $*$ lower semi-continuous) term. Similarly, by the minimizing property of $u_{\beta}$ , the family $\{Su_{\beta}\}_{\beta>0}$ is bounded in $L^{2}({\omega_{o}})$ and hence $u_{\beta}$ converges weakly- $*$ to $\bar{u}$ in $\mathcal{M}({\overline{\omega}_{c}})$ as $\beta\to 0$ (up to a subsequence if $S$ is not injective) if 3.1 holds and $E$ is continuous. If on the other hand $E$ is unbounded, the discussion in Section 5.1 shows that $\operatorname{dom}S$ is in general not weakly- $*$ closed, and we cannot expect weak- $*$ convergence of $u_{\beta}$ to a minimizer $\bar{u}$ .

Optimality conditions for (Pβ) with a bounded control-to-observation mapping $S$ can be derived by application of the Fenchel duality theorem, making use of the fact that the Fenchel conjugate of

[TABLE]

is given by

[TABLE]

see [Clason:2011a, Remark 2.5]. (Recall that by (56) the dual problem involves $\mathcal{G}^{*}_{\beta}(-u)$ .) Fenchel duality now leads to the necessary optimality conditions

[TABLE]

see again [Clason:2011a, Remark 2.5], where the last relation was equivalently expressed as a variational inequality. Setting $\beta=0$ , we recover (66).

The optimality conditions (83) are frequently used as a justification for calling $u_{\beta}$ a sparse control: From the last relations, we see that $u_{\beta}$ must be zero on all subsets of ${\overline{\omega}_{c}}$ where $\prescript{*\!}{}{B}p_{\beta}$ is strictly greater than $-\beta$ . Hence, the support of $u_{\beta}$ is contained in the set $\{x\in{\overline{\omega}_{c}}:\prescript{*\!}{}{B}p_{\beta}(x)=-\beta\}$ , which in many situations (e.g., if $p_{\beta}$ is harmonic) can be argued to be a set of zero Lebesgue measure. Furthermore, increasing $\beta$ will decrease the size of this set. The same argument is possible for (66): the optimal control $\bar{u}$ must be zero on all subsets with $\prescript{*\!}{}{B}\bar{p}>0$ , and hence the support of $\bar{u}$ is contained in $\{x\in{\overline{\omega}_{c}}:\prescript{*\!}{}{B}\bar{p}(x)=0\}$ (which has Lebesgue measure zero in similar situations as in the case $\beta>0$ ). This implies that optimal measure-space controls have an inherent sparsity independent of the sparsity-promoting control cost, whose role is solely to control the size of the support.

We can also apply our framework from Section 4 to derive optimality conditions for unbounded observation operators (which cannot be treated using the standard approach as in, e.g., [Clason:2011a]). Proceeding exactly as before with $\delta_{C({\overline{\omega}_{c}})^{+}}$ replaced by $\delta_{\{v\geq-\beta\}}$ and $\delta_{\mathcal{M}({\overline{\omega}_{c}})^{+}}$ replaced by $\beta\|{\cdot}\|_{\mathcal{M}({\overline{\omega}_{c}})}+\delta_{\mathcal{M}({\overline{\omega}_{c}})^{+}}$ , we obtain the modified optimality conditions

[TABLE]

Again setting $\beta=0$ , we recover (OS). However, since the last relation can no longer be interpreted pointwise, a sparsity property of $u_{\beta}$ does not follow directly.

6 Numerical solution

The numerical solution is based on the conforming discretization of $\mathcal{M}({\overline{\omega}_{c}})$ introduced in [Clason:2012], which we briefly recall. The starting point is to replace $S:\mathcal{M}({\overline{\omega}_{c}})\to L^{2}({\omega_{o}})$ by its finite element semidiscretization $S_{h}:\mathcal{M}({\overline{\omega}_{c}})\to Y_{h}$ , where $Y_{h}\subset L^{2}({\omega_{o}})$ is a finite-dimensional space spanned by the usual continuous piecewise linear nodal basis (“hat”) functions attached to the vertices $\{x_{j}\}_{j=1}^{N}$ of a triangulation of $\overline{\Omega}$ . We then consider the semidiscrete optimal control problem

[TABLE]

Existence of an optimal control $\bar{u}$ can be shown as in Section 3. Although the optimal state $\bar{y}_{h}=S_{h}\bar{u}$ is unique, this is no longer the case for the control due to the finite number of observations. However, there is a unique $\bar{u}_{h}\in\mathcal{M}({\overline{\omega}_{c}})$ with $\bar{y}_{h}=S_{h}(\bar{u}_{h})$ that can be represented as a linear combination of Dirac measures concentrated on the vertices $x_{j}$ contained in ${\overline{\omega}_{c}}$ ; see [Clason:2012, Theorem 3.2]. We can thus restrict the minimization in (Ph) to the set $U_{h}$ of such linear combinations. In this sense, this approach is related to a discretization method introduced in [Winther:1978] for unconstrained linear-quadratic problems and also to the variational discretization of control-constrained problems of [Hinze2005].
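The practical consequence of representing the control by Dirac measures at the vertices can be illustrated in one space dimension. The following is a minimal sketch, assuming a 1D grid and hypothetical helper names (`hat`, `dirac_load_vector`); it is not taken from the implementation accompanying the paper. Since a nodal hat function satisfies $e_{i}(x_{j})=1$ for $i=j$ and $0$ otherwise, pairing $\sum_{j}u_{j}\delta_{x_{j}}$ with the basis returns the coefficient vector itself, so no mass matrix appears on the control side.

```python
def hat(i, nodes, x):
    """Continuous piecewise linear nodal basis function e_i evaluated at x."""
    xi = nodes[i]
    if i > 0 and nodes[i - 1] <= x <= xi:
        return (x - nodes[i - 1]) / (xi - nodes[i - 1])
    if i < len(nodes) - 1 and xi <= x <= nodes[i + 1]:
        return (nodes[i + 1] - x) / (nodes[i + 1] - xi)
    return 0.0


def dirac_load_vector(coeffs, nodes):
    """Pair u_h = sum_j coeffs[j] * delta_{nodes[j]} with every hat function.

    Entry i is <u_h, e_i> = sum_j coeffs[j] * e_i(nodes[j]).
    """
    n = len(nodes)
    return [sum(coeffs[j] * hat(i, nodes, nodes[j]) for j in range(n))
            for i in range(n)]


def total_mass(coeffs):
    """Total variation norm of a nonnegative combination of Dirac measures."""
    return sum(coeffs)
```

With `nodes = [0.0, 0.25, 0.5, 0.75, 1.0]` and `coeffs = [0.0, 2.0, 0.0, 0.5, 0.0]`, the load vector returned by `dirac_load_vector` equals `coeffs`, and nonnegativity of the measure is equivalent to componentwise nonnegativity of the coefficients.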

This allows expressing Problem (Ph) purely in terms of the expansion coefficients $\vec{u}$ of $\bar{u}_{h}$ and $\vec{y}$ of $\bar{y}_{h}$ . Using that $\bar{u}_{h}\in\mathcal{M}({\overline{\omega}_{c}})^{+}$ if and only if $\vec{u}\geq 0$ componentwise and applying the Fenchel duality theorem as in Corollary 4.10 (all finite-dimensional operators being bounded) yields the fully discrete optimality conditions

[TABLE]

where $A_{h}$ denotes the stiffness matrix corresponding to the differential operator $A$ , $M_{h}$ the restricted mass matrix on the observation domain ${\omega_{o}}$ , and $B_{h}^{T}$ the discrete restriction operator to the components of $\vec{p}$ corresponding to vertices contained in ${\overline{\omega}_{c}}$ . (Note the lack of mass matrix for the discrete state equation.) Since $\mathbb{R}^{N}$ is a Hilbert space, we can reformulate the last relation in (OSh) using resolvent calculus similarly as in Section 5 as

[TABLE]

for any $\alpha>0$ ; see also [Kunisch:2008a, Theorem 4.41]. (Comparing this relation with the last relation in (OSα), we remark that the only difference is the presence of $\vec{u}$ on the right-hand side.) In particular, for $\alpha=1$ we obtain

[TABLE]

where the $\max$ is to be understood componentwise.
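The equivalence behind this reformulation is the standard one between a complementarity condition and a fixed-point equation involving $\max$. The following is a minimal sketch in which `q` is a generic placeholder for the discrete multiplier (in the paper it corresponds to $B_{h}^{T}\vec{p}$ up to a sign convention that is not reproduced here) and `gamma` plays the role of the free parameter; both function names are hypothetical. It checks that $u\geq 0$, $q\geq 0$, $u_{i}q_{i}=0$ holds exactly when $u-\max(0,u-\gamma q)=0$, for any $\gamma>0$.

```python
def complementarity_residual(u, q, gamma=1.0):
    """Componentwise residual u - max(0, u - gamma*q)."""
    return [ui - max(0.0, ui - gamma * qi) for ui, qi in zip(u, q)]


def is_complementary(u, q, tol=0.0):
    """Direct check of u >= 0, q >= 0 and u_i * q_i = 0 for all i."""
    return all(ui >= -tol and qi >= -tol and abs(ui * qi) <= tol
               for ui, qi in zip(u, q))
```

For example, `u = [2.0, 0.0, 0.0]` and `q = [0.0, 3.0, 0.0]` give a zero residual for every positive `gamma`, whereas `u = [1.0, 0.0]` and `q = [1.0, -1.0]` violate both the sign condition and complementarity and produce a nonzero residual.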

It is well-known that the $\max$ operator is semismooth on $\mathbb{R}^{N}$ , with Newton derivative at $\vec{v}$ in direction $\vec{h}$ given componentwise by

[TABLE]

and that system (OSh) therefore can be solved by a superlinearly convergent semismooth Newton method; see [Kunisch:2008a, Ulbrich:2002a]. To account for the local convergence of Newton methods, we compute a starting point by solving a sequence of discrete regularized problems analogous to Section 5. Specifically, we add for $\alpha>0$ the $\ell_{2}$ penalty $\frac{\alpha}{2}|\vec{u}|_{2}^{2}$ and proceed as in Section 5 to obtain

[TABLE]

Since the last relation is explicit, we can eliminate $\vec{u}$ and apply a semismooth Newton method to the reduced system, starting with $\alpha=1$ and successively reducing $\alpha$ , taking for each $\alpha$ the previous solution as starting point.
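The continuation strategy can be sketched on a small dense model problem. The following is a simplified stand-in and not the reduced system of the paper: it assumes a hypothetical matrix `K` in place of the discrete control-to-observation map, works directly with the control vector instead of eliminating it in favor of state and adjoint, and solves $\min_{u\geq 0}\tfrac12|Ku-y_{d}|_{2}^{2}+\tfrac{\alpha}{2}|u|_{2}^{2}$ by a semismooth Newton iteration on $F(u)=u-\max(0,u-\gamma(Qu-b))$ with $Q=K^{T}K+\alpha I$ and $b=K^{T}y_{d}$. The Newton derivative of $\max(0,\cdot)$ is taken as $1$ where the argument is positive and $0$ otherwise. All function names are hypothetical.

```python
def solve_linear(A, b):
    """Gaussian elimination with partial pivoting for small dense systems."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        piv = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[piv] = M[piv], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = M[i][n] - sum(M[i][k] * x[k] for k in range(i + 1, n))
        x[i] = s / M[i][i]
    return x


def ssn_nonneg_qp(Q, b, u0, gamma=1.0, tol=1e-12, max_iter=50):
    """Semismooth Newton for F(u) = u - max(0, u - gamma*(Q u - b)) = 0."""
    n = len(b)
    u = u0[:]
    for _ in range(max_iter):
        r = [sum(Q[i][j] * u[j] for j in range(n)) - b[i] for i in range(n)]
        v = [u[i] - gamma * r[i] for i in range(n)]
        F = [u[i] - max(0.0, v[i]) for i in range(n)]
        if max(abs(f) for f in F) <= tol:
            break
        # Newton derivative: G = I - D (I - gamma*Q), D_ii = 1 if v_i > 0.
        G = [[(1.0 if i == j else 0.0)
              - (1.0 if v[i] > 0 else 0.0)
              * ((1.0 if i == j else 0.0) - gamma * Q[i][j])
              for j in range(n)] for i in range(n)]
        d = solve_linear(G, [-f for f in F])
        u = [u[i] + d[i] for i in range(n)]
    return u


def continuation(K, yd, alphas):
    """Solve the regularized problems for decreasing alpha with warm starts."""
    m, n = len(K), len(K[0])
    KtK = [[sum(K[k][i] * K[k][j] for k in range(m)) for j in range(n)]
           for i in range(n)]
    b = [sum(K[k][i] * yd[k] for k in range(m)) for i in range(n)]
    u = [0.0] * n
    for alpha in alphas:
        Q = [[KtK[i][j] + (alpha if i == j else 0.0) for j in range(n)]
             for i in range(n)]
        u = ssn_nonneg_qp(Q, b, u)  # previous solution is the starting point
    return u
```

For `K` equal to the $2\times 2$ identity, `yd = [1.0, -1.0]` and `alphas = [1.0, 0.1, 0.01]`, the iterates are $(1/(1+\alpha),0)$ for each $\alpha$, so the second component stays on the constraint while the first approaches the target as $\alpha$ decreases. The warm start matters more in cases where $K^{T}K$ is singular, since then only the regularized systems are guaranteed to be uniquely solvable.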

7 Numerical examples

We illustrate the nature of the generalized measure-space controls with numerical examples for the Laplace equation on the unit square with homogeneous Dirichlet conditions, i.e., we take $\Omega=[-1,1]^{2}\subset\mathbb{R}^{2}$ and $A=-\Delta$ . The domain is discretized using the standard uniform triangulation arising from $256\times 256$ equidistributed nodes. The optimal controls for the discretized problem are computed using a MATLAB implementation of the approach described in Section 6, which can be downloaded from https://github.com/clason/positivecontrol.
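To make the size of this discretization concrete, the grid can be generated as follows. This is a minimal sketch under two assumptions that the text does not state: the $256\times 256$ nodes include the boundary of $[-1,1]^{2}$, and each grid cell is split along the same diagonal. The function name is hypothetical and the code is unrelated to the MATLAB implementation.

```python
def uniform_square_mesh(n=256, a=-1.0, b=1.0):
    """Uniform triangulation of [a, b]^2 from an n-by-n grid of nodes."""
    h = (b - a) / (n - 1)  # mesh width
    nodes = [(a + i * h, a + j * h) for j in range(n) for i in range(n)]
    triangles = []
    for j in range(n - 1):
        for i in range(n - 1):
            k = j * n + i  # lower-left node of the current cell
            # Split the cell along the diagonal from lower left to upper right.
            triangles.append((k, k + 1, k + n + 1))
            triangles.append((k, k + n + 1, k + n))
    return h, nodes, triangles
```

Under these assumptions, the default call yields $N=65536$ nodes, $130050$ triangles, and mesh width $h=2/255$; with homogeneous Dirichlet conditions, $254^{2}=64516$ of the nodes are interior.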
