Error estimates for Dirichlet control problems in polygonal domains
Thomas Apel, Mariano Mateos, Johannes Pfefferer, Arnd R\"osch

TL;DR
This paper provides improved error estimates for finite element approximations of Dirichlet boundary control problems on polygonal domains, including non-convex cases, with detailed analysis of convergence rates.
Contribution
It introduces new error estimates that improve upon existing results and extends analysis to non-convex polygonal domains for Dirichlet control problems.
Findings
Enhanced convergence rates for control variables
Error estimates applicable to non-convex domains
Analysis of unconstrained and constrained control problems
Abstract
The paper deals with finite element approximations of elliptic Dirichlet boundary control problems posed on two-dimensional polygonal domains. Error estimates are derived for the approximation of the control and the state variables. Special features of unconstrained and control constrained problems as well as general quasi-uniform meshes and superconvergence meshes are carefully elaborated. Compared to existing results, the convergence rates for the control variable are not only improved but also fully explain the observed orders of convergence in the literature. Moreover, for the first time, results in non-convex domains are provided.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Error estimates for Dirichlet control problems in polygonal domains
††thanks: The project was supported by DFG through the International Research Training Group IGDK 1754 Optimization and Numerical Analysis for Partial Differential Equations with Nonsmooth Structures. The second author was partially supported by the Spanish Ministerio Español de Economía y Competitividad under research project MTM2014-57531-P
Thomas Apel Institut für Mathematik und Bauinformatik, Universität der Bundeswehr München,85577 Neubiberg, Germany. [email protected]
Mariano Mateos Departamento de Matemáticas, Universidad de Oviedo, 33203, Gijón, Spain. [email protected]
Johannes Pfefferer Lehrstuhl für Optimalsteuerung, Technische Universität München, 85748 Garching bei München, Germany. [email protected]
Arnd Rösch Fakultät für Mathematik, Universtät Duisburg-Essen, 45127 Essen, Germany. [email protected]
Abstract
The paper deals with finite element approximations of elliptic Dirichlet boundary control problems posed on two-dimensional polygonal domains. Error estimates are derived for the approximation of the control and the state variables. Special features of unconstrained and control constrained problems as well as general quasi-uniform meshes and superconvergence meshes are carefully elaborated. Compared to existing results, the convergence rates for the control variable are not only improved but also fully explain the observed orders of convergence in the literature. Moreover, for the first time, results in non-convex domains are provided.
keywords:
optimal control, boundary control, Dirichlet control, nonconvex domain, finite elements, error estimates, superconvergence meshes
**AMS subject classification **65N30, 65N15, 49M05, 49M25
1 Introduction
In this paper we will study the finite element approximation of the control problem
[TABLE]
where is the very weak solution of the state equation
[TABLE]
the domain is bounded and polygonal, is its boundary, and are real constants, and is a function whose precise regularity will be stated when necessary. We assume that and comment on the opposite case in Remark 5.4. Abusing notation, we will allow the case and to denote the absence of one or both of the control constraints.
First order optimality conditions read as (see [1, Lemma 3.1])
Lemma 1.1**.**
Suppose . Then problem (P) has a unique solution with related state and adjoint state . The following optimality system is satisfied:
[TABLE]
The variational inequality (1.2a) is equivalent to
[TABLE]
where denotes the pointwise projection on the interval .
The aim of this paper is to investigate a finite element solution of the system (1.2a)–(1.2c), in particular to derive discretization error estimates. The precise description of the regularity of the solution of the first order optimality system is an important ingredient of such estimates. They were proven in our previous paper [1]; we recall these results in Section 2. There were two interesting observations which we may illustrate in the following example.
Example 1.2**.**
Consider the L-shaped domain. The angle leads in general to a singularity of type in the solution of the adjoint equation; the regularity can be characterized by with . Hence, the control has a -singularity in the unconstrained case, for all . In the constrained case, however, the control is in general constant in the vicinity of the singular corner since the normal derivative of the adjoint state has a pole there, we get for all . This regularity is determined by the larges convex angle and by the kinks due to the constraints.
Unfortunately, this is not the whole truth. In exceptional cases, e. g. when the data enjoy certain symmetry, the leading singularity of type may not appear in the adjoint state. Instead, the solution may have a -singularity whose normal derivative has a -singularity which is not flattened by the projection . The control is less regular, for all . See Example 3.6 in [1].
Hence, dealing with these exceptional cases is not fun but necessary. If in the unconstrained case a stress intensity factor vanishes , i.e., the leading singularity does not occur, then the convergence result is still true, one may only see a better convergence in numerical tests. See Figure 3, right hand side and Remark 4.8. However, in the constrained case, the situation is the opposite. The exceptional case leads to the worst-case estimate. To deal with the “worst-case” and the “usual-case” in an unified way, we introduce in (2.4) some numbers related to the singular exponents.
We distinguish two cases for the investigation of the discretization errors. After proving a general result in Section 3 we study the unconstrained case in Section 4 and the constrained case in Section 5. We focus on quasi-uniform meshes and distinguish general meshes and certain superconvergence meshes. In order not to overload the present paper, we postpone the study of graded meshes to [2]. The numerical tests in Section 6 confirm the theoretical results.
The study of error estimates for Dirichlet control problems posed on polygonal domains can be traced back to [9], where a control constrained problem governed by a semilinear elliptic equation posed in a convex polygonal domain is studied. An order of convergence of is proved for all , where is the largest interior angle, in both the control and the state variable. Later, in [18], it is proven that for unconstrained linear problems posed on convex domains, the state variable exhibits a better convergence property. The corresponding proof is based on a duality argument and estimates for the controls in weaker norms than . However, to the best of our knowledge, the argumentation is restricted to unconstrained problems. For the error of the controls in , the order shown in [9] is not improved.
Nevertheless, the regularity of the control and the existing numerical experiments, see [18, 17], suggested that for the control variable the order should be greater: for all if one uses standard quasi-uniform meshes, and for all if one uses certain quasi-uniform meshes which allow for superconvergence effects, see Definition 4.5. Our main results, Theorems 4.1 and 5.3, fully explain the observed orders of convergence in the literature for the control variable, improve existing results for the state variable in constrained linear-quadratic problems posed in convex domains, and provide the first available results in nonconvex domains.
2 Notation and regularity results
Let us denote by the number of sides of and its vertexes, ordered counterclockwise. For convenience denote also and . We will denote by the side of connecting and , and by the angle interior to at , i.e., the angle defined by and , measured counterclockwise. Notice that . We will use as local polar coordinates at , with and the angle defined by and the segment . In order to describe the regularity of the functions near the corners, we will introduce for every a positive number and an infinitely differentiable cut-off function such that the sets
[TABLE]
satisfy for all and if and in the set , in the set .
For every we will call the in general leading singular exponent associated with the operator corresponding to the corner . For the Laplace operator it is well known that . Since in general the regularity of the solution of a boundary value problem depends on the smallest singular exponent, it is customary to denote
[TABLE]
Our main estimates are for data for some . To get these estimates it is key to use the sharp regularity results of the optimal control, state and adjoint state provided in [1]. For both the control and the state it is enough to know the Hilbert Sobolev-Slobodetskiĭ space they belong to, but for the adjoint state we will need to know with some more detail the development in terms of powers of the singular exponents. To write this development, we must proceed in two steps in order to be able to define the effectively leading singularity in each corner.
Our first result concerns the regularity of the adjoint state and is a consequence of [1, Theorem 3.2 and Theorem 5.1]. For , and we define
[TABLE]
Lemma 2.1**.**
Suppose . Let be the optimal adjoint state, solution of (1.2c). Then, there exist a unique function and unique real numbers , for all for constrained problems and for unconstrained problems, such that
[TABLE]
Note that in convex domains such that we obtain for constrained as well as for unconstrained problems the same regularity of the optimal adjoint state. However, in non-convex domains, the control and hence the state, as part of the right hand side of the adjoint equation, may be unbounded in the unconstrained case, which leads to the restriction for the regularity of . Moreover, it may happen that the effectively leading singularity corresponding to a corner is not the first one. This means that the associated coefficient in the asymptotic representation (2.3) is equal to zero. However, this will be of interest only for constrained problems in case of nonconvex corners , i.e., . To be able to cover this, we define the numbers
[TABLE]
for each corner. In addition, we introduce
[TABLE]
In convex domains, will determine the regularity of both the optimal control and state. This holds for unconstrained as well as for constrained problems. However, in nonconvex domains, different cases may appear. If we have no control constraints then the regularity of the optimal control and state will again be determined by . If the problem is constrained then in the vicinity of any corner , where the coefficient of the corresponding first singularity is unequal to zero, the optimal control is flattened there due to the projection formula and consequently smooth. This is the usual case. If then the optimal control in the neighborhood of such a corner is at least as regular as the normal derivative of the corresponding second singular function. In the control constrained case, will determine the regularity of the optimal control, at least in a worst case sense. The regularity of the optimal state may depend on as well since singular terms may occur within its asymptotic representation independent of the adjoint state.
For unconstrained problems the following regularity result holds, see [1, Corollary 5.3, Corollary 4.2, Theorem 3.4].
Lemma 2.2** (unconstrained case).**
Suppose and for all . Then
[TABLE]
For constrained problems, we can improve this result, see [1, Corollary 4.2, Theorem 3.4].
Lemma 2.3** (control constrained case).**
Suppose , for all . Assume that the optimal control has a finite number of kink points. Then
[TABLE]
We also have the following result from [1, Proof of Theorem 3.4].
Lemma 2.4**.**
Suppose and . If and , then one of the control constraints is active near the corner , i.e., there exists such that for with either or .
Finally, we can write the representation of the adjoint state for regular enough data. For , and we will also need
[TABLE]
The following result is a consequence of [1, Corollary 4.4].
Lemma 2.5**.**
Suppose that is convex or , and that with . Then, for such that
[TABLE]
and
[TABLE]
there exist a unique function and unique real numbers and , such that
[TABLE]
Notice that the coefficients that appear in both expansions in Lemmata 2.1 and 2.5 coincide, due to the uniqueness of the expansion. In the expansion of Lemma 2.5 new terms appear that belong to for all but not to for satisfying the conditions in Lemma 2.5.
3 A general discretization error estimate
In this section we will present a general discretization error estimate in Theorem 3.2. The terms in this general estimate have to be estimated in particular cases. This work will be done in later sections.
For the discretization, consider a family of regular triangulations depending on a mesh parameter in the sense of Ciarlet [11]. Notice, that a triangulation of the boundary is naturally induced by . We assume that the space is the space of conforming piecewise linear finite elements. The space is the space of piecewise linear functions generated by the trace of elements of on the boundary . We denote the subspace of with vanishing boundary values by .
We also introduce the discrete solution operator . For the function is defined as the unique solution of
[TABLE]
We emphasize that on the boundary coincides with the -projection of on . Thus we get on for . Notice as well that (3.1) is not a conforming discretization of the very weak formulation of the state equation. However, according to [3, 8], its applicability is guaranteed.
In our discretized optimal control problem we aim to minimize the objective function
[TABLE]
The first order optimality conditions of this problem were derived in [9] and are stated in the next lemma.
Lemma 3.1**.**
Problem has a unique solution , with related discrete state and adjoint state . The following discrete optimality system is satisfied
[TABLE]
where the discrete normal derivative is defined as the unique solution of
[TABLE]
An important tool in the numerical analysis is the construction of a discrete control which interpolates in a certain sense, see Lemma 4.2 and Lemma 5.6, and satisfies
[TABLE]
If the optimal control with then we use a quasi-interpolant introduced by Casas and Raymond in [9]: Denote the boundary nodes of the mesh by , , and let , , be the nodal basis of . We set
[TABLE]
and define a control by its coefficients
[TABLE]
According to [9, Lemma 7.5] the function belongs to . Moreover, it is constructed such that on the active set, and it fulfills (3.4).
If with , we use a modification of the standard Lagrange interpolant of , again denoted by , which is defined by its coefficients as follows
[TABLE]
cf. [10, Section 2]. Of course, if we consider control problems without control constraints, that is , the interpolant is just the Lagrange interpolant. In case of control bounds , in order to get an unique definition of , we need to assume that on each element only one control bound is active. However, due to the Hölder continuity of , which we have for with , there exists a mesh size such that for all the above definition of the interpolant is unique. Obviously, this interpolant belongs to . Moreover, it satisfies (3.4) by construction. Indeed, whenever , we have .
As already announced, we conclude this section by stating a general error estimate for the control and state errors which will serve as a basis for the subsequent error analysis.
Theorem 3.2**.**
For the solution of the continuous and the discrete optimal control problem we have
[TABLE]
Proof.
First, let us define the intermediate error . Then, we obtain
[TABLE]
To deal with the third term, we take into account the continuity of :
[TABLE]
cf. [3, Lemma 2.3 and Corollary 3.3]. Accordingly, we only need estimates for the second and fourth term in (3.8). We begin with estimating the second one, but as we will see this also yields an estimate for the fourth term. There holds
[TABLE]
Next, we consider the second term of (3.10) in detail. By adding the continuous and discrete variational inequalities (1.2a) and (3.2a) with and , respectively, we deduce
[TABLE]
Rearranging terms and using (3.4) leads to
[TABLE]
Integration by parts (cf. [12, Lemma 3.4]) using on , (3.3), (1.2c) and (3.1) yield
[TABLE]
By collecting the estimates (3.10) and (3.11) we obtain
[TABLE]
From the Young inequality we can deduce
[TABLE]
Finally, the assertion is a consequence from (3.8), (3.9) and (3.13). ∎
4 Problems without control constraints
In the rest of the paper, we will always assume that is a quasi-uniform family of meshes. However, if the underlying mesh has a certain structure then it is possible to improve the error estimates. These special quasi-uniform meshes are called superconvergence meshes or -irregular meshes; for the precise definition we refer to Definition 4.5. The main result of this section is the following one.
Theorem 4.1**.**
Suppose that either and , or for some . Then it holds
[TABLE]
where is equal to one for and equal to zero else. If, further, is -irregular according to Definition 4.5, then
[TABLE]
For the proof, we are going to estimate the three terms that appear in the general estimate of Theorem 3.2. Whereas the first two terms in (3.7) can be estimated by standard techniques, the third one needs special care. Analogously to the derivation of (3.11), this term can formally be rewritten as
[TABLE]
where is defined as in (3.3) just by replacing with and with the Ritz-projection of on . Thus, we are interested in the error between the normal derivative of the adjoint state and the corresponding discrete normal derivative of its Ritz-projection. In order to estimate the above term, we will pursue two different strategies. The first one relies on local and global -discretization error estimates. In case of general quasi-uniform meshes, this will result in a convergence order of for all such that and , where is equal to one for and equal to zero else. The second strategy will rely on special super-convergence meshes as introduced in [6]. The idea to use such meshes in the context of Dirichlet boundary control problems originally stems from [13]. In contrast to the setting in that reference, we are not concerned with smoothly bounded domains but with polygonal domains. For that reason we need to extend the corresponding estimates to that case, that is, we have to deal with less regular functions due to the appearance of corner singularities. This will yield an approximation rate of with , which results in an improvement for domains with interior angles less than .
Lemma 4.2**.**
Suppose for all . Then we have
[TABLE]
Proof.
We know from Lemma 2.2 that the control satisfies for all . If , we choose as defined in (3.5), and the estimate for the control follows from [9, Eq. (7.10)] by setting with . If , we have due to the Sobolev embedding theorem. Thus, the modified Lagrange interpolant from (3.6) is well-defined. Actually, in the present case, is just the Lagrange interpolant. As a consequence, the error estimate for the control is given by a standard estimate for the Lagrange interpolant.
Again from Lemma 2.2, the optimal state satisfies , for all . Thus, for all if . By the Aubin–Nitsche method we obtain
[TABLE]
cf. for instance [7]. Since can be chosen greater than , we have the desired result in case that . For we do not have such that standard techniques for estimating finite element errors fail. However, in this case we can directly refer to Remark 5.4 of [3]. ∎
Lemma 4.3**.**
Suppose that either and , or for some . Then there is the estimate
[TABLE]
where is equal to one for and equal to zero else.
Proof.
As above, we denote by the operator that maps a function of to its Ritz-projection in . In addition, we introduce the extension operator which extends a function belonging to to one in by zero. Using the norm equivalence in finite dimensional spaces on a reference domain we easily infer for any and
[TABLE]
Since is discrete harmonic, we obtain together with the orthogonality properties of the Ritz-projection the identity
[TABLE]
where we employed that belongs to .
Now, we distinguish the three cases , and for .
In the first one, we know from Lemma 2.5 that the optimal adjoint state belongs to (for some ), which is continuously embedded in . Consequently, a global -discretization error estimate from e.g. [23, 14], and (4.4) yield
[TABLE]
which represents, together with (4.5) and the embedding , the desired result for , .
Next, we consider the case for . For simplicity, we assume that the domain has only one corner with an interior angle greater or equal to . However, the proof extends to the general case in a natural way. In the following, that corner is located at the origin. Furthermore, we denote its interior angle by , the distance to that corner by , and the corresponding leading singular exponent by . According to Lemma 2.5, the optimal adjoint state admits the splitting
[TABLE]
where belongs to with some . Combining (4.5) and (4.7) yields the identity
[TABLE]
For the latter term, we can argue as in (4.6) to show first order convergence, i.e.,
[TABLE]
In order to estimate the singular term, we decompose the neighborhood of the critical corner in subdomains which are defined by
[TABLE]
We set the radii equal to and choose the index in such a way that with a constant . Below, this constant is chosen large enough such that on the one hand local -finite element error estimates from [14, Corollary 1] are applicable on the strips , see (4.12), and on the other hand the validity of the weighted error estimate (4.15) is guaranteed. Moreover, we set
[TABLE]
and
[TABLE]
with the obvious modifications for and . Using this kind of covering, we obtain
[TABLE]
Arguing as in (4.4), we get
[TABLE]
Having chosen the constant large enough, local -error estimates from [14, Corollary 1] yield
[TABLE]
where denotes the Lagrange interpolant of . Notice, according to [14, Remark 2], this inequality is only valid for any if the domain is non-convex, i.e. . Now, let , which possesses the properties for and . By combining (4.10)–(4.12), we infer
[TABLE]
The second derivatives of the singular part behave like for and like if , cf. Lemma 2.5. Thus, by using standard interpolation error estimates (on the strips ), we get for , hence for ,
[TABLE]
From [22, Corollary 3.62] (setting and there) we know that for large enough there holds
[TABLE]
Notice that in that reference problems with Neumann boundary conditions are considered. However, the proof for the present problem is just a word by word repetition. Next, the Cauchy-Schwarz inequality and basic integration yield
[TABLE]
By collecting the results from (4.13)–(4.16), we obtain
[TABLE]
which yields together with (4.9), (4.8) and (4.5) the assertion in the second case.
Finally, we consider the case for . Similar to the foregoing considerations, we assume that only the angle is greater or equal to and hence . According to (4.5), the Cauchy-Schwarz inequality, (4.4), and a standard finite element error estimate, we obtain
[TABLE]
for all . This ends the proof. ∎
Remark 4.4**.**
Related results to those of Lemma 4.3, which are established by using similar techniques, can be found in [4, 15, 19, 22].
According to the previous lemma, the critical term in the general estimate (3.7) converges with an order close to one provided that the interior angles are less . However, it is possible to improve the convergence rate if we assume a certain structure of the underlying mesh. The following definition for superconvergence meshes can be found in [6]. Those have been used in [13] in the context of Dirichlet boundary control problems in the case of smoothly bounded domains.
Definition 4.5**.**
The triangulation is called to be -irregular if the following conditions hold:
- (a)
The set of interior edges of the triangulation is decomposed into two disjoint sets and which fulfill the following properties:
- •
For each , let and denote the two elements of the triangulation that share this edge . Then the lengths of any two opposite edges of the quadrilateral differ only by .
- •
. 2. (b)
The set of the boundary vertexes is decomposed into two disjoint set and which satisfy the following properties:
- •
For each vertex , let and be the two boundary edges sharing this vertex as an endpoint. Denote by and the elements having and , respectively, as edges and let and be the corresponding unit tangents. Furthermore, take and as one pair of corresponding edges, and make a clockwise traversal of and to define two additional corresponding edge pairs. Then and the lengths of any two corresponding edges only differ by .
- •
with a constant independent of .
Next, let us recall a result from [13, Lemma 5.2], which leads us to Lemma 4.7.
Lemma 4.6**.**
Let be any polygonal domain with boundary . Suppose that the triangulation of is irregular and let for some . Then for any there holds
[TABLE]
where denotes the piecewise linear Lagrange interpolant.
Lemma 4.7**.**
Suppose that either and , or for some . Suppose further that is a family of -irregular meshes. Then it holds
[TABLE]
Proof.
First we observe that
[TABLE]
since represents the discrete harmonic extension operator and has zero boundary conditions. If at least one interior angle is greater or equal to , we have and therefore . Consequently, there is no advantage in taking a superconvergence mesh and we can apply the result for quasi-uniform meshes. If for , and hence , we can directly apply the results of Lemma 4.6 since for some according to Lemma 2.5. For these reasons, we focus in the following only on the case . We are in this case if the largest interior angle, denoted by in the following, fulfills . For simplicity, we assume as in the proof of Lemma 4.3 that the remaining angles are less than . However, the proof again extends to the general case in a natural way. According to Lemma 2.5 we have that
[TABLE]
where belongs to with some . For the regular part we can again employ Lemma 4.6 to obtain the order . The singular part behaves at worst like or like , respectively, if . As before, we would like to use Lemma 4.6 to get the corresponding estimate. For that purpose, we decompose the domain into two disjoint subsets and . The set consists of the elements of the triangulation which have contact to the corner , while . Since the triangulation of is irregular, the triangulation of is irregular, either. Applying Lemma 4.6 yields for any
[TABLE]
Since the number of elements in is bounded independently of and , we have that . Using this fact, the Hölder inequality, and a discrete Sobolev inequality, we obtain
[TABLE]
Define as the zero extension operator as in the proof of Lemma 4.3. Since denotes the discrete harmonic extension of , we infer
[TABLE]
Using this in combination with the Poincaré inequality yields
[TABLE]
where we used (4.4) in the last step.
Next, we observe that the third derivatives of behave like such that we can conclude for some arbitrary (depending on )
[TABLE]
since . Collecting (4.18)–(4.21) yields
[TABLE]
which represents the desired result for the subdomain . Finally, for the subdomain , we conclude by inserting a standard interpolation error estimate and the a priori estimate for the operator as before that
[TABLE]
After observing that the second derivatives of behave like or , respectively, if , and that , we get the desired result for the subdomain . ∎
Proof of Theorem 4.1.
The result is obtained from the general error estimate in Theorem 3.2 using the estimates in Lemmata 4.2, 4.3 and 4.7. ∎
Remark 4.8**.**
As we commented in the introduction, it is possible, though unlikely, that the coefficient of the leading singular exponent vanishes. In this case, we can replace the parameter in Theorem 4.1 by .
5 The control constrained case
This section is devoted to the numerical analysis of control constrained Dirichlet control problems. As we will see, the convergence rates in convex domains coincide with those for the unconstrained problems. More precisely, we will prove the following theorem.
Theorem 5.1**.**
Suppose that either and , or for some . Moreover, assume that the optimal control has a finite number of kink points. Then it holds
[TABLE]
where is equal to one for and equal to zero else. If, further, is -irregular according to Definition 4.5, then
[TABLE]
The proof of this theorem is postponed to Section 5.1. As already observed, this is exactly the result which we have proven in the unconstrained case. However, if the underlying domain is non-convex, the approximation rates in the control constrained case can be improved. In this regard, one of our results relies on a structural assumption on the discrete optimal control which we formulate next. Through this section we will shortly write
[TABLE]
Assumption 5.2**.**
There exists some such that for every , there exists independent of such that for all if .
Let us comment on Assumption 5.2. In Lemma 2.4 it was established that in the neighbourhood of a non-convex corner, the optimal control will normally be constant and either equal to the lower or the upper bound. Assumption 5.2 says that this property is inherited by the discrete optimal control.
One of our main results in the constrained case is now given as follows.
Theorem 5.3**.**
Suppose for some . Moreover, let either or Assumption 5.2 be satisfied, and assume that the optimal control has a finite number of kink points. Then there is the estimate
[TABLE]
where is equal to one for and equal to zero else. If further is irregular, then
[TABLE]
Remark 5.4**.**
We only consider the case . This is because it is known that for those corners such that we have that . In the case , the projection formula (1.3) implies that in a neighbourhood of , the optimal control will satisfy , and hence its regularity will be determined by that of the adjoint state. If , then the same projection formula implies that in a neighborhood of , will be equal to some of the control bounds. If we suppose, as in Assumption 5.2 that this property is inherited by the solutions of the discrete approximations, we have that the conclusions of Theorem 5.3 remain valid.
The proof of Theorem 5.3 is postponed to Section 5.1. Since and , we always have a convergence rate greater than . This is a real improvement compared the unconstrained case since in the latter it may happen that the convergence rates tend to zero as the largest interior angle tend to . However, one may ask for a justification of Assumption 5.2. In Lemma 5.10 we will see that there exist constants and greater than zero for all , and a constant such that
[TABLE]
Thus, we could relax Assumption 5.2 to an -dependent neighborhood of those corners with . Moreover, due to (5.3), it is even possible to show the following improved result in non-convex domains without any structural assumption on the discrete optimal control, i.e., we can always expect a convergence rate close to in non-convex domains.
Theorem 5.5**.**
Suppose for some , and assume that the optimal control has a finite number of kink points. Then it holds
[TABLE]
The proof of Theorem 5.5 is given in Section 5.2.
5.1 Proof of Theorems 5.1 and 5.3
The results of Theorem 5.1, and Theorem 5.3 for directly follow from the general error estimate given in Theorem 3.2, the estimates for the adjoint state provided in Section 4 in Lemmata 4.3 and 4.7 and the error estimates for the control and the state established below in Lemma 5.6.
Lemma 5.6**.**
Suppose for all and assume that the optimal control has a finite number of kink points. Then
[TABLE]
Proof.
The proof starts exactly following the lines of the proof of Lemma 4.2, using the regularity stated in Lemma 2.3. In this way, if we again obtain the desired estimate for , as defined in (3.5), from [9, Eq. (7.10)]. If , is given by (3.6). Since control constraints are now present, we have to derive error estimates for the modified Lagrange interpolant. To this end, let us consider two adjacent boundary elements and belonging to which are determined by the line segments and , respectively. Since we assume a finite number of kink points of due to the projection formula (1.3), we have to deal with the following situations (at least for small enough): First, no kink is contained in , second, there is exactly one kink of in due to the projection formula. In the first case, we have that coincides with the Lagrange interpolant on such that the desired estimate on these elements is obtained by standard discretization error estimates for the Lagrange interpolant employing the regularity results from Lemma 2.3, i.e.,
[TABLE]
with . In the second case, we can assume without loss of generality that , and . Thus, is equal to on . Using the regularity of the optimal control with from Lemma 2.3, we now estimate the interpolation error on each of the elements and . For the error on we obtain by means of the Hölder continuity of
[TABLE]
Next, recall that the nodal basis function associated with is denoted by . Then we deduce for the error on
[TABLE]
where we again used the Hölder continuity of . Since we assume a finite number of kink points, the desired interpolation error estimate for on in case that is just a combination of (5.5)–(5.7).
Since the optimal control belongs at least to , the optimal state is a weak solution such that we can rely on standard techniques for the derivation of the second estimate of the assertion. More precisely, by employing the regularity of with and with from Lemma 2.3, an application of a duality argument, cf. for instance [7], yields
[TABLE]
where . For the last two steps notice that and . ∎
Since , a straightforward application of Theorem 3.2, and Lemmata 5.6, 4.3 and 4.7 leads to an order of convergence identical to the one we have for unconstrained problems. Notice that Lemmata 4.3 and 4.7 can be used since bounds on the control do not play any role there. Thus, Theorem 5.1 and Theorem 5.3 for are proved.
For the results of Theorem 5.3, in case that and Assumption 5.2 is valid, we use the above error estimates for the control and the state, and we show in Lemmata 5.8 and 5.9 below how to improve the result for the adjoint state. Then an adaptation of the general error estimate, see Theorem 5.7, which we are going to prove next, can finally be used to combine these results. Let us define
[TABLE]
Moreover, let
[TABLE]
Under the structural Assumption 5.2 it is clear that , so we have the following modification of the general error estimate (3.7).
Theorem 5.7**.**
Suppose Assumption 5.2 holds. Then
[TABLE]
Proof.
Since due to Assumption 5.2, the result can be obtained in the same way as in the proof of Theorem 3.2 just by replacing
[TABLE]
in (3.12) by
[TABLE]
∎
Next, we are concerned with discretization error estimates for the critical term in the general estimate of Theorem 5.7. First, we deal with estimates for general quasi-uniform meshes. Afterwards we show improved estimates if we assume -irregular meshes.
Lemma 5.8**.**
Let for some . Then there is the estimate
[TABLE]
where is equal to one for and equal to zero else.
Proof.
To be able to localize the effects in the neighborhood of all corners with , we introduce a cut-off function which is equal to one in a fixed neighborhood of these corners and decays smoothly. In addition, we set . Then we infer for the quantity of interest
[TABLE]
For the first term on the right hand side of this inequality, we directly apply Lemma 4.3 to conclude
[TABLE]
where is equal to one for and equal to zero else, having in mind the regularity results of Lemma 2.5 for the adjoint state and noting that the singular terms coming from the corners with do not have any influence due to the cut-off function . To deal with the second term in (5.8), let denote the extension operator which extends a piecewise linear function on the boundary by zero to a function in . Thus, is equal to zero in for any . Moreover, let be the operator that maps a function in to its Ritz-projection in . Due the properties of the discrete harmonic extension and the Ritz-projection , we obtain
[TABLE]
By applying the Hölder inequality, local -discretization error estimates for the Ritz-projection from [14, Corollary 1], and (4.4), we obtain
[TABLE]
where . Having regard to the regularity results for the optimal adjoint state from Lemma 2.5 and by using standard interpolation error estimates and a standard finite element error estimate, we deduce
[TABLE]
which is valid for all such that and . Combining (5.8)–(5.14) ends the proof. ∎
Lemma 5.9**.**
Let for some and suppose further that is a family of -irregular meshes. Then it holds
[TABLE]
for all
Proof.
As before, we introduce the circular sectors
[TABLE]
For technical reasons we also need the circular sector
[TABLE]
Let the operators and be defined as in the proof of Lemma 5.8. Moreover, let be a smooth cut-off function which is equal to one in with . In addition, we choose such that which is possible without any restriction for small enough. We set . Analogously to the foregoing proof, we infer
[TABLE]
Observe that is equal to zero in a fixed neighborhood of all corners with . Consequently, Lemma 4.6, applied as in the proof of Lemma 4.7, yields for the latter term in (5.15)
[TABLE]
with such that and . By applying the Hölder inequality, local -discretization error estimates for the Ritz-projection from [14, Corollary 1], and (4.4), we obtain for the first term in (5.15)
[TABLE]
where we used that and are equal to zero in . Usual error estimates for the Ritz-projection and a standard embedding yield
[TABLE]
which is valid for all such that and . This ends the proof. ∎
Finally, an application of Theorem 5.7, and Lemmata 5.6, 5.8 and 5.9, yield the results of Theorem 5.3 where and Assumption 5.2 is satisfied.
5.2 Proof of Theorem 5.5
In this subsection we show the results of Theorem 5.5. That is, we show a convergence rate close to for the optimal controls and states in the constrained case if the domain is non-convex and even if the structural Assumption 5.2 does not hold. For that purpose, let us recall that denotes the corners of , is the set of boundary nodes of the mesh and is the basis of such that . Thus, every function can be written as
[TABLE]
By testing the discrete variational inequality appropriately, we deduce
[TABLE]
Lemma 5.10**.**
For each interior angle , where from (2.3) is unequal to zero, there are two constants and greater than zero such that
[TABLE]
for all nodes with .
Proof.
In the following we focus only on one non-convex corner . Without loss of generality let be greater than zero. Hence the normal derivative of is negative, and the lower bound of the control is active, and in the vicinity of this corner. We need to show that there are two constants and such that
[TABLE]
for all nodes with . According to [20, Theorem 3.4], we know that
[TABLE]
where the function is of the form
[TABLE]
where denotes the cut-off function introduced at the beginning of Section 2 and the function denotes a function which solves
[TABLE]
and denotes the commutator. According to Theorem 5.1, we deduce the existence of an constant such that for all there holds
[TABLE]
due to the assumption . Using this result, we will show that the singular part of the function which solves
[TABLE]
behaves like the singular part of . Indeed, admits the splitting
[TABLE]
according to Lemma 2.1. The regular part belongs to , at least for some , since belongs to due to the convergence result of Theorem 5.1. The singular part can be written as
[TABLE]
where the constant is greater than zero according to (5.16). Assuming that is already small enough such that on , we get by basic calculations
[TABLE]
where we used that and that is uniformly bounded in due to the embedding for . As before, let us denote by the operator which extends any function of to one in by zero. Also observe that is the Ritz-projection of . Then integration by parts, the definition of in (3.3) and (5.17) yield
[TABLE]
where we employed a standard discretization error estimate for the Ritz-projection, (4.4) and an inverse inequality in the last step. Now we proceed as in the proof of Lemma 4.3 between (4.13) and (4.17). Let and the subdomains be defined as in that proof and let the index be chosen such that . Assume that with a constant large enough such that local -error estimates for the Ritz-projection are applicable. Then those estimates of [14, Corollary 1] and (4.4) yield
[TABLE]
The second derivatives of the singular part behave like . Thus, by means of standard interpolation error estimates and the results of [22, Corollary 3.62], we infer
[TABLE]
Combining (5.18)–(5.21), we obtain
[TABLE]
Finally, we observe that . Thus, we are able to choose constants and such that
[TABLE]
if and
[TABLE]
if . This proves the assertion. ∎
Remark 5.11**.**
By using the Cauchy-Schwarz inequality, estimates for the Ritz-projection from [5, Theorem 5.1], (4.4) and an inverse inequality, we infer
[TABLE]
However, this is not enough to show that the discrete optimal control admits one of the control bounds in the direct vicinity of the corner , since then
[TABLE]
Now, we redefine the sets and by
[TABLE]
and
[TABLE]
and we set again
[TABLE]
Moreover, let . We have the following modification for the general error estimate
Theorem 5.12**.**
For the solution of the continuous and the discrete optimal control problem we have
[TABLE]
Note that the first term on the left hand side of (5.22) is a norm with respect to .
Proof.
We proceed as in the proof of Theorem 3.2. In contrast, we will test the optimality conditions with different functions. For that purpose, let us introduce and by
[TABLE]
Note that and are constant, even coincide, on and that is equal to at least for all with for according to Lemma 5.10. Next, we define the intermediate error , which is equal to zero in . Then, we obtain
[TABLE]
To deal with the third term, we take into account the continuity of :
[TABLE]
Accordingly, we only need estimates for the second and fourth terms in (3.8). We begin estimating the second one, but as we will see this also yields an estimate for the fourth term. There holds
[TABLE]
Next, we consider the second term of (5.25) in detail. By adding the continuous and discrete variational inequality with and , respectively, we deduce
[TABLE]
Rearranging terms and using (3.4) leads to
[TABLE]
Integration by parts (cf. [21, Theorem 3.1.1]), (3.3), (1.2c) and (3.1) yield
[TABLE]
By collecting the estimates (5.25) and (5.26) we obtain
[TABLE]
From the Young inequality we can deduce
[TABLE]
Finally, the assertion is a consequence from (5.23), (5.24) and (5.27). ∎
Finally, by observing that
[TABLE]
according to Lemma 5.10 and the uniform boundedness of in , we deduce the desired result of Theorem 5.5 by combining Theorem 5.12 and Lemmata 5.6 and 5.8.
6 Numerical experiments
The experiments have been performed with Matlab R2015a on an Intel(R) Core(TM) i7 CPU 870 @2.93 GHz with 16GB RAM on Windows 7 64 bits. All the scripts and functions have been programmed by us.
To build an example with exactly known solution , we just define and compute , such that in , on and . In general, it is not possible to compute exactly, so we will use its finite element approximation to compute an approximation of .
Since the aim of the experiment is to measure the order of convergence of the error in the control variable, we have solved the problems in two quasi-uniform families of nested meshes obtained by diadic refinement from a rough initial mesh. One of them is built such that it does not have the superconvergence property (see Figure 1), while the other is obtained using regular refinement, which results in a -irregular family which has the superconvergence property (see Figure 2). The finest mesh has between 1 million and 3.15 million nodes, depending on the geometry of the domain. Notice that these fine meshes induce boundary meshes that only have between 4 thousand and 7 thousand nodes only. To solve the optimization problem, we have used a semismooth Newton method; see [16] for the details.
In the examples where the optimal control is continuous, we measure the error at the mesh at level as
[TABLE]
where is the solution of and is the nodal Lagrange interpolation operator. If the exact solution is singular at the point , we simply approximate , for some small enough.
Since we are using a dyadic refinement strategy, we have that and we can measure the Experimental Order of Convergence at level as
[TABLE]
It is to be expected that converges to the Theoretical Order of Convergence () as , so for every problem we report on and compare with the corresponding .
Let denote the usual polar coordinates in and define as the interior of the convex hull of the set of points if and for . We will consider the following cases
- (a)
if , 2. (b)
if , 3. (c)
if , 4. (d)
if ,
where we have tested the value for , and the special case for . Straightforward calculations show that for all and some . Also for all . Hence, for an unconstrained problem for all , which implies that for all and therefore for all . If the problem is constrained, then and therefore for all and for some .
Notice that for the case we have that and for the case we have , so when we choose in the definition of and solve a constrained problem, the leading singular exponent to be taken into account should be, respectively, or . Nevertheless, the exact adjoint state has been chosen in such a way that in the first case and in the second case for , so for this example we need not take this into account.
We fix . For constrained problems, we will consider and . We choose so the asymptotic behavior of the error shows up for the mesh sizes used. If were two big, the problem would behave like an unconstrained one for our meshes; on the other hand, were too small, we would be approximating an optimal control very similar to a constant and the experimental orders of convergence would be too high for our meshes.
Graphs with the experimental results can be found in figures 3 and 4. It is remarkable that experimental results are quite in agreement with theoretical estimates.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] T. Apel, M. Mateos, J. Pfefferer and A. Rösch, On the regularity of the solutions of Dirichlet optimal control problems in polygonal domains, SIAM J. Control Optim. , 53 (2015), 3620–3641, URL http://dx.doi.org/10.1137/140994186 . · doi ↗
- 2[2] T. Apel, M. Mateos, J. Pfefferer and A. Rösch, Mesh grading for the numerical solution of Dirichlet control problems in polygonal domains, 2017, In preparation.
- 3[3] T. Apel, S. Nicaise and J. Pfefferer, Discretization of the Poisson equation with non-smooth data and emphasis on non-convex domains, Numer. Methods Partial Differential Equations , 32 (2016), 1433–1454.
- 4[4] T. Apel, J. Pfefferer and A. Rösch, Finite element error estimates on the boundary with application to optimal control, Mathematics of Computation , 84 (2015), 33–70.
- 5[5] C. Bacuta, J. Bramble and J. Xu, Regularity estimates for elliptic boundary value problems in Besov spaces, Mathematics of Computation , 72 (2003), 1577–1595.
- 6[6] R. Bank and J. Xu, Asymptotically exact a posteriori error estimators, part I: Grids with superconvergence, SIAM Journal on Numerical Analysis , 41 (2003), 2294–2312.
- 7[7] S. Bartels, C. Carstensen and G. Dolzmann, Inhomogeneous Dirichlet conditions in a priori and a posteriori finite element error analysis, Numerische Mathematik , 99 (2004), 1–24, URL http://dx.doi.org/10.1007/s 00211-004-0548-3 . · doi ↗
- 8[8] M. Berggren, Approximations of very weak solutions to boundary-value problems, SIAM J. Numer. Anal. , 42 (2004), 860–877 (electronic), URL http://dx.doi.org/10.1137/S 0036142903382048 . · doi ↗
