State-constrained control-affine parabolic problems I: first and second order necessary optimality conditions
M. Soledad Aronna, J. Fr\'ed\'eric Bonnans, Axel Kr\"oner

TL;DR
This paper establishes first and second order necessary optimality conditions for a control problem governed by a semilinear heat equation with integral state constraints, bilinear control terms, and a tracking cost functional.
Contribution
It introduces novel second order necessary conditions using alternative costates and quasi-radial critical directions for complex PDE control problems.
Findings
Derived second order necessary conditions for the control problem.
Provided an example demonstrating the applicability of the theoretical results.
Extended optimality conditions to problems with integral state constraints.
Abstract
In this paper we consider an optimal control problem governed by a semilinear heat equation with bilinear control-state terms and subject to control and state constraints. The state constraints are of integral type, the integral being with respect to the space variable. The control is multidimensional. The cost functional is of a tracking type and contains a linear term in the control variables. We derive second order necessary conditions relying on the concept of alternative costates and quasi-radial critical directions. The appendix provides an example illustrating the applicability of our results.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
State-constrained control-affine parabolic problems I: First and Second order necessary optimality conditions
M. Soledad Aronna
EMAp/FGV, Rio de Janeiro 22250-900, Brazil
,
Frédéric Bonnans
INRIA-Saclay and Centre de Mathématiques Appliquées, Ecole Polytechnique, 91128 Palaiseau, France
and
Axel Kröner
Weierstrass Institute for Applied Analysis and Stochastics, 10117 Berlin, Germany
Abstract.
In this paper we consider an optimal control problem governed by a semilinear heat equation with bilinear control-state terms and subject to control and state constraints. The state constraints are of integral type, the integral being with respect to the space variable. The control is multidimensional. The cost functional is of a tracking type and contains a linear term in the control variables. We derive second order necessary conditions relying on the concept of alternative costates and quasi-radial critical directions. The appendix provides an example illustrating the applicability of our results.
Key words and phrases:
optimal control of partial differential equations, semilinear parabolic equations, state constraints, second order analysis, control-affine problems
The first author was supported by FAPERJ, CNPq and CAPES (Brazil) and by the Alexander von Humboldt Foundation (Germany). The second author thanks the ‘Laboratoire de Finance pour les Marchés de l’Energie’ for its support. The second and third authors were supported by a public grant as part of the Investissement d’avenir project, reference ANR-11-LABX-0056-LMH, LabEx LMH, in a joint call with Gaspard Monge Program for optimization, operations research and their interactions with data sciences.
This is the first part of a work on optimality conditions for a control problem of a semilinear heat equation. More precisely, the full version, available at https://arxiv.org/abs/1906.00237v1, has been divided in two, resulting in the current manuscript (that corresponds to Part I) and https://arxiv.org/abs/1909.05056 (which is Part II)
Keywords:
1. Introduction
This is the first part of two papers on necessary and sufficient optimality conditions for an optimal control problem governed by a semilinear heat equation containing bilinear terms coupling the control and the state, and subject to constraints on the control and state. The control may have several components and enters in an affine way in the cost. In this first part we derive necessary optimality conditions of first and second order, in the second part [2] sufficient optimality conditions are shown.
In the context of second order conditions for problems governed by control-affine ordinary differential equations we can mention several works, starting with the early papers [18] by Goh and [19] by Kelley, later [15] by Dmitruk, and recently [1]. In this context, the case dealing with both control and state constraints was treated in e.g. Maurer [25], McDanell and Powers [28], Maurer, Kim and Vossen [27], Schättler [30], and Aronna et al. [3]. Fore a more detailed description of the contributions in this framework, we refer to [3].
In the infinite dimensional case, the issue of second order conditions for problems governed by elliptic equations and assuming state constraints was treated by several authors, see e.g. Casas, Tröltzsch and Unger [12], Bonnans [6], Casas, Mateos and Tröltzsch [11] and Casas and Tröltzsch [13].
Parabolic optimal control problems with state constraints were discussed in several articles. For a semilinear equation in the presence of pure-state constraints, Raymond and Tröltzsch [29], and Krumbiegel and Rehberg [20] obtained second order sufficient conditions. Casas, de Los Reyes, and Tröltzsch [10] and de Los Reyes, Merino, Rehberg and Tröltzsch [14] proved sufficient second order conditions for semilinear equations, both in the elliptic and parabolic cases. The articles mentioned in this paragraph did not consider bilinear terms as we do in the current work.
Further details regarding the existing results on second order analysis of control-affine state-constrained problems are given in the second part [2] of this research.
The contribution of this paper are first and second order necessary optimality conditions for an optimal control problem for a semilinear parabolic equation with cubic nonlinearity, several controls coupled with the state variable through bilinear terms, pointwise control constraints and state constraints that are integral in space. To incorporate the state constraints we use the concept of alternative costates (see Bonnans and Jaisson [8]) and the concept of quasi-radial directions (see Bonnans and Shapiro [9] and Aronna, Bonnans and Goh [3]).
The paper is organized as follows. In Section 2 the problem is stated and main assumptions are formulated. In Section 3 first order analysis is done. Section 4 is devoted to second order necessary conditions. Finally, in the appendix, we give an example satisfying the hypotheses of our main results.
Notation
Let be an open and bounded subset of , with boundary . Given and , let be the Sobolev space of functions in with derivatives (here and after, derivatives w.r.t. or w.r.t. time are taken in the sense of distributions) in up to order Let be the set of functions with compact support in . By we denote the closure of with respect to the -topology. Given a horizon , we write . denotes the norm in and , indistinctively. When a function depends on both space and time, but the norm is computed only with respect to one of these variables, we specify both the space and domain. For example, if and we fix we write . For the -norm in for we use , for the Euclidean norm we omit the index. We set and , with dual denoted by . By we mean the Sobolev space of -functions whose second derivative in space and first derivative in time belong to . For , we denote by the set of elements of with zero trace on , and by its trace at time zero. We write for and, setting we define the state space as
[TABLE]
The latter is continuously embedded in
[TABLE]
Note that if is a function over , we use to denote its time derivative in the sense of distributions. As usual we denote the spatial gradient and the Laplacian by and . By for , we denote the distance of to the set .
2. Statement of the problem and main assumptions
In this section we introduce the optimal control problem we deal with and we show well-posedness of the state equation and existence of solutions of the optimal control problem.
2.1. Setting
Consider the state equation
[TABLE]
and
[TABLE]
, is a constant, and . Lemma 2.3 below shows that for each control there is a unique associated solution of (2.1), called the associated state. Let denote this solution. We consider control constraints of the form , where
[TABLE]
In some statements, we will consider a specific form of (see (3.26) below). In addition, we have finitely many linear running state constraints of the form
[TABLE]
where for , and . The regularity of is used in Lemma 3.2 to derive regularity results for the adjoint state and the regularity in Proposition 3.11 for results on the Lagrange multiplier associated with the state constraint.
We call any a trajectory, and if it additionally satisfies the control and state constraints, we say it is an admissible trajectory. The cost function is
[TABLE]
where
[TABLE]
and . We consider the optimal control problem
[TABLE]
For problem (P) we consider the two types of solution given next.
Definition 2.1**.**
Let . We say that is an -local solution (resp., -local solution) if there exists such that is a minimum among the admissible trajectories that satisfy (resp., ).
2.2. Well-posedness of the state equation
Here we study the state equation and analyze, by means of the Implicit Function Theorem, the control-to-state mapping, i.e. the mapping that associates to each control, the corresponding solution of the state equation. We start by the following easily checked technical result.
Lemma 2.2**.**
For , the mapping defined on , given by has image in , is of class , and satisfies
[TABLE]
A uniqueness and existence result, and a priori estimates for the state follows.
Lemma 2.3**.**
The state equation (2.1) has a unique solution in . The mapping (u,y_{0},f)\mapsto y[{\color[rgb]{0,0,0}u,y_{0},f}] is from to , and nondecreasing w.r.t. and . In addition, there exist functions , to 2, not decreasing w.r.t. each component, such that
[TABLE]
Moreover, the state also belongs to , since is continuously embedded in that space [24, Theorem 3.1, p.23].
In the proof that follows, we use several times the (continuous) Sobolev inclusion
[TABLE]
Proof.
(i) Observe first that by the standard Sobolev inclusions and Lemma 2.2, any is such that and belong to . So, and, therefore, the notion of solution of the state equation in is clear. We could as well define a solution in but since by (2.10), for , and the compatibility condition (equality between the trace of the initial condition on and the Dirichlet condition on ) holds, it follows then that any solution in is a solution in .
(ii) We establish the a priori estimates (2.8)-(2.9). Multiplying the state equation by and integrating over , we get
[TABLE]
In particular, satisfies
[TABLE]
By Gronwall’s Lemma:
[TABLE]
and then (2.8) easily follows.
Now multiplying the state equation by we get, for all ,
[TABLE]
Choosing we get, after cancellation,
[TABLE]
For , integrating from [math] to , and using (2.10), we obtain that
[TABLE]
We easily deduce (2.9) since we can estimate and, therefore, also with the previous relations.
(iii) We construct a sequence of Galerkin approximations for which estimates analogous to (2.8) hold. Some subsequence weakly converges in to some and is such that the sequence , bounded in , weakly converges in this space. By the Aubin-Lions lemma [4], the injection of into is compact. So (extracting again a subsequence if necessary), converges a.e. to . By Lions [22, Lem. 1.3, p. 12], the weak limit of is , and is therefore solution of the state equation.
(iv) The regularity of is a consequence of the Implicit Function Theorem. In fact, let denote the trace at time 0 of elements of , which with the trace norm is a Banach space containing . Then the mapping defined by
[TABLE]
is of class . That the linearized mapping is bijective follows from results already shown in this proof.
(v) Uniqueness follows from the monotonicity w.r.t. , that we prove as follows. Consider the difference of two solutions and of (2.1), with data , resp. By the Mean Value Theorem, is solution of
[TABLE]
where a.e., {\color[rgb]{0,0,0}\tilde{y}_{0}}:=y_{02}-y_{01}\leq 0 and . Testing the equation with we get that satisfies
[TABLE]
and applying Gronwall’s inequality we obtain that . ∎
In the analysis that follows, we fix a trajectory
For this trajectory , let us consider the linear continuous operator from to such that, for each and
[TABLE]
Lemma 2.4**.**
For any , the equation
[TABLE]
has a unique solution that verifies
[TABLE]
Proof.
We follow the same method used in Lemma 2.3. Multiplying (2.21) by and integrating over space we obtain that for a.a.
[TABLE]
The r.h.s. of (2.23) can be bounded above by
[TABLE]
Then we deduce the estimate (2.22) with Gronwall’s Lemma. ∎
2.3. Existence of solution of the optimal control problem
In order to study the existence of local solutions, we need to establish the sequential weak continuity of the control-to-state mapping. We use ’’ to denote the weak convergence of a sequence, the space being indicated in each case. We need the following result (see [23, p. 14]):
[TABLE]
Lemma 2.5**.**
The mapping is sequentially weakly continuous from into .
Proof.
Taking in , we shall prove that in , where , We know that it is enough to check that any subsequence of weakly converges to in . To do this, we prove that we can pass to the limit in each term of the state equation.
(a) We know by Lemma 2.3 that is bounded in , so extracting a subsequence if necessary, we may assume that it weakly converges in to some . By (2.25), in and, therefore, maybe for a subsequence, it converges almost everywhere in .
Let be integer. Set . By the mean value theorem, , with a.e. Obviously is measurable and bounded in . By Hölder’s inequality, with and (note that ), we get
[TABLE]
Therefore, in . Taking we get the desired result.
(b) We claim that weakly converges in to . It is enough to get the result when . Fix in . By Lemma 2.2, is bounded in and has therefore (up to a subsequence) a weak limit in that space. Since in , . On the other hand since . Therefore . Since is a dense subset of . The claim follows.
By steps (a)-(b), we can pass to the limit in the weak formulation, and obtain (due to the uniqueness of solution) that . The conclusion follows. ∎
Theorem 2.6**.**
(i)* The function from to , is weakly sequentially l.s.c. (ii) The set of solutions of the optimal control problem (P) is weakly sequentially closed in (iii) If (P) has a bounded minimizing sequence, the set of solutions of (P) is non empty. This is the case in particular if (P) is admissible and is a nonempty, bounded subset of .*
Proof.
(i) Combine Lemma 2.5 and the fact that the cost function is continuous and convex on , hence it is also weakly lower semicontinuous over this product space.
(ii) Let be a sequence of solutions weakly converging to with associated states . By Lemma 2.5, weakly converge in to the state associated with and, by point (i), . This lower limit being nothing but the value of problem (P), the conclusion follows.
(iii) By the previous arguments, a weak limit of a minimizing sequence is a solution of (P). This weak limit exists iff the sequence is bounded. This concludes the proof. ∎
3. First order analysis
In this section we state first order necessary optimality conditions. More precisely, we introduce the adjoint equation, and define and prove existence of associated Lagrange multipliers.
Throughout the section, is a trajectory of problem (P). We recall the hypotheses (2.2), (2.6) on the data, and the definition of the operator given in (2.20).
3.1. Linearized state equation and costate equation
The linearized state equation at is given by
[TABLE]
For , equation (3.1) above possesses a unique solution (as follows from Lemma 2.4), and the mapping is linear and continuous from to Particularly, the following estimate holds.
Proposition 3.1**.**
One has
[TABLE]
*where *
Proof.
Immediate consequence of Lemma 2.4. ∎
It is well-known that the dual of is the set of (finite) Radon measures, and that the action of a finite Radon measure coincides with the Stieltjes integral associated with a bounded variation function . We may assume w.l.g. that , and we let denote the Radon measure associated to . Note that if belongs to the set of nonnegative finite Radon measures then we may take nondecreasing. Set
[TABLE]
The generalized Lagrangian of problem is, choosing the multiplier of the state equation to be and taking ,
[TABLE]
The costate equation is the condition of stationarity of the Lagrangian with respect to the state that is, for any :
[TABLE]
To each , let us associate , the unique solution of
[TABLE]
Since this mapping is onto, the costate equation (3.5) can be rewritten, for and arbitrary , as
[TABLE]
The r.h.s. of (3.7) can be seen as a linear continuous form on the pairs of the space . By the Riesz Representation Theorem, there exists a unique satisfying (3.7), that means, there is a unique solution of the costate equation.
Next consider the alternative costates
[TABLE]
Lemma 3.2**.**
Let satisfy (3.7), let be given by (3.8). Then , it satisfies , and it is the unique solution of
[TABLE]
Moreover, and are well-defined as elements of in view of (3.8), and we have
[TABLE]
Proof.
Let . Note that, for , the function , belongs to and is, therefore, of bounded variation. Using the integration by parts formula for the product of scalar functions with bounded variation, one of them being continuous (see e.g. [8, Lemma 3.6]), and taking into account the fact that , we get that, for ,
[TABLE]
By the definition (3.8) of the alternative costate, the latter equation can be rewritten as
[TABLE]
Now adding (3.7) and (3.12), as well as the identity
[TABLE]
we obtain, since , that (implicitly identifying, as usual, with its dual)
[TABLE]
Since is symmetric, using (2.6), we see that is solution in of (3.9); the solution of the latter being clearly unique. Multiplying (3.9) by and integrating over , with an integration by parts of the term with , we recover (using (3.8)) equation (3.14) implying that for a.a. in . Conversely, it is easy to prove that any solution of (3.14) is solution of (3.9).
Since and belong to , by (3.8) also has this regularity. Use (3.8) again, the final condition on and the fact that to get the second relation of (3.10). Furthermore, we have
[TABLE]
∎
Corollary 3.3**.**
If then and
[TABLE]
Proof.
This follows immediately from (3.8) and (3.9). ∎
3.2. First order optimality conditions
Let be an admissible trajectory of problem . We say that is complementary to the state constraint for if
[TABLE]
Let We say that is the costate associated with , or shortly to if it is the unique solution of (3.5) with .
Definition 3.4**.**
We say that the triple is a generalized Lagrange multiplier if it satisfies the following first-order optimality conditions: is complementary to the state constraint, is the costate associated with , the non-triviality condition
[TABLE]
holds and, for to , defining the switching function by
[TABLE]
one has and
[TABLE]
We let denote the set of generalized Lagrange multipliers associated with . If we say that the corresponding multiplier is singular. Finally, we write for the set of pairs with . When the nominal solution is fixed and there is no place for confusion, we just write and
Note that, in view of (3.10), and hence we do not need to consider as a component of the multiplier.
3.2.1. The reduced abstract problem
Set and , . The reduced problem is
[TABLE]
where is the closed convex cone of continuous functions over with values in Its interior is the set of functions in with negative values. We say that the reduced problem (RP) is qualified at if:
[TABLE]
Given a Banach space a closed convex subset and a point the normal cone to at is defined as
[TABLE]
We get the following first order conditions for our problem :
Lemma 3.5**.**
(i)* If is an -local solution of then the associated set of multipliers is nonempty.
(ii) If in addition the qualification condition (3.21) holds at , then there is no singular multiplier, and is bounded in .*
Proof.
(i) Let us consider the generalized Lagrangian associated with the reduced problem (RP):
[TABLE]
Let be a local solution of (RP). By, e.g., [9, Proposition 3.18], since has nonempty interior, there exists a generalized Lagrange multiplier associated with problem (RP), that is, for such that
[TABLE]
Due to the costate equation (3.7), the latter condition is equivalent to (3.20).
(ii) That is nonempty and weakly-* compact follows from [9, Proposition 3.16]. ∎
Observe that the qualification condition for (RP) given in (3.21) holds if and only if the following qualification condition for the original problem (P) is satisfied:
[TABLE]
In view of Lemma 3.5, if (3.25) is satisfied, then is nonempty and weakly-* compact.
In the sequel of this section, we consider with the state associated with the admissible control and
3.3. Arcs and junction points
We assume in the remainder of the article that the admissible set of controls has the form
[TABLE]
for some constants , for Consider the contact sets associated to the control bounds defined, up to null measure sets, by
[TABLE]
For the contact set associated with the th state constraint is
[TABLE]
Given , we say that is a maximal state constrained arc for the th state constraints, if contains but it contains no open interval strictly containing . We define in the same way a maximal (lower or upper) control bound constraints arc (having in mind that the latter are defined up to a null measure set).
We will assume the following finite arc property:
[TABLE]
In the sequel we identify (defined up to a null measure set) with a function whose th component is constant over each interval of time that is included, up to a zero-measure set, in either or . For almost all , the set of active constraints at time is denoted by where
[TABLE]
These sets are well-defined over open subsets of where the set of active constraints is constant, and by (3.29), there exist time points called junction points
[TABLE]
such that the intervals are maximal arcs with constant active constraints, for We may sometimes call them shortly maximal arcs.
Definition 3.6**.**
For let denote the set of indexes of active lower and upper bound constraints, and state constraints, on the maximal arc , and set .
As a consequence of above definitions and hypothesis (3.26) on the admissible set of controls, we get the following characterization of the first order condition.
Corollary 3.7**.**
The first order optimality condition (3.20) is equivalent to
[TABLE]
for every
3.4. About the jumps of the multiplier at junction points
Given a function , where is a Banach space, we denote (if they exist) its left and right limits at by , with the convention , ; then the jump of at time is defined as .
We denote the time derivative of the state constraints by
[TABLE]
Note that is an element of for each
Lemma 3.8**.**
Let have left and right limits at . Then
[TABLE]
Proof.
Since , , and any function with bounded variation has left and right limits, we have that has left and right limits in and satisfies
[TABLE]
Consequently has left and right limits over , and
[TABLE]
Next, if has left and right limits at some , then, using the state equation and (3.33), we get
[TABLE]
Thus, by (3.36) and (3.37), we have
[TABLE]
By the first order conditions (3.32) we have , for to . Also , and if , the corresponding state constraint has a maximum at time . Then [{{\color[rgb]{0,0,0}\bar{g}^{(1)}_{j}[\tau]}}]\leq 0. So, all terms in the sums in (3.38) are nonpositive and therefore are equal to zero. The conclusion follows. ∎
3.5. Regularity of the switching function and multiplier over maximal arcs
In the discussion that follows we fix in , and consider a maximal arc where the junction points are given in (3.31). Recall Definition 3.6 for and . Set and
[TABLE]
Let (of size ) denote the submatrix of having rows with index in and columns with index in . In the sequel we make the following assumption.
Hypothesis 3.9**.**
We assume that for and that the following (uniform) local controllability condition holds:
[TABLE]
Remark 3.10**.**
This hypothesis was already used in a different setting (i.e. higher-order state constraints in the finite dimensional case) in e.g. [7, 26]. Note that condition (3.40) implies, in particular, that the matrix has rank over .
The expression of the derivative of the th state constraint, for , is
[TABLE]
or, in vector form, for the active state constraints (denoting by {{\color[rgb]{0,0,0}\bar{g}^{(1)}_{C_{k}}[t]}} the vector of components {{\color[rgb]{0,0,0}\bar{g}^{(1)}_{j}[t]}} for ), we get
[TABLE]
where is the restriction of to the components in , and takes into account the contributions of the integral in (3.41) and of the components of in , that is, for :
[TABLE]
By the controllability condition (3.40), is onto from to . In view of the state equation, by an integration by parts argument, has a bounded derivative and is therefore Lipschitz continuous. So there exists a linear change of control variables of the form for some invertible Lipschitz continuous matrix of size , such that, calling the upper diagonal block of it holds that has its first columns being equal to the identity matrix, the other columns having null components. That is, for all :
[TABLE]
Over a maximal arc , we have that {{\color[rgb]{0,0,0}\bar{g}^{(1)}_{j}[t]}}=0 for is equivalent to
[TABLE]
The following result on the regularity of the state constraint multiplier holds. Recall the definition of the switching function given in (3.19).
Proposition 3.11**.**
There exists such that
- (i)
[TABLE]
- (ii)
We have that is locally integrable over , hence is locally absolutely continuous, and the following expression holds
[TABLE]
Proof.
By (3.8) and (3.19), one has, for :
[TABLE]
Let be given by
[TABLE]
Note that is integrable (this follows integrating by parts the contribution of and since ), and that
[TABLE]
Integrating by parts the terms in (3.50) containing Laplacians, we get, for the integral term in (3.49),
[TABLE]
It follows that and (3.46) holds. Consequently has bounded variation.
Over , we have whenever , and so
[TABLE]
Since is continuous and injective, and is integrable, this implies the existence of , for . This yields (3.47).
And so, is locally absolutely continuous. ∎
Corollary 3.12**.**
Let the finite maximal arc property (3.29) and the uniform controllability condition (3.40) hold.
- (i)
If then
- (ii)
If additionally then is over each maximal arc
Proof.
Indeed, a careful inspection of the previous proof shows that is a sum of essentially bounded terms, so (i) follows. If the additional regularity hypotheses of item (ii) hold, then is continuous. The regularity of follows from (3.52) and the local controllability assumption (3.40). This concludes the proof. ∎
4. Second order necessary conditions
In this section we derive second order necessary optimality conditions, based on the concept of radiality of critical directions.
Let us consider an admissible trajectory .
4.1. Assumptions and additional regularity
For the remainder of the article we make the following set of assumptions.
Hypothesis 4.1**.**
The following conditions hold:
the control set has the form (3.26),
- 2.
the finite maximal arc property (3.29),
- 3.
the qualification hypothesis (3.25),
- 4.
the local (uniform) controllability condition (3.40) over each maximal arc ,
- 5.
the discontinuity of the derivative of the state constraints at corresponding junction points, i.e.,
[TABLE]
- 6.
the uniform distance to control bounds whenever they are not active, i.e. there exists such that,
[TABLE]
- 7.
*the following regularity for the data (we do not try to take the weakest hypotheses) *for some :
[TABLE]
- 8.
*the control has left and right limits at the junction points , (this will allow to apply Lemma 3.8). *
In view of point 3 above, we consider from now on and thus we omit the component of the multipliers.
Theorem 4.2**.**
The following assertions hold.
- (i)
For any the associated state belongs to If remains in a bounded subset of then the corresponding states form a bounded set in . In addition, if the sequence of admissible controls converges to a.e. on , then the associated sequence of states converges uniformly to in .
- (ii)
For every one has that and is essentially bounded in .
Proof.
(i) Let . That follows from Theorem A.3 in the Appendix. Taking , it follows from the Sobolev Embedding Theorem (see e.g. [17, Theorem 5, p. 269]) that is continuous (and even Hölder-continuous) on the closure of , with uniform bound over the set of admissible controls. If the sequence of admissible controls converges a.e. to , by the Dominated Convergence Theorem, in for all . So, by similar arguments it can be proved that the associated sequence of states converges uniformly to .
(ii) By Hypothesis 4.1, is the trace at time of an element of vanishing on and this obviously holds also for in view of Theorem A.3 in the Appendix. It follows then from corollary A.2 that . The continuity of at junction points follows from (4.1) in Hypothesis 4.1 and Lemma 3.8. The boundedness on each arc of the derivative of follows from (3.47) for , since by Corollary 3.12, and by (3.40), is ‘uniformly injective’ over each arc. The conclusion follows. ∎
4.2. Second variation
For set
[TABLE]
and consider the quadratic form
[TABLE]
Let be a trajectory, and set
[TABLE]
Recall the definition of the operator given in (2.20). Subtracting the state equation at from the one at , we get that
[TABLE]
Combining with the linearized state equation (3.1), we deduce that given by
[TABLE]
satisfies the equation
[TABLE]
where and are defined as
[TABLE]
Proposition 4.3**.**
Let , and let be a trajectory. Then
[TABLE]
Here, we omit the dependence of the Lagrangian on being equal to .
Proof.
Use to denote the l.h.s. of (4.11). We have
[TABLE]
By (3.5) we obtain
[TABLE]
Thus, from (4.12) and (4.13) we get
[TABLE]
which leads to (4.11) in view of the definition of given in (3.19). This concludes the proof. ∎
4.3. Critical directions
Recall the definitions of and given in (3.27) and (3.28), and remember that we use to denote the solution of the linearized state equation (3.1) associated to
Let us define the cone of critical directions at in , or in short critical cone, by
[TABLE]
The strict critical cone is defined below, and it is obtained by imposing that the linearization of active constraints is zero,
[TABLE]
Hence, clearly and is a closed subspace of Now, note that in the interior of each one has, for every
[TABLE]
which can be rewritten as
[TABLE]
in view of the definition of given in (3.39). Therefore, over any arc we have for if and only if and (4.18) holds over . We define the entry (resp. exit) point of a time interval as (resp. ). This induces the consideration of the following sets
[TABLE]
[TABLE]
With these definitions, we can write the strict critical cone as
[TABLE]
and prove the following result.
Lemma 4.4**.**
C_{\rm s}\cap\Big{(}Y\times L^{\infty}(0,T)^{m}\Big{)}* is dense in with respect to the -topology.*
Proof.
In view of Dmitruk’s density lemma (see [16, Lemma 1]), it is enough to prove that C_{\rm n}\cap\Big{(}Y\times L^{\infty}(0,T)^{m}\Big{)} is a dense subset of .
Let us then take Recall the definition of the junction times given after equation (3.39). Fix Note that we can take a partition of , say , such that is contained in some , and on a fixed set of the rows of is linearly independent with rank equal to the one of . Now consider the matrix given after (3.39). Using the same notation as in (3.42), let us write to refer to the restriction of to the components in For each , we can write
[TABLE]
where and for almost all hence for some Let be the -dimensional vector with components
[TABLE]
Then (4.18) can be rewritten as
[TABLE]
and, therefore, \lambda_{k}(t)=\big{(}\bar{M}_{k}(t)^{\top}\bar{M}_{k}(t)\big{)}^{-1}E_{C_{k}}(t), so that
[TABLE]
By an integration by parts (in space) argument, it follows that is a continuous function, and so is . Therefore, is continuous on each maximal arc. We may also view the application as a linear and continuous mapping say
[TABLE]
where is the set of active state constraints on and, for , is the Banach space of continuous real functions with domain , endowed with the norm
[TABLE]
with the convention “”.
For any there exists in such that , it has zero components for indexes corresponding to active control bound constraints, and for a.a. In fact, to construct this it suffices to project an approximation of obtained by a truncation argument on the kernel . In what follows we shall abuse notation and use the same symbol to denote a vector and its canonical immersion in Let be the unique solution in of the linearized equation
[TABLE]
with the usual initial and boundary conditions, and where is the restriction of to the set Set and define to have the restriction to equal to and the restriction to equal to Then is in and Hence, C_{\rm n}\cap\Big{(}Y\times L^{\infty}(0,T)^{m}\Big{)} is a dense subset of . The conclusion follows. ∎
4.3.1. Radiality of critical directions
According to Aronna et al. [3, Definition 6], a critical direction is quasi radial if there exists such that, for the following conditions are satisfied:
[TABLE]
Lemma 4.5**.**
Every direction in C_{\rm s}\cap\Big{(}Y\times L^{\infty}(0,T)^{m}\Big{)} is quasi radial.
Proof.
Let (z,v)\in C_{\rm s}\cap\Big{(}Y\times L^{\infty}(0,T)^{m}\Big{)}. Then (4.30) follows from (4.2). Let us next prove (4.29). The function has the derivative , so that and hence, . Let . By the Cauchy-Schwarz inequality, for any :
[TABLE]
Let be a maximal constrained arc with say . Take , and . When , by the Dominated Convergence Theorem, . Given , we deduce with (4.1) that for and close enough to :
[TABLE]
The maximum of the r.h.s. of (4.32) over is attained when
[TABLE]
So the r.h.s. of (4.32) is less or equal than . Since we can take arbitrarily small, it is of order . For close to , we have a similar result. For far from the boundary, (4.29) is a consequence of hypothesis (4.1). The conclusion follows. ∎
Combining the previous result with Lemma 4.4, we deduce that:
Corollary 4.6**.**
The set of quasi radial critical directions of is dense in
4.4. Second order necessary condition
We obtain the following result applying Corollary 4.6 above and the second order condition in an abstract setting proved in [3, Theorem 8].
Theorem 4.7** (Second order necessary condition).**
Let the admissible trajectory be an -local solution of . Then
[TABLE]
Proof.
Let By Corollary 4.6, there exists a sequence of quasi radial directions converging to in . Doing as in [3, Theorem 8], we get the existence of a multiplier (with defined in Section 3.2.1), such that
[TABLE]
By Lemma 3.5, is bounded so that is also bounded. Extracting if necessary a subsequence, we may assume that weakly- converges to some with , and since is included in , weakly converges in to some , such that . Since in , by lemma 2.2, strongly converges to , and so we easily deduce that The conclusion follows. ∎
Appendix A Strong solutions of the heat equation
We consider the heat equation with Dirichlet boundary condition:
[TABLE]
We have the following result, see Lieberman [21, Thm 7.32, p. 182]:
Theorem A.1**.**
Let , and . Setting and (trace of over ), equation (A.1) has a unique solution . In addition there exists such that
[TABLE]
Corollary A.2**.**
Given , y_{0}\in{\color[rgb]{0,0,0}W^{1,r}_{0}({\Omega})\cap W^{2,r}({\Omega})} and , equation (A.1) has, for , a unique solution that satisfies
[TABLE]
Proof.
Apply Theorem A.1 with . It is clear that and that has trace at time 0 and zero trace over . The conclusion follows. ∎
By the standard Sobolev embeddings, we have the continuous inclusion
[TABLE]
This allows to prove the following.
Theorem A.3**.**
Assume that , y_{0}\in{\color[rgb]{0,0,0}W^{1,r}_{0}({\Omega})\cap W^{2,r}({\Omega})} and , with . Then the state equation (2.1) has a unique solution in , and the mapping is of class from L^{\infty}(0,T)\times{\color[rgb]{0,0,0}W^{1,r}_{0}({\Omega})\cap W^{2,r}({\Omega})}\times L^{r}({\Omega}) into .
Proof.
We have that belongs to . Let be the unique solution of in , where and , with homogeneous Dirichlet condition on the boundary. Set and . Denote by (resp., ) the solution of the state equation (2.1) when is (resp. ). By the monotonicity results in Lemma 2.3, we have that . Now let , denote the solutions of the state equation (2.1) when is , respectively and, in addition, . We claim that . Indeed, for , set . Then
[TABLE]
Since and have the same initial conditions, it follows that . In an analogous way, it can be proved that
Since y_{0}^{\pm}\in{\color[rgb]{0,0,0}W^{1,r}_{0}({\Omega})\cap W^{2,r}({\Omega})} and , by Corollary A.2, and belong to and, therefore, since , they are also elements of . So, . Consequently, and, by Theorem A.1 again, .
We recall that, for , denotes the set of elements of with zero trace on , and denotes the trace of at time zero. Endowed with the “trace norm”, is a Banach space that contains in view of the proof of the above Corollary A.2 (by Lions [23, p. 20], is a subset of ). That is of class is a consequence of the Implicit Function Theorem applied to the mapping from into , defined by
[TABLE]
The key step is to prove that the partial derivative is bijective; this can be done easily, taking advantage of the fact that when . ∎
Appendix B An example
Since we made a number of hypotheses about the optimal trajectory, especially at junction points, it is useful to give an example where these hypotheses are satisfied. For that purpose we discuss a particular case in which the original optimal control problem can be reduced to the optimal control of a scalar ODE.
Let and denote by the first (normalized) eigenvector of the Laplace operator.
We assume that , the control is scalar (), and in and that in Then the state equation with initial condition reads
[TABLE]
It is easily seen that the state satisfies , where is solution of
[TABLE]
We set and consider the state constraint (3.17) with and and the cost function (2.5) with . The state constraint reduces to
[TABLE]
As target functions take and with
[TABLE]
We assume that the lower and upper bounds for the control are and . We will check that the optimal control is
[TABLE]
Thus, for the optimal state we have
[TABLE]
The above control is feasible. The trajectory is optimal since for any , the state has the best possible value (in order to approach and minimize the cost function) that respects the state constraint.
Let us check Hypothesis 4.1 for this example. Conditions 1 and 2 are obviously satisfied. For the constraint qualification in Condition 3 consider the linearized state equation with unique :
[TABLE]
with . One easily checks that for all . Hence, we can find such that
[TABLE]
Conditions 4 holds, since
[TABLE]
For Condition 5 we have
[TABLE]
and hence,
[TABLE]
Conditions 6 and 8 hold by the choice of the control in (B.5). Condition 7 holds by definition.
We solve this problem numerically using BOCOP [5] and get the optimal control and state given in Figure 1.
We now discuss the second order optimality condition for this example. The costate equation is
[TABLE]
with as defined in (2.20). Since and are colinear to , it follows that , and
[TABLE]
Over , (sate constraint not active) and , therefore and identically vanish. Over , is out of bounds and therefore
[TABLE]
It follows that and also vanish on and that
[TABLE]
Over the control attains its upper bound, then
[TABLE]
with final condition , so that
[TABLE]
As expected, is negative.
Next, the linearized state equation at reads
[TABLE]
Since , we deduce that , with solution of
[TABLE]
Therefore if satisfy the linearized state equation
[TABLE]
If in addition is a critical direction, since and a.e. on , and on we get
[TABLE]
Thus, is non-negative for any critical directions , in accordance with the second-order necessary condition of Theorem 4.7.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. S. Aronna, J. F. Bonnans, A. V. Dmitruk, and P. A. Lotito, Quadratic order conditions for bang-singular extremals , Numerical Algebra, Control and Optimization, AIMS Journal 2 (2012), no. 3, 511–546.
- 2[2] M. S. Aronna, J. F. Bonnans, and A. Kröner, State-constrained control-affine parabolic problems II: Second-order sufficient optimality conditions , (2019).
- 3[3] M. S. Aronna, J.F. Bonnans, and B. S. Goh, Second order analysis of control-affine problems with scalar state constraint , Math. Program. 160 (2016), no. 1-2, Ser. A, 115–147.
- 4[4] J.-P. Aubin, Un théorème de compacité , C. R. Acad. Sci. Paris 256 (1963), 5042–5044.
- 5[5] J. Bonnans, J.F., D. Giorgi, V. Grélard, B. Heymann, S. Maindrault, P. Martinon, O. Tissot, and J. Liu, Bocop – A collection of examples , Tech. report, INRIA, 2017.
- 6[6] J.F. Bonnans, Second-order analysis for control constrained optimal control problems of semilinear elliptic systems , Appl. Math. Optim. 38 (1998), no. 3, 303–325.
- 7[7] J.F. Bonnans and A. Hermant, Second-order analysis for optimal control problems with pure state constraints and mixed control-state constraints , Ann. Inst. H. Poincaré Anal. Non Linéaire 26 (2009), no. 2, 561–598.
- 8[8] J.F. Bonnans and P. Jaisson, Optimal control of a parabolic equation with time-dependent state constraints , SIAM J. Control Optim. 48 (2010), no. 7, 4550–4571.
