Maximum principle for an optimal control problem associated to a SPDE with nonlinear boundary conditions
Stefano Bonaccorsi, Adrian Zalinescu

TL;DR
This paper develops a maximum principle for an optimal control problem involving a stochastic PDE with nonlinear boundary conditions, where control acts on the boundary and influences the boundary dynamics.
Contribution
It introduces necessary and sufficient optimality conditions for boundary-controlled stochastic PDEs with nonlinear boundary conditions, extending classical maximum principles.
Findings
Derived maximum principle for boundary control of SPDEs
Proved existence of optimal control in linear boundary control case
Established conditions for optimality in nonlinear boundary settings
Abstract
We study a control problem where the state equation is a nonlinear partial differential equation of the calculus of variation in a bounded domain, perturbed by noise. We allow the control to act on the boundary and set stochastic boundary conditions that depend on the time derivative of the solution on the boundary. This work provides necessary and sufficient conditions of optimality in the form of a maximum principle. We also provide a result of existence for the optimal control in the case where the control acts linearly.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Stability and Controllability of Differential Equations · Advanced Mathematical Modeling in Engineering
Maximum principle for an optimal control problem associated to a SPDE
with nonlinear boundary conditions
Stefano Bonaccorsia, Adrian Zălinescua,b
a Dipartimento di Matematica, Università di Trento, via Sommarive 14, 38123 Povo (Trento), Italy.
b “O. Mayer” Mathematics Institute of the Romanian Academy, Iaşi,
Carol I Blvd., no. 8, Iaşi, 700506, Romania.
Abstract
We study a control problem where the state equation is a nonlinear partial differential equation of the calculus of variation in a bounded domain, perturbed by noise. We allow the control to act on the boundary and set boundary conditions which result in a stochastic differential equation for the trace of the solution on the boundary. This work provides necessary and sufficient conditions of optimality in the form of a maximum principle. We also provide a result of existence for the optimal control in the case where the control acts linearly.
Keywords: stochastic control, maximum principle, stochastic evolution equation, backward stochastic differential equation
MSC 2010: 93E20, 60H15, 60H30
1 Introduction
Let be a bounded smooth domain with regular boundary and outward normal unit vector ; we also fix a terminal time . We fix a nonlinear operator of Leray–Lions type, and we consider the following controlled nonlinear diffusion equation with dynamical boundary conditions:
[TABLE]
and are independent infinite dimensional Wiener processes with values in and , respectively. We assume that is an admissible control acting on the boundary and we study the problem of minimizing the cost functional
[TABLE]
Equations of the form (1), called fully parabolic boundary value problem in the seminal paper of Escher [18], have been considered also in the stochastic setting, see e.g. Chueshov and Schmalfuß [10], Bonaccorsi and Ziglio [8] and Barbu, Bonaccorsi and Tubaro [4]. Such problems are used to describe a wide variety of physical processes, among which we mention heat propagation in a plasma gas, population dynamics and other nonlinear diffusive phenomena (e.g., see [11]). It should be noticed that boundary conditions of the form prescribed in (1) are of a non-standard type; nevertheless, dynamical boundary conditions, i.e. involving formally a time derivative of the solution on the boundary are used as a model in several physical systems , see the paper [21] for a derivation and a physical interpretation in the case of the heat equation; further applications are given to heat transfer in a solid imbedded in a moving fluid [34, §7.2], surface gravity waves in oceanic models ([14], [15], [30]), as well as in fluid dynamics [33], phase separation phenomena [16], and this list is far from being exhaustive.
In our setting, existence for the solution of equation (1) is proven in [8] or [4] via an operatorial approach which allows to rewrite the system as a stochastic differential equation in the product space . A similar approach was recently developed for a class of deterministic parabolic equation with Wentzell boundary conditions in [5].
Our objective is to control such a system through the boundary, considering that in practice it is easier to implement boundary control than distributed parameter controls (see [12] for a discussion about the subject). Such control problems have been widely studied in the deterministic literature (see [27]) and have been addressed in the stochastic case as well (see [17], [22], [26], [29], [12]). With regard to dynamical boundary conditions, we mention that an associated control problem have already been addressed by Bonaccorsi, Confortola, Mastrogiacomo [7], following the backward SDEs (BSDEs, for short) approach introduced by Fuhrman and Tessitore in [20] in an abstract setting. We emphasize that in general the above papers concern one-dimensional domains.
The present article deals with the control problem from a different point of view. We will follow the maximum principle approach, which has been introduced by Pontryagin and his group in the 1950’s in order to establish necessary conditions of optimality for deterministic controlled systems. Towards the extension to the stochastic controlled systems one difficulty is that the adjoint equation becomes a linear BSDE, especially for stochastic PDEs (SPDEs), in which case the respective BSDE can be seen as a backward SPDE (BSPDE, for short). Several papers are devoted to the study of maximum principles for SPDEs; see, e.g., [6], [25], [32]. Stochastic maximum principle for SPDEs with noise and control on the boundary was established by Guatteri [23] and Guatteri and Masiero [24], in the case of an interval in . Their treatment, based on semigroup theory, is different from ours; in this paper we deal with variational solutions for the controlled system, as well as for the adjoint equation.
The paper is organized as follows. In section 2, we introduce some notations and recall some preliminary results concerning the well-posedness of the state equation. Section 3 is devoted to the derivation of necessary and sufficient optimality conditions in the form of a maximum principle. In order to achieve this, we use the duality between the adjoint equation and the variation equation. We will first analyze the adjoint equation, for which we give an existence theorem based on a result of Márquez-Durán and Real [28] concerning BSDEs in a variational framework. Then, the variation equation is obtained by using a linear perturbation of the control. In section 4, we prove directly the existence of an optimal control under the assumption that the coefficient depends linearly on the control.
2 Preliminaries
Let be a bounded domain which is sufficiently regular (see, e.g. [1], Remark 7.45 or [13]). On we introduce the standard Sobolev space ; on the boundary we consider the fractional order Sobolev space
[TABLE]
endowed with the norm
[TABLE]
The following result of compactness†††an operator is compact if it maps bounded sets into precompact sets of the injection will be useful later:
[TABLE]
It is well-known that for a smooth domain , the trace operator , with the property that \tau(y)=y|\big{.}_{\Gamma}, , is well-defined. Moreover, the range of is actually and
[TABLE]
for some constant depending only on .
In what follows we suppose that the domain is bounded and smooth. We introduce the “pivot” space endowed with the natural inner product
[TABLE]
and norm . Let us consider the Banach space
[TABLE]
endowed with the norm
[TABLE]
The embedding is compact; this property will be used in the proof of Theorem 4.1. Furthermore, the space is isomorphic to and it is densely embedded in . Let be the dual space of , with the dualization denoted . We fix the Gelfand triple (the last formal inclusion implies that for every and ).
Let be a convex, closed subset of an Euclidian space . On the coefficients of the equation we impose the following conditions:
(A0)
is a Carathéodory function‡‡‡i.e., is continuous for every and is measurable for every with , -a.e. on ;
there exist constants and positive functions , such that:
(A1)
for almost all and all :
[TABLE]
(A2)
for almost all and all :
[TABLE]
(B)
and are Hilbert-Schmidt linear operators;
(C0)
is a Carathéodory function with , -a.e. on ;
(C1)
for almost all and all :
[TABLE]
(C2)
for almost all and all :
[TABLE]
In order to give a functional setting for our equation, let be the space of smooth functions
[TABLE]
We define an operator by
[TABLE]
An integration by parts, hypotheses (A0), (A1), and the density of in show that can be extended to a bounded non-linear operator on with values in , again denoted by , such that
[TABLE]
for all and .
We also set , so that is a Hilbert-Schmidt operator; we denote the space of such operators, endowed with the norm , for an orthonormal basis of . Consider a -cylindrical Wiener process, formally written
[TABLE]
where and are orthonormal bases in and , respectively, is a sequence of independent Brownian motions on and is the filtration generated by , augmented by the null sets of .
Then, for , the state equation (1) can be written as
[TABLE]
Here we assume that is an admissible control (or simply, control), i.e. a progressively measurable process . We will denote by the space of all admissible controls.
Theorem 2.1**.**
Under hypotheses (A0)–(A2), (B), (C0)–(C2), for every control , there exists a unique solution of equation (3) such that is a continuous, adapted process with values in . Moreover,
[TABLE]
For the proof of this result the reader can refer to the book of Prévôt and Röckner [31], where a general result of existence and uniqueness for variational solutions was given. The task of verifying that the above hypotheses are sufficient to place ourselves into their framework was already carried in [8].
The notion of solution for (3) that is used in Theorem 2.1 is that of variational solution as given in the book by Prévôt and Röckner [31, Definition 4.2.1]. Actually, this means that is an -valued, adapted process with an equivalent version that belongs to and satisfies the equation -a.s.
Concerning the cost functional (2), on its coefficients we impose the following hypotheses (the functions and were already introduced for the previous set of conditions):
(F0)
and are Carathéodory functions with , -a.e. on and , -a.e. on ;
there exist constants such that:
(F1)
for almost all and all :
[TABLE]
for almost all and all :
[TABLE]
(L0)
and are Carathéodory functions with , -a.e. on and , -a.e. on ;
(L1)
for almost all and all :
[TABLE]
for almost all and all :
[TABLE]
The cost functional can then be written as
[TABLE]
where and are defined by
[TABLE]
From now on, we will assume that conditions (A0)–(A2), (B), (C0)–(C2), (F0), (F1), (L0) and (L1) are in force.
It is easy to show that and are Gâteaux differentiable in , with
[TABLE]
Also, is Gâteaux differentiable in , with
[TABLE]
3 Maximum principle
3.1 The adjoint equation
We consider the following linear BSDE in :
[TABLE]
Theorem 3.1**.**
For every control , there exists a unique solution such that is a continuous, adapted process with values in .
Proof.
In order to prove this theorem, we will use a result of Márquez-Durán and Real [28] which asserts existence and uniqueness for general (non-linear) BSDEs in a variational setting. Let us now verify that the hypotheses of Theorem 2.2 in [28] are fulfilled for the coefficients of our BSDE.
Final condition. The fact that is clearly implied by linear growth condition on and . 2. 2.
Measurability. Of course,
[TABLE]
is a progressively measurable process with values in for every . 3. 3.
Hemicontinuity. The mapping
[TABLE]
is continuous, for every and . Indeed, for and , we have
[TABLE]
and the conclusion follows from the Lebesgue’s dominated convergence theorem, by (A1) and (C1). 4. 4.
Boundedness. By (L1), . Moreover, by (A1) and (C1), for every , is bounded by . 5. 5.
Monotonicity. We have that
[TABLE]
for every , a.e., by assumptions (A2) and (C2). 6. 6.
Coercivity. There exist , and a progressively measurable process such that
[TABLE]
for every , a.e. Indeed, for , we have
[TABLE]
∎
3.2 The variational equation
We define the operator by
[TABLE]
Let now and be two controls such that is bounded; let, for , . Let us denote, for simplicity, , and instead of , and , respectively.
Proposition 3.2**.**
The equation
[TABLE]
has a unique variational solution that is a continuous, adapted process in with . Moreover, and converge weakly§§§Recall that a sequence of random variables taking values in a Hilbert space converges weakly to in as if for any random variable we have . as to and in , respectively in .
Proof.
We have, by Itô’s formula,
[TABLE]
therefore, by (A1), (A2), (C1) and (C2),
[TABLE]
Since, for any ,
[TABLE]
we have, by the assumptions on and ,
[TABLE]
where is a constant whose value is allowed to change from line to line. Hence
[TABLE]
Then there exists a progressively measurable process such that, at least on a subsequence:
- •
converges weakly to as in ;
- •
converges to a.e. as on ;
- •
converges to a.e. as on .
For with , we have
[TABLE]
where, for the sake of simplicity, we have denoted
[TABLE]
By the dominated convergence theorem and (8), since and are bounded, we have that
[TABLE]
and
[TABLE]
We also have that
[TABLE]
where is a smooth function defined on such that , for and for . Since, by (C1),
[TABLE]
we have, by the dominated convergence theorem, that
[TABLE]
On the other hand, by (7) and (8),
[TABLE]
Therefore,
[TABLE]
Let be defined by
[TABLE]
By the weak convergence of to in , the boundedness of in and the density of in , we can pass to the limit in relation (9) and obtain that, for every , and converges weakly to in . This allows the identification
[TABLE]
from which we can infer that is a variational solution of equation (6).
The uniqueness of the solution of (6) is obtained by applying Theorem 4.2.4 in [31], for instance. A consequence of the uniqueness is that the weak convergences stated inside this argument hold not only on a subsequence, but on a whole right neighborhood of [math]. ∎
3.3 Necessary conditions of optimality
In this section we will derive, in the form of a maximum principle, necessary conditions for an admissible control to be optimal. Let us define the Hamiltonian by
[TABLE]
Theorem 3.3**.**
Let be an optimal control. Then, a.s., -a.e.,
[TABLE]
Remark. This inequality is equivalent to
[TABLE]
where is the set of admissible controls such that , -a.e. and denotes the directional derivative of with respect to in the direction (which exists if and ). This is known as the local form of the maximum principle.
Proof.
As in the previous section, we will take first an arbitrary control such that is bounded and we will use the same notations , , , for . We will also write , instead of , , respectively. Let us apply Itô’s formula to :
[TABLE]
Therefore, letting and taking expectation, we get
[TABLE]
On the other hand, since is an optimal control, for any , i.e.
[TABLE]
which is equivalent to
[TABLE]
Here, denotes the directional derivative of with respect to in the direction . Passing to the limit as , by the weak convergence property stated in Proposition 3.2 and similar arguments as in its proof, we obtain
[TABLE]
Combining this inequality with relation (11), we derive
[TABLE]
i.e.
[TABLE]
Since the control such that is bounded is chosen arbitrarily, we can infer easily that a.s., -a.e.
[TABLE]
∎
3.4 Sufficient conditions of optimality
In this section we show that condition (10) is, under some supplementary assumptions, sufficient for the optimality of a given control.
Theorem 3.4**.**
Let be a control satisfying (10). If the mappings and
[TABLE]
are convex a.s., -a.e., then is optimal.
Remark. Under the above convexity hypothesis, (10) becomes equivalent to
[TABLE]
which is the global form of the maximum principle.
Proof.
For an admissible control such that is bounded, let us apply Itô’s formula to :
[TABLE]
Since the map is convex, we have
[TABLE]
where denotes the directional derivative of with respect to in the direction . We make the remark that
[TABLE]
may be infinite, but exists, by (10). From relation (13) we get
[TABLE]
The convexity of implies that
[TABLE]
consequently
[TABLE]
By relation (10), the right-hand side of the above inequality is positive, so .
If is not bounded, we can take, for ,
[TABLE]
Applying Itô’s formula to , we get, by the properties of and ,
[TABLE]
Therefore, by the dominated convergence theorem,
[TABLE]
This implies that ; hence . ∎
Example. The convexity hypothesis for is hard to verify in practice, since the direction of and the sign of are not a priori determinable. However, under convexity assumptions on the coefficients, we just need to strengthen condition (10) in order to derive a sufficient optimality condition.
We will take (or, more general, linear with respect to ). Moreover, the functions , and are supposed to be convex, -a.e. on , respectively -a.e. on . For , on and we impose that:
- •
is convex, -a.e. on ;
- •
is convex, -a.e. on .
Let, for ,
[TABLE]
where is the exterior normal cone to in if and if .
A sufficient condition of optimality for an admissible control is then
[TABLE]
This condition is obviously equivalent to (10) when -a.e.
4 Existence of an optimal control
Let now study the problem of the existence of an optimal control under the convexity conditions on the coefficients of the cost functional and linearity of control.
Assume that is bounded and:
(C3)
, where satisfies conditions (C0)–(C2) and ;
(F2)
and are convex, -a.e. on , respectively -a.e. on ;
(L2)
and are convex, -a.e. on , respectively -a.e. on .
Remark. Notice that our assumptions, although stringent, cover most of the cases in the literature. For instance, Debussche, Fuhrman and Tessitore [12], Fabbri and Goldys [19] and Bonaccorsi, Confortola and Mastrogiacomo [7] consider linear control problems on the boundary (for Neumann, Dirichlet and dynamic boundary conditions, respectively), and all those papers are concerned with the one-dimensional problem. These papers, further, consider linear quadratic term in the cost functional, that hence satisfy assumptions (F2) and (L2).
On the other hand, in this paper we do not consider the structure condition that is necessary to apply the forward-backward approach of Fuhrman and Tessitore [20], i.e., the condition that the control and the noise enters the equation with the same operator in front of them.
Theorem 4.1**.**
Under the above assumptions, there exists at least an optimal control.
The necessary condition (10) provides more information about the optimal control whose existence is guaranteed by the above result. In fact, it can be written as
[TABLE]
(recall that is the exterior normal cone to in if and if ).
Proof.
By Itô’s formula applied to , it is clear that
[TABLE]
for every control (we recall that we use the generic term for positive constants, whose values can change from one place to another). Since is bounded, is bounded, too. Let be a sequence of controls such that . There exists such that a subsequence of converges weakly to . Without restricting the generality, we can suppose that the whole sequence converges to .
For the sake of simplicity, let us denote . Let us show that converges to . We have that, exactly as in (7), that
[TABLE]
Consequently, the sequences and are also bounded in , respectively in . Therefore, at least on a subsequence:
- •
converges weakly in to a process ;
- •
converges weakly in to for every ;
- •
converges weakly in to a process ;
- •
converges weakly in to a process .
If , then
[TABLE]
Passing to the limit in this relation, we obtain, for ,
[TABLE]
meaning that satisfies the relation
[TABLE]
where the -valued, square-integrable process is defined by
[TABLE]
In order to assert that , we have to prove the identification , -a.s. For that, we will use some results from the theory of maximal monotone operators (see [3], for example).
We have that, -a.s., and
[TABLE]
Moreover, we have
[TABLE]
and
[TABLE]
Consequently, the sequence is bounded in , -a.s. By a well-known result of Aubin (see, for example, Theorem 1.20 in [3]), since the inclusion is compact, is relatively compact in .
As we already have that converges weakly to in , we infer that converges strongly to in , -a.s. By the dominated convergence theorem converges strongly to in .
Let us define the operator on by
[TABLE]
Since is hemicontinuous and monotone, by Theorem 2.4 in [3], is a maximal monotone operator.
Itô’s formula applied to , respectively , yield
[TABLE]
and
[TABLE]
Consequently, since the norm in is lower-semicontinuous with respect to the weak topology, converges weakly to in and converges strongly to in , we get
[TABLE]
where denotes the scalar product in . By Corollary 2.4 in [3],
[TABLE]
i.e.
[TABLE]
By the uniqueness of the solution of equation (3), we have that .
The functional , defined by
[TABLE]
is strongly continuous. By conditions (F2) and (L2), it is also convex and therefore weakly lower semi-continuous. As a consequence, . Since , and , has to be an optimal control. ∎
Acknowledgement. The research of A. Zălinescu was supported by the 2010 PRIN project: “Equazioni di evoluzione stocastiche con controllo e rumore al bordo”.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R.A. Adams, J.J.F. Fournier, Sobolev spaces. Pure and Applied Mathematics, Elsevier/Academic Press, Amsterdam, 2003.
- 2[2] V. Barbu, Optimal control of variational inequalities. Research Notes in Mathematics, no. 100, Pitman Publishing, 1984.
- 3[3] V. Barbu, Nonlinear differential equations of monotone types in Banach spaces. Springer Monographs in Mathematics, Springer, New York, 2010.
- 4[4] V. Barbu, S. Bonaccorsi, L. Tubaro, Stochastic parabolic equations with nonlinear dynamical boundary conditions, J. Math. Anal. Appl. 427 (2015) 484–498.
- 5[5] V. Barbu, A. Favini, G. Marinoschi, Nonlinear parabolic flows with dynamic flux on the boundary, J. Differential Equations 258 (2015) 2160–2195.
- 6[6] A. Bensoussan, Stochastic maximum principle for distributed parameter systems, J. Franklin Inst. 315:5–6 (1983) 387–406.
- 7[7] S. Bonaccorsi, F. Confortola, E. Mastrogiacomo, Optimal control of stochastic differential equations with dynamical boundary conditions, J. Math. Anal. Appl. 344 (2008) 667–681.
- 8[8] S. Bonaccorsi, G. Ziglio, A variational approach to stochastic nonlinear diffusion problems with dynamical boundary conditions, Stochastics An International Journal of Probability and Stochastic Processes, 86:2 (2014) 218–233.
