Verification theorems for stochastic optimal control problems in Hilbert spaces by means of a generalized Dynkin formula
Salvatore Federico, Fausto Gozzi

TL;DR
This paper introduces a new approach to verification theorems in infinite-dimensional stochastic control, utilizing a generalized Dynkin formula that relaxes the need for strong solution assumptions.
Contribution
The paper presents a novel method for verification theorems in stochastic control that does not require the mild solution to be a strong solution, using a new Dynkin formula.
Findings
Developed a new Dynkin formula for infinite-dimensional stochastic processes.
Proved verification theorems without requiring strong solutions.
Applicable to Ornstein-Uhlenbeck processes with mild solutions.
Abstract
Verification theorems are key results to successfully employ the dynamic programming approach to optimal control problems. In this paper we introduce a new method to prove verification theorems for infinite dimensional stochastic optimal control problems. The method applies in the case of additively controlled Ornstein-Uhlenbeck processes, when the associated Hamilton-Jacobi-Bellman (HJB) equation admits a mild solution. The main methodological novelty of our result relies on the fact that it is not needed to prove, as in previous literature, that the mild solution is a strong solution, i.e. a suitable limit of classical solutions of the HJB equation. To achieve our goal we prove a new type of Dynkin formula, which is the key tool for the proof of our main result.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Economic theories and models · Risk and Portfolio Optimization
Verification theorems for stochastic optimal control problems in Hilbert spaces by means of a generalized Dynkin formula
Salvatore Federico111Università degli Studi di Siena, Dipartimento di Economia Politica e Statistica, Piazza San Francesco 7, 53100, Siena (Italy). Email: [email protected].
Fausto Gozzi222LUISS University, Dipartimento di Economia e Finanza, Viale Romania 32, 00197, Rome (Italy). Email: [email protected].
Abstract
Verification theorems are key results to successfully employ the dynamic programming approach to optimal control problems. In this paper we introduce a new method to prove verification theorems for infinite dimensional stochastic optimal control problems. The method applies in the case of additively controlled Ornstein-Uhlenbeck processes, when the associated Hamilton-Jacobi-Bellman (HJB) equation admits a mild solution (in the sense of [16]). The main methodological novelty of our result relies on the fact that it is not needed to prove, as in previous literature (see e.g. [26]), that the mild solution is a strong solution, i.e. a suitable limit of classical solutions of the HJB equation. To achieve the goal we prove a new type of Dynkin formula, which is the key tool for the proof of our main result.
Key words: Stochastic optimal control, infinite dimensional HJB equations, Dynkin’s formula, transition semigroups, verification theorems, optimal feedbacks.
AMS classification: 93E20 (Optimal stochastic control); 70H20 (Hamilton-Jacobi equations); 65H15 (Stochastic partial differential equations); 49L20 (Dynamic programming method); 49N35 (Optimal feedback synthesis).
Acknowledgements. The authors are sincerely grateful to Franco Flandoli, Ben Goldys and Mauro Rosestolato for fruitful discussions on Subsection 6.1.4 and Remark 6.2. The authors are also grateful to an anonymous referee for careful scrutiny and useful suggestions that led to an improved version of the paper.
Contents
-
6.1 Neumann Boundary control of a stochastic heat equation with additive noise
-
6.2 Stochastic optimal control with delay in the control variable
1 Introduction
In this paper we introduce a new technique, based on a generalized Dynkin formula, to prove verification theorems for stochastic optimal control problems over infinite horizon in Hilbert spaces.
Verification theorems are key results to enable to solve in a closed way optimal control problems through the dynamic programming approach. Once a solution (in some sense to be precised) of the associated HJB equation is known to exists, the verification theorem provides a sufficient (sometimes also necessary) condition of optimality, which can be used to find optimal controls in feedback forms through the so called closed loop equation. In the stochastic case, when the solution is sufficiently smooth, the proof of such theorem is substantially based on an applying the Dynkin formula to the function and to the state process. In our framework of discounted time-homogeneous infinite horizon problems the dependence on time is known, so the HJB equation is elliptic and only depends on the state variable. Hence, in the finite dimensional case, to employ the classical Dynkin formula, it is needed to know that . Fortunately, in the finite dimensional case, due to the presence of a powerful regularity theory (at least for nondegenerate second order HJB equations) there is a wide class of problems for which actually is known to enjoy this regularity, hence the classical Dynkin formula applies and the verification theorem can be proved. On the other hand, if is not known to be sufficiently smooth (i.e. when is known to be only a viscosity solution), still in the finite dimensional case, other techniques have been developed to overcome the fact that the classical Dynkin formula is not applicable. We mention the following techniques.
The technique developed in [33], dealing with viscosity solutions. In this case, the classical Dynkin formula is applied to test functions and only some weak results are obtained.
- -
The technique developed in [41]. Here a solution is obtained through the solution of a suitable backward SDE (BSDE). This technique applies to semilinear HJB equations and provides the verification theorem as a byproduct of the construction itself of the solution . The latter feature is particularly meaningful, as it allows to completely bypass the problem of second order regularity of and the application of the classical Dynkin formula. On the other hand, the powerfulness of this approach is partly limited by the fact that it can be applied only when a structural condition is verified by the control operator.
- -
The technique developed in [32]: here is studied and treated as a strong solution, i.e. as a suitable limit of classical solutions.
When the state space is infinite dimensional the situation is much worse. First of all, the regularity needed to apply the classical Dynkin formula (see, e.g., [10, Sec. 4.4]) is very demanding and does not allow to deal with many applied examples proposed and only partly studied in the literature. This is partly due to additional regularity assumptions on the coefficients needed in infinite dimension, partly due to the lack of a satisfactory regularity theory in infinite dimension. Hence, elaborating alternative methods is considerably more important than in the finite dimensional case. Clearly, the first attempt consists in trying to extend the techniques developed in the finite dimensional case to infinite dimensional one. On this side, so far the state of the art can be basically depicted as follows.
- (a)
There are no results concerning the case when is a viscosity solution.
- (b)
Results with the BSDE approach have been elaborated in various papers, see e.g. [21] in the infinite horizon case, but always under the structural condition. The latter requirement leaves out the treatments of important cases like boundary control of stochastic PDEs or delayed control of SDEs.
- (c)
Results dealing with strong solutions are available in [31] and in [5].
The results we provide here are closer, in the conclusions, to the results mentioned in item (c) above. With respect to them, ours have a larger range of applicability and, not only in this sense, can be seen as a significant improvement of this technique, as we will comment more precisely afterwards.
We stress the fact that our method to prove the verification theorem is a novelty also in finite dimension: our results may be useful to treat also finite dimensional problems where only partial regularity properties of the value function are known. Here we focus on the infinite dimensional case where the application is more meaningful.
We now illustrate the results and the novelties of our paper. We consider a class of stochastic optimal control problems in a real separable Hilbert space , where the noise is additive and the control only appears in an additive form in the drift term. More precisely, the state equation is
[TABLE]
where , , , are suitable operators, with being other real separable Hilbert spaces and being a Polish space; is a -valued cylindrical Browian motion; is the control process taking values in ; is the state process taking values in the Hilbert space . The stochastic control problem consists in minimizing, over a set of admissible control processes, a cost functional in the form
[TABLE]
where is a discount factor and is a suitable real valued function. In this case the associated HJB equation is an elliptic semilinear PDE in the space :
[TABLE]
where
[TABLE]
where denotes the -gradient of a function (see Subsection 2.2). Under reasonable assumptions, it is proved in [16] that such HJB equation admits a unique mild solution, i.e. a solution of a suitable integral form of the above equation. Such solution admits -gradient, i.e. verifies the minimal differentiability requirement to give sense to the nonlinear Hamiltonian term in HJB above. Once one proves the existence of a mild solution to the associated HJB equation, the approach of item (c) would require three nontrivial technical steps: first, proving that such a mild solution is indeed a strong solution (limit, in a suitable sense, of classical solution); second, applying Dynkin formula to the approximating classical solutions; third, passing to the limit the Dynkin formula. As one may expect, passing through all these steps requires additional hypotheses that may be nontrivial to check in practice (see e.g. [31]). Our goal here is to bypass these steps through an alternative path. In fact, we show that the role of strong solutions is not essential. Indeed, relying on the theory of -semigroups (see e.g. [14, Appendix B] and [43]), we prove a generalized (abstract) Dynkin formula — deserving interest in itself — which can be directly applied to mild solutions. The proof is quite involved and this is the reason why we consider here the case of stochastic control of equation of type (1.1), where the uncontrolled part of the state equation is of Ornstein-Uhlenbeck type333It is worth to stress that, even if in the case of Ornstein-Uhlenbeck dynamics the approach of strong solutions has already been succesfully applied (see [31]), the method used here, other than being original, seems to be extendable to more general structures of state equations, where the strong solution approach would fail.. Then, relying on this formula, we straightly prove a verification theorem. The new results on -derivatives provided in [16] (see also [14, Ch. 4]) enable us to apply our method to more general examples than the ones treated by the current literature; in particular, to cases where the structural condition required at item (b) above is not verified (see Section 6).
The main results of the paper are the abstract Dynkin formula (Theorem 4.8); the verification theorem (Theorem 5.6); the consequent Corollary 5.7 on sufficient conditions for the existence of optimal control processes in feedback form. Moreover, since the existence of optimal feedback controls might be is easier to obtain when the optimal control problem is considered in the weak formulation, i.e., letting also the stochastic basis to vary, we also provide Corollary 5.8 in this direction. We underline that we do not provide general results on the existence of optimal control processes in feedback form, as such results strongly depend on the specific case at hand. To this regard, in Section 6 — where we deal with two specific applications: optimal boundary control (of Neumann type) of the stochastic heat equation and optimal control of SDEs with delay in the control variable — we provide for the first example some results and comments on the existence of optimal feedback control processes.
The paper is organized as follows. After some preliminaries in Section 2 on spaces, notation and the notion of -derivative recently extended in [16], we introduce our family of control problems in Section 3. Section 4 is devoted to prove our new Dynkin formula (Theorem 4.8), the methodological core of the paper. In Section 5 we prove our main results on the control problem: in Subsection 5.1, the verification theorem (Theorem 5.6); in Subsection 5.2, Corollary 5.7 on optimal feedbacks. Section 6 is devoted to illustrate the applications of our results to the aforementioned examples. Finally the Appendix is devoted to prove few technical results needed to prove our Dynkin formula.
2 Preliminaries
In this section we provide some preliminaries about spaces and notation used in the rest of the paper and recall from [16] the notion of -derivative. We restrict the treatment of -derivative to the case of real valued functions defined on Hilbert spaces and to constant operator maps . This will be enough for the purposes of the present paper. For a more general theory and more details we refer to the aforementioned paper [16].
2.1 Spaces and notation
Measurable bounded and continuous functions.
All the topological spaces are intended endowed with their Borel -algebra, denoted by . By measurable set (function), we always intend a Borel measurable set (function). If is a topological space and is a topological vector space, we denote by the set of bounded measurable functions from to and by the set of bounded continuous functions from to . If , we drop it in the latter notation. If is complete, the spaces and are Banach spaces when endowed with the norm
[TABLE]
Hilbert spaces.
Let be a Hilbert space. We denote its norm by and its inner product by by . We omit the subscript if the context is clear and if . If a sequence , converges to in the norm (strong) topology we write .
We denote by the topological dual of , i.e. the space of all continuous linear functionals defined on . We always identify with through the standard Riesz identification.
Linear operators.
Let be real separable Hilbert spaces. We denote by the set of all bounded (continuous) linear operators with norm , using for simplicity the notation when . Moreover, we denote by the space of closed densely defined and possibly unbounded linear operators , where denotes the domain. We recall that is a Hilbert space when endowed with the graph norm . The range of an operator is denoted by . Clearly, . Given , we denote its adjoint operator by .
We denote by the set of trace class operators, i.e. the operators such that, given an orthonormal basis of , the quantity
[TABLE]
is finite (see [45, Sec. VI.6]). The latter quantity is independent of the basis chosen and defines a norm making a separable Banach space. The trace of an operator is denoted by , i.e. . The latter quantity is finite and, again, independent of the basis chosen. We denote by the subset of of self-adjoint nonnegative (trace class) operators on . Note that, if , then .
We denote by (subset of ) the space of Hilbert-Schmidt operators from to , i.e the spaces of operators such that, given an orthonormal basis of , the quantity
[TABLE]
is finite (see [45, Sec. VI.6]). The latter quantity is independent of the basis chosen and defines a norm making a Banach space. It is actually a Hilbert space with the scalar product
[TABLE]
where is any orthonormal basis of .
Stochastic processes.
Let be a filtered probability space satisfying the usual conditions. Given , , and a Hilbert space , we denote by the set of all (equivalence classes of) progressively measurable processes such that
[TABLE]
This is a Banach space with the norm . Next, we denote by the space of all (equivalence classes of) progressively measurable processes such that for every . We denote by the set of all (equivalence classes of) progressively measurable processes such that
[TABLE]
is continuous. This is a Banach space with the norm
[TABLE]
Next, we denote by the space of all (equivalence classes of) progressively measurable processes such that for every . We also say that elements of and are “-mean continuous”.
2.2 -derivative
Here we provide the notion of -derivative for functions , where is a Hilbert space. The latter notion is considered in [16] when is a map , with Banach spaces. Here we restrict to the case of constant .
Recall that, if , the Fréchet derivative of at (if it exists) is the (unique) linear functional such that
[TABLE]
Definition 2.1** (-derivative).**
Let be Hilbert spaces, let and . We say that is continuously -Fréchet differentiable at (briefly, -differentiable at ) if there exists (clearly, if it exists, then it is unique), called the -derivative of at , such that
[TABLE]
We denote by the space of all maps such that is continuously -differentiable over , i.e. such that is -differentiable at each and the map belongs to . In the special case and , we simply use the standard notation .
Remark 2.2**.**
Note that, in the definition of the -derivative, one considers only the directions in selected by the range of . When and it reduces to the Fréchet derivative, i.e. . Clearly, if is -differentiable at , then it is also -Gateaux differentiable at , in the sense that
[TABLE]
moreover, the limit above is uniform in for every . Conversely, if there exists such that
[TABLE]
then is -differentiable at and .
The notion of -derivative allows to deal with functions which are not Gateaux differentiable, as shown by the following example.
Example 2.3**.**
Let be defined by . Clearly, does not admit directional derivative in the direction at the point . On the other hand, if we consider , defined by , then admits -Fréchet derivative at every .
Remark 2.4**.**
Clearly, if is Fréchet differentiable at some and , it turns out that is -Fréchet differentiable at and
[TABLE]
Also, if is both Fréchet differentiable and -differentiable at some , then and (2.5) holds true. Indeed, we get by Fréchet differentiability
[TABLE]
On the other hand, by -Fréchet differentiability we also have
[TABLE]
Hence
[TABLE]
It follows what claimed.
If is unbounded, a function may be Fréchet-differentiable at some and yet not -Fréchet differentiable there, as shown by the following example.
Example 2.5**.**
Let be Hilbert spaces, let be a closed densely defined unbounded linear operator on , and let be its adjoint. Next, let be defined by . Clearly, is Fréchet differentiable at every and . On the other hand, if was also -differentiable at every , by Remark 2.4 it would follow for every , i.e. , a contradiction.
3 Formulation of the stochastic optimal control problem
We are concerned with the optimal control of an Ornstein-Uhlenbeck process valued in a Hilbert space . Precisely, let three real separable Hilbert spaces, let be a real Banach space and let be measurable and endowed with the -algebra induced by , the Borel -algebra of . Let be a complete filtered probability space satisfying the usual conditions, let be a -valued cylindrical Brownian motion (see [10, Ch. 4]), and consider the controlled SDE
[TABLE]
where the control process , taking values in , belongs to a suitable space of admissible controls and the coefficients satisfy the following assumptions, which will be standing and not repeated throughout the paper.
Assumption 3.1**.**
- (i)
* is a closed densely defined linear operator generating a -semigroup \big{\{}e^{tA}\big{\}}_{t\geq 0} of operators of .* 3. (ii)
, for all , and there exists such that
[TABLE] 4. (iii)
* is a closed densely defined444The assumption that is densely defined can be done without loss of generality, as one can always restrict to . linear operator such that can be extended for every to a continuous linear operator defined on that we denote by . Moreover, there exists , and such that*
[TABLE] 5. (iv)
* is measurable and \big{|}L(u)\,\big{|}_{K}\leq C_{L}(1+|u|_{U}) for some .*
Remark 3.2**.**
Since for every and the operators and belong to and coincide on the dense subset , we have
[TABLE]
This implies that the map is strongly continuous, i.e. is continuous for each .
We now take
[TABLE]
which will be fixed in the rest of the paper. We consider, as space of admissible controls, the space of processes
[TABLE]
The reason for the choice of in (3.2) and of in (3.4)-(3.5) relies on the following result (cf. also [20, Prop. 8.8] and [23, Lemma 3.2]), which will guarantee well-posedness of the controlled state equation (Proposition 3.4).
Lemma 3.3**.**
Let be real Banach spaces, let , . Let and let be strongly continuous555Meaning that is continuous for each . and such that for some for every . Then defined as Bochner integral by
[TABLE]
is well defined and continuous.
Proof.
Let . First of all, we note that the map
[TABLE]
is measurable for each . Indeed, given the above map can be seen as the composition where
[TABLE]
Now, is clearly measurable. Also is measurable, as it is continuous: indeed is continuous for each and is a family of uniformly bounded operators for each . Hence is measurable.
Given the above, it makes sense to consider in Bochner sense for each . By Hölder’s inequality, setting , we have for each
[TABLE]
This show, at once, that is well defined as Bochner integral in and that , so is continuous at [math].
Let us show now that is continuous on each interval of the form with . Set, for ,
[TABLE]
By dominated convergence we easily see that is continuous on . Moreover, using again Hölder’s inequality we have, for all
[TABLE]
This show uniformly in , hence is continuous in , concluding the proof.
Proposition 3.4**.**
For each , the process
[TABLE]
is well-defined and belongs to . Moreover, it admits a version with continuous trajectories.
Proof.
By Remark 3.2 and Assumption 3.1(iii)-(iv), we can apply Lemma 3.3 with
[TABLE]
and defined by
[TABLE]
It follows that
[TABLE]
is well defined as stochastic process and belongs to . We can repeat the argument employed above dealing now with trajectories. Fixing and applying Lemma 3.3 with
[TABLE]
it follows that the map
[TABLE]
is continuous. The latter integral expression, for varying , clearly provides a version of (3.7) with continuous trajectories.
On the other hand, in view of Assumption 3.1(ii), from [10, Th. 5.2 and Th. 5.11] we know that the stochastic convolution
[TABLE]
is a (well defined) stochastic process belonging to and admitting a version with continuous trajectories, concluding the proof.
We refer to the process (3.6) as the controlled Ornstein-Uhlenbeck process or mild solution of SDE (3.1). We always consider its version (unique, up to indistinguishability) with continuous trajectories.
Let , , and let be such that
[TABLE]
Consider the functional
[TABLE]
By (3.8), the functional above is well defined (possibly with value ) for all and . The stochastic optimal control problem consists in minimizing the functional over the set of admissible controls , i.e. in solving the optimization problem
[TABLE]
The function is the so called value function of the optimization problem. If is such that and is such that , then is called optimal strategy and the associated state trajectory is called optimal state; moreover the couple \big{(}u^{*}(\cdot),X(\cdot;x,u^{*}(\cdot))\big{)} is called an optimal couple.
4 Generalized Dynkin’s formula
The aim of the present section is to prove an abstract Dynkin formula for the controlled Ornstein-Uhlenbeck process (3.6) composed with suitably smooth functions .
4.1 Transition semigroups, generators and -derivatives
We consider the family of transition semigroups associated to the uncontrolled version of (3.6) and to the same process under constant controls. Precisely, we denote by , where , the Ornstein-Uhlenbeck process starting at with extra drift ; i.e., the mild solution to
[TABLE]
Its explicit expression is
[TABLE]
Correspondingly, we define the family of linear operators \big{\{}P^{(k)}_{t}\big{\}}_{t\geq 0} in the space as
[TABLE]
In Proposition 4.3(i) below we will show that the family \big{\{}P^{(k)}_{t}\big{\}}_{t\geq 0} is a one-parameter semigroup of linear operators in the space . According to the related the literature, we call it the transition semigroup associated to the process . Unfortunately, such semigroup is not in general a -semigroup in , not even in the case . Indeed, in the framework of spaces of functions not vanishing at infinity, the -property, i.e. the fact that in the sup norm for every , fails even in basic cases. For instance, this property fails in the case of the Ornstein-Uhlenbeck semigroup in the space (see, e.g., [4, Example 6.1] for a counterexample in , or [8, Lemma 3.2], which implies this is a -semigroup in if and only if the drift of the SDE vanishes). Even worse: given , the map , is not in general measurable, as shown in [16, Example 4.5]. This prevents, for instance, to intend in Bochner sense, in the space for each , the integral defining the Laplace transform
[TABLE]
Nevertheless, one can get, in a weaker sense, several statements of the classical theory of -semigroups. This is performed, e.g., by the theory of -semigroups (introduced in [4], see also [6], with the different terminology of weakly continuous semigroups) and -semigroups (introduced in [43, 44]). Both theories (a survey of which can be found in Appendix B.5 of [14]) can be applied here getting substantially the same results. We employ the -semigroups approach, as it seems more natural in our context. The definition of -convergence can be found e.g. in [12, p. 111], where it is called bp-convergence (bounded-pointwise convergence) and in [43, 44]; the former in the space , the latter in the space .
Definition 4.1** (-convergence).**
A sequence of functions is said to be -convergent to a function if
[TABLE]
*Such convergence is denoted by or by *
Now we recall the definition of semigroup as given in [43, 44]. Here we state it in the space of continuous and bounded functions (the aforementioned references deal with the space of uniformly continuous and bounded functions, but also explain how to extend the definition to ).
Definition 4.2**.**
A semigroup \big{\{}P_{t}\big{\}}_{t\geq 0} of bounded linear operators on is called a semigroup on if it satisfies the following conditions.
- (P1)
There exist and such that for every , . 2. (P2)
For each and , the map is continuous. 3. (P3)
We have
[TABLE]
Define
[TABLE]
and
[TABLE]
It is proved (see [6, Lemma. 5.7] combined with the discussion of [43, Sec. 4.3]) that, for sufficiently smooth,
[TABLE]
We will use (4.7) to formally motivate the definition of mild solution (Definition 5.1) of the HJB equation associated to the control problem of Section 3.
Proposition 4.3**.**
Let .
- (i)
*The family of linear operators \big{\{}P_{t}^{(k)}\big{\}}_{t\geq 0} defined in (4.3) is a -semigroup on . We denote by its infinitesimal generator. * 2. (ii)
The operator
[TABLE]
belongs to for every and is the resolvent of :
[TABLE] 3. (iii)
We have777At the derivative is intended as right derivative.**
[TABLE]
Proof.
Claims (ii)-(iii) follow from [43, Prop. 3.2, Prop. 3.6] or [44, Prop. 6.2.7, Prop. 6.2.11](888These references deal mainly in the space of uniformly continuous and bounded functions — we warn that the author denotes by the latter space. The extension to the space of continuous and bounded function — our space — is illustrated in [43, Sec. 5] and [44, Sec. 6.5].) once one proves claim (i), which we prove below.
Proof of (i). First of all, we prove that \big{\{}P_{t}^{(k)}\big{\}}_{t\geq 0} is a semigroup of linear operators on . The fact that and that for all is immediate. The semigroup property of \big{\{}e^{tA}\big{\}}_{t\geq 0} and (3.3) yield
[TABLE]
The latter shows the strong Markov property of and then the fact that \big{\{}P^{(k)}_{t}\big{\}}_{t\geq 0} satisfies the semigroup property follows as consequence (see, e.g., [10, Cor. 9.15]). Now we show the other properties of Definition 4.2. (P1) is obviously verified with and . (P2) of Definition 4.2 corresponds to
[TABLE]
The latter follows from continuity of trajectories of and dominated convergence. Finally, (P3) of Definition 4.2 is verified by dominated convergence.
A key step towards the main goal of this section, i.e. the proof of a generalized Dynkin formula for with a suitably regular , consists in showing the following decomposition of when acting on the function
[TABLE]
Looking at \big{\{}P_{t}^{(k)}\big{\}}_{t\geq 0} as to a perturbation of \big{\{}P_{t}^{(0)}\big{\}}_{t\geq 0}, (4.9) is obtained in [25, Theorem 5.2] in the context of -semigroups with respect to mixed topology of and in [15, Theorem 4.6] in the context of bi-continuous semigroups. However, these references would require the assumptions that and are such that and . This would allow, in particular, to write the term in the formula above as , simplifying a lot the framework. Here we need to be sharper in this respect in order to cover other cases of interest in applications, e.g., the case of unbounded , occurring in boundary control problems. To this purpose we introduce the class of functions
[TABLE]
Our generalized Dynkin formula will hold for functions belonging to . In Appendix 6 we provide sufficient conditions on ensuring that .
Proposition 4.4**.**
*Let . Then (4.9) holds. *
Proof.
Since , we can write for every
[TABLE]
if the last limit exists. Observe that
[TABLE]
Therefore, since , continuity of and by dominated convergence yield
[TABLE]
The claim follows.
4.2 Proof of the generalized Dynkin’s formula
We introduce the linear space of -valued -integrable càdlàg simple processes. An element is of the form
[TABLE]
for some , and such that for all . Processes in are progressively measurable. By arguing as in the proof of Proposition 3.4 we get that, for any , the process
[TABLE]
is well defined, belongs to and has a version with continuous trajectories. We will always refer to the version of this process (unique up to indistinguishability) having continuous trajectories. Given , we write
[TABLE]
Again arguing as in the proof of Proposition 3.4 we see that this process has a version with having continuous trajectories. As above we will always refer to this version (unique up to indistinguishability).
Recall that, if , are two random variables with values, respectively, in two measurable spaces and , a version of the conditional law of given is a family of probability measures \big{\{}\mu(\cdot,v_{2})\big{\}}_{v_{2}\in E_{2}} on such that, for every , the map is measurable and
[TABLE]
where . This family, if it exists, is unique up to -null measure sets.
Lemma 4.5**.**
Let be in the form (4.11) and for some A version of the conditional law of given the couple is the family
[TABLE]
Proof.
The proof is standard (see [36, Ch. 2, Sec. 9] in finite dimension and in a much more general setting) and we omit it for brevity.
Lemma 4.6**.**
Let and . Then
[TABLE]
where the derivative has to be intended as right derivative at the times , where the simple process jumps.
Proof.
Let be as in (4.11), for some , and . Denote by the law of the couple . By Lemma 4.5, we have
[TABLE]
Now we differentiate under the integral sign using the fact that, by Proposition 4.4, and the fact that is bounded over . Then, using Proposition 4.3(i) and (4.9), we get
[TABLE]
the claim.
Lemma 4.7**.**
For each and , there exists a sequence such that
[TABLE]
Proof.
Fix and set . By standard arguments (see, e.g., [34, Ch. III, Lemma. 2.4, p.132])999It is worth to point out some differences. First, we are dealing with càdlàg approximations (as it is more meaningful and natural to state Proposition 4.6) rather than with càglàd (as in [34, Ch. III, Lemma. 2.4, p.132]): this is not a problem as, from the point of view of integration, these classes coincide. Second, we are dealing with Hilbert-valued processes: therefore, more technical care is needed as the approximation is produced by Bochner integration., we can construct a sequence such that
[TABLE]
Then, using the expression (3.6) for the state variable, the convergence
[TABLE]
follows by simply applying dominated convergence.
Theorem 4.8** (Dynkin’s formula).**
Let . Then, for every , , and , we have
[TABLE]
Proof.
Let and take the approximating sequence provided by Lemma 4.7. Then, applying, for each , Lemma 4.6, we obtain from (4.13) (by taking the right derivatives at ), for all and ,
[TABLE]
Since the function t\mapsto\mathbb{E}\left[e^{-\lambda t}\varphi\big{(}X^{\kappa_{n}(\cdot)}(t;x)\big{)}\right] is everywhere continuous and stepwise differentiable, we can apply the Fundamental Theorem of Calculus. So, integrating on , we get
[TABLE]
Now, letting , we get the claim by dominated convergence from Lemma 4.7, observing that , , and are bounded.
Remark 4.9**.**
The results of this section, in particular Theorem 4.8, can be extended, at the price of straightforward technical complications, to the case when the basic space of functions is, instead of , the space , where , used e.g. in [16]:
[TABLE]
Also the results of next Section 5 can be extended to this setting covering more general cases, in particular when the current cost of the control problem has polynomial growth in . We do not do this here for brevity.
5 HJB equation, verification theorem and optimal feedbacks
By standard Dynamic Programming arguments, one formally associates to the control problem of Section 3 the following HJB equation for the value function (3.10):
[TABLE]
where and the Hamiltonian is defined by
[TABLE]
where
[TABLE]
Note that this definition is only formal as may be not defined, since may not belong to . It is then convenient to introduce the modified Hamiltonian
[TABLE]
where
[TABLE]
Observing that
[TABLE]
(5.1) can be formally rewritten as
[TABLE]
Note that, in principle, may take the value somewhere. The concept of mild solution to (5.1) relies on Proposition 4.3(ii) and on (4.7), inspiring an integral form of (5.6) through the use of the semigroup \big{\{}P_{s}^{(0)}\big{\}}_{s\geq 0}.
Definition 5.1**.**
We say that a function is a mild solution to (5.6) if it belongs to , is bounded and solves the integral equation
[TABLE]
Remark 5.2**.**
The problem of existence and uniqueness of mild solutions for equations in the form (5.6) is addressed in [16] and in [14, Ch. 4]. In particular, existence and uniqueness of mild solutions is stated for sufficiently large , under the following assumptions (see [16, Cor. 4.12, Th. 3.8(ii)] with ):
- (A1)
* for every , where .*
- (A2)
The operators101010Here is the pseudo-inverse of .**
[TABLE]
which are well defined by (A1) and bounded by the closed graph theorem, are such that the map belongs to and is bounded in a neighborhood of .
- (A3)
The Hamiltonian satisfies, for suitable .
[TABLE]
[TABLE]
Some results in the case of locally Lipschitz Hamiltonian are available, up to now, only in special cases (see [10, Sec. 13.3.1] and [5]).
Due to Proposition 4.3(ii), a mild solution of (5.1) enjoys the property of being a solution to the same equation also in a differential abstract way, i.e., we have the following.
Proposition 5.3**.**
Let be a mild solution to (5.6). Then and
[TABLE]
Proof.
Using Proposition 4.3(ii), we rewrite (5.7) as
[TABLE]
This entails and, applying to both sides, we see that solves (5.8).
Remark 5.4**.**
By Proposition 5.3 a mild solution to (5.6) belongs to . Hence, in order to apply Theorem 4.8 to it, we only need to assume that . This is what we indeed assume in all the next results of this section.
5.1 Verification theorem
The proof of the verification theorem relies in the so called fundamental identity.
Proposition 5.5** (Fundamental identity).**
Let (3.8) hold. Let be a mild solution to (5.6) and assume that . Let and let be such that
[TABLE]
Then
[TABLE]
Proof.
Let , , and let be such that (5.10) holds. Using Proposition 5.3 and applying the abstract Dynkin formula (Theorem 4.8) to , we get
[TABLE]
Since is measurable and bounded from below by (3.8), the term \mathbb{E}\big{[}\int_{0}^{T}e^{-\lambda t}l\big{(}X(t;x,u(\cdot)),u(t)\big{)}\,dt\big{]} is well defined, possibly equal to . However, (5.10) actually entails
[TABLE]
Then, we can add and subtract \mathbb{E}\big{[}\int_{0}^{T}e^{-\lambda t}l\big{(}X(t;x,u(\cdot)),u(t)\big{)}dt\big{]} in (5.12) and use (5.5) to get, rearranging the terms,
[TABLE]
Now we let . The right hand side has a limit (possibly ), as the integrand is positive. The left hand side clearly converges to . This implies that also the limit of the right hand side is finite and
[TABLE]
The claim follows rearranging the terms.
Theorem 5.6** (Verification theorem).**
Let (3.8) hold. Let be a mild solution to (5.6) and assume that . We have the following.
- (i)
* over .*
- (ii)
Let and assume that there exists such that
[TABLE]
Then .
Proof.
(i) By (5.11), for all such that (5.10) holds, we have , which yields this claim.
(ii) Let such that (5.14) holds. If , then, from (5.11), we immediately get , which, combined with item (i), yields the claim. We now prove that it cannot be . Assume, by contradiction, that . Then, by (5.14), we have
[TABLE]
By (5.8), is bounded. Hence, Assumption 3.1-(iv), the fact that and (5.15) imply \mathbb{E}\big{[}\int_{0}^{T}e^{-\lambda t}l\big{(}X(t;x,u^{*}(\cdot)),u^{*}(t)\big{)}\,dt\big{]}<\infty for all . Then, we can argue as in the proof of Proposition 5.5 getting (5.13) with in this case and, using again (5.14),
[TABLE]
Letting we get , a contradiction, as is finite.
5.2 Optimal feedback controls
As usual, the verification theorem is composed of two statements: the first one states that the solution to the HJB equation enjoys the property of being smaller than the value function; the second one is the most important from the point of view of the control problem, as it furnishes a sufficient condition of optimality ((5.14) in our case). Then, the problem becomes the so-called synthesis of an optimal control, i.e. to produce a control verifying such condition. The answer relies in the study of the closed loop equation.
Let be a mild solution to HJB equation (5.6). Assuming that the infimum of the map
[TABLE]
is attained and defining the multivalued function (feedback map)
[TABLE]
the closed loop equation (CLE) associated with our problem and to is indeed a stochastic differential inclusion:
[TABLE]
We have the following result.
Corollary 5.7**.**
Let (3.8) hold. Let be a mild solution to (5.6) and assume that . Let and assume that the feedback map defined in (5.17) admits a measurable selection and consider the SDE
[TABLE]
Assume that (5.19) has a mild solution in , i.e. there exists such that
[TABLE]
Define, for , and assume that . Then . In particular the couple is optimal at .
*Moreover, if is single-valued and the mild solution to (5.19) is unique, then the optimal control is unique. *
Proof.
Consider the couple and observe that is the unique mild solution (in the strong probabilistic sense) of the state equation associated to the control , so that . By construction such couple satisfies (5.14). Then, by Theorem 5.6-(ii) we obtain that it is optimal.
Let us address now the uniqueness issue. We observe that, if is another optimal couple at , we immediately have, by (5.11) and the fact that ,
[TABLE]
As the integrand is always negative and as is single-valued, this implies that -a.e. we have \hat{u}(\cdot)=\Phi\big{(}X(\cdot;x,\hat{u}(\cdot))\big{)}. This shows that solves (5.19). Then uniqueness of mild solutions to (5.19) gives the claim.
We conclude the section commenting on the extension of our results to the case when the control problem is considered in the so-called weak formulation. So far, we have considered our family of stochastic optimal control problems in the strong formulation. It is possible to consider the problem also in the so-called weak formulation, i.e. letting the filtered probability space and the Wiener process vary with the control strategy (see, e.g., [48, Ch. 2]). More precisely, in the weak formulation, the control strategy is a -tuple . Calling the set such control strategies, the objective is to minimize the cost (3.9) over . The resulting value function is, in principle, smaller than . The main advantage in choosing such formulation is that existence of optimal control strategies in feedback form is easier to obtain. The verification theorem above also holds when we consider the control problem in its weak formulation. Indeed, the proof of Theorem 5.6 works for every filtered probability space and any cylindrical Brownian motion on it. Hence, letting the filtered probability space and the cylindrical Brownian motion vary, one gets that over . Moreover, if (5.14) holds for a given control strategy (111111Elements of are, rigorously speaking, -tuples; however, for simplicity, we denote them simply by .) , then we have . One gets the following.
Corollary 5.8**.**
Let (3.8) hold. Let be a mild solution to (5.6) and assume that . Let and assume that the feedback map defined in (5.17) admits a measurable selection . Assume now that (5.19) has a martingale solution121212Weak-mild solution in the terminology of [14]. (see [10, p. 220] or [23, Def. 3.1, p. 75] for the definition) in some filtered probability space \left(\overline{\Omega},\overline{\mathcal{F}},\big{\{}\overline{\mathcal{F}_{t}}\big{\}}_{t\geq 0},\,\overline{\mathbb{P}}\right) and for some -valued cylindrical Brownian motion defined on it. Define, for , \overline{u_{\phi}}(s)=\phi\big{(}\overline{X_{\phi}}(s;x)\big{)} and assume (131313In the sense that the -tuple identified by belongs to .). Then . In particular \big{(}\overline{u_{\phi}}(\cdot),\overline{X_{\phi}}(\cdot;x)\big{)} is an optimal couple.
6 Applications
In the present section we provide two examples of application of our results.
The first example, fully developed, concerns the optimal control of the stochastic heat equation in a given space region when the control can be exercised only at the boundary . Precisely, we consider the case when the control at the boundary enters through a Neumann-type boundary condition, corresponding to control the heat flow at the boundary. The existence and uniqueness of mild solutions to the associated elliptic HJB equation in this case is guaranteed (under suitable conditions) by the results of [16].
The second example concerns the optimal control of a stochastic differential equation with delay in the control process (see [29, 30] for the treatment of the same problem over finite horizon). In this case, the result we give needs to assume the existence of a mild solution to the associated elliptic HJB equation. The reason for that is that a theory of mild solutions for elliptic HJB equations associated to this kind problem has not been yet developed in the elliptic case. Indeed, unlike the first example, this kind of equations is not covered by the results of [16], due to the lack of -smoothing. In this case it is needed an ad hoc treatment of the equation, dealing with the specific case at hand, to show the existence of mild solutions (see, e.g., the aforementioned references [29, 30] in the parabolic case). Although a result of this kind for elliptic equation seems straightforward, a rigorous statement of this result has not been rigourously fixed yet. For this reason, we limit ourselves to provide a weaker result taking the existence of mild solutions to the associated HJB equation as an assumption and leaving the investigation of that for future work. Due to the lack of a rigourous background on which relying our results, we do not state in this case a theorem and just keep the arguments at the level of an informal exposition.
6.1 Neumann Boundary control of a stochastic heat equation with additive noise
We consider the optimal control of a nonlinear stochastic heat equation in a given space region when the control can be exercised only at the boundary of .
6.1.1 Problem setup
Let be an open, connected, bounded subset of with regular (in the sense of [37, Sec. 6]) boundary 141414We stress that such conditions may allow corners in the boundary: in particular, when squares satisfy the required regularity.. We consider the controlled dynamical system driven by the following SPDE in the time interval :
[TABLE]
where:
- •
is the stochastic process describing the evolution of the temperature distribution and is the state variable of the system;
- •
is the stochastic process representing the heat flow at the boundary; it is the control variable of the system and acts at the boundary of it: this is the reason of the terminology “boundary control";
- •
is the outward unit normal vector at the boundary ;
- •
is the initial state (initial temperature distribution) in the region ;
- •
is a cylindrical Wiener process in ;
- •
.
Assume that this equation is well posed (in some suitable sense, see below for the precise setting) for every given in a suitable set of admissible control processes and denote its unique solution by to underline the dependence of the state on the control and on the initial datum . The controller aims at minimizing, over the setof admissible controls, the objective functional
[TABLE]
where are given measurable functions bounded from below and is a discount factor.
6.1.2 Infinite dimensional setting
We now rewrite the state equation (6.1) and the functional (6.2) in an infinite dimensional setting in the space . For more details, we refer to [16, Sec. 5] and references therein. Consider the realization of the Laplace operator with vanishing Neumann boundary conditions 151515To be precise, is the closure in of the set of functions having vanishing normal derivative at the boundary .:
[TABLE]
It is well-known (see, e.g., [39, Ch. 3]) that generates a strongly continuous analytic semigroup \big{\{}e^{tA_{N}}\big{\}}_{t\geq 0} in . Moreover, is a self-adjoint and dissipative operator. In particular , where denotes the resolvent set of . So, if , then is invertible and . Moreover (see, e.g., [37, App. B]) the operator is compact. Consequently, there exists an orthonormal complete sequence such that the operator is diagonal with respect to it:
[TABLE]
for a suitable sequence of eigenvalues repeated according to their multiplicity (they are nonnegative due to dissipativity of ). We assume that such sequence is increasingly ordered. Then, , as clearly the constant functions belong to Ker , and for each , since, as an immediate consequence of the Gauss-Green formula, only the constant functions belong to Ker . Moreover, [46, Sec. 5.6.2, p. 395] (see also [37, App. B]) provides also a growth rate for the sequence of eigenvalues; indeed
[TABLE]
We have (see, e.g., [37, App. B]) the isomorphic identification
[TABLE]
where denotes the Sobolev space of exponent . Next, consider the following problem with Neumann boundary condition:
[TABLE]
Given any and , there exists a unique solution to (6.7). Moreover, the operator (Neumann map)
[TABLE]
is continuous (see [38, Th. 7.4]). So, in view of (6.6), the map
[TABLE]
is continuous. In [16, Sec. 5], it is shown that the natural abstract reformulation of the original control problem in the space is
[TABLE]
where , , and for . We are now in the framework of (3.1), with , , , , and . Let us consider, as set of admissible controls,
[TABLE]
where and will be specified later according to (3.4). Defining
[TABLE]
and
[TABLE]
the functional (6.2) can be rewritten in the Hilbert space framework as
[TABLE]
6.1.3 HJB equation and verification theorem
Setting , the HJB equation associated to the minimization of (6.11) is
[TABLE]
Since the semigroup \big{\{}e^{tA_{N}}\big{\}}_{t\geq 0} is strongly continuous and analytic, then by [42, Th. 6.13(c)] the operator can be extended to for every and
[TABLE]
Hence, Assumption 3.1(i) and (iii) is satisfied with , , and . Consequently, recalling (3.4), we choose .
Now, assume the following.
- (H1)
satisfies Assumption 3.1(ii).
- (H2)
Conditions (A1) and (A2) of Remark 5.2 hold true with .
- (H3)
, so 161616According to Remark 4.9 it is possible to deal with the case when , and so , has polynomial growth.. Moreover the map , defined by
[TABLE]
is Lipschitz continuous. These conditions imply that satisfies condition (A3) of Remark 5.2.
Then, under such assumptions, by Remark 5.2, for sufficiently large there exists a unique mild solution to (6.12). By definition of mild solution, we have . Furthermore, Assumption A.2 is verified through Remark A.3 in this case. Hence Proposition A.4 applies yielding and enabling the application of Theorem 5.6. We now discuss the validity of the above assumptions (H1)–(H3).
- •
On the validity of (H1). First of all, we note that in Assumption 3.1(ii), we can take as small as we want; indeed, if this assumption holds true for some , then it holds true also for all . By (6.4), the operator is diagonal with respect to the orthonormal basis with eigenvalues . Assumption 3.1(ii) rewrites as
[TABLE]
Applying Fubini-Tonelli’s Theorem and considering (6.5) we see that (6.14) holds if
[TABLE]
Let be such that
[TABLE]
(recall that , so always verifies (6.16)). Considering that can be taken as small as we want and combining (6.15) and (6.16), we conclude that (H1) holds if we may take in (6.16)
[TABLE]
In particular, if , then (H1) holds true for all .
- •
On the validity of (H2). By (6.5), we have, for ,
[TABLE]
The operator is diagonal too with respect to and
[TABLE]
Assume now further that is diagonal with respect to and nondegenerate, i.e. for every where for every . Set for . Then is diagonal too. Moreover and and
[TABLE]
Hence, with the agreement if , we have
[TABLE]
Since |\Gamma_{G}(t)|_{\mathcal{L}(H)}=\sup_{k\in\mathbb{N}}\big{|}\Gamma_{G}(t)e_{k}\big{|}_{H}, then, with the agreement that if , conditions (A1) and (A2) of Remark 5.2 hold true if and only if
[TABLE]
Assume that
[TABLE]
and let and be such that for some and every . Considering (6.5), let and be such that for every Calling it is clear that, for a suitable ,
[TABLE]
Hence, to prove (6.19) above, we take and we rewrite (6.19) (up to a constant depending on ) as
[TABLE]
Noting that , we can estimate
[TABLE]
Therefore, (H2) is satisfied whenever (6.20) holds for some such that . As can be taken arbitrarily small, we conclude that (H2) can be fulfilled if (6.20) holds for some such that
[TABLE]
- •
On the simultaneous validity of (H1)–(H2). Looking at (6.17) and (6.22), we see that (H1)-(H2) can be simultaneously fulfilled by choosing a suitable if is diagonal with respect to and (6.20) is verified for some such that
[TABLE]
These requirements can be fulfilled only for dimension .
- •
On the validity of (H3). This is guaranteed, for instance, if is bounded, is continuous and bounded, is measurable.
6.1.4 Optimal Feedback Controls
In the framework of the previous subsection, we look now at the existence of optimal feedback controls.
Theorem 6.1**.**
Let (H1)–(H3) of the previous subsection hold. Assume that the multi-valued map
[TABLE]
admits a Lipschitz continuous selection and that is Lipschitz continuous. Set . Then the SDE
[TABLE]
admits a unique mild solution (in the sense of (5.20)) admitting a version with continuous trajectories. As a consequence, Corollary 5.7(i) applies providing the optimality of the couple \big{(}{u_{\phi}}(\cdot),{X_{\phi}}(\cdot;x)\big{)}, where for .
Proof.
By the assumptions, the map is Lipschitz continuous too. Then the proof follows the classical fixed point arguments as in standard results of existence and uniqueness of SDEs in infinite dimension, see e.g. [10, Theorem 7.5]. Here we only need to take care of dealing with in place of in the convolution term and use (3.2) with .
The assumption that defined in (6.24) admits a Lipschitz continuous selection is guaranteed, for example, if , is strictly convex,
[TABLE]
is Fréchet differentiable, and has Lipschitz continuous inverse. Indeed, in this case the infimum in (6.24) is uniquely achieved (hence, is single-valued) at
[TABLE]
Hence, if we are able to check that is Lipschitz continuous, we can then apply Corollary 5.7(i) in its strongest form to get uniqueness of the optimal control constructed.
On the other hand, checking that is Lipschitz continuous might be, in general, a very difficult task171717This can be done assuming more regularity of — hence of — and proving a suitable property of . See, e.g., the approach used in [31] or in [29]., whereas mere continuity of is a condition already “contained” in the definition of mild solution to (6.12). Hence, it would be meaningful to provide a Peano type result 181818This is not straightforward: in infinite dimension Peano’s Theorem fails in general (see [24]). of existence of mild solutions to CLE (6.25). This seems possible when a selection of in (6.24) is known to be only continuous and bounded on bounded sets, as
- (i)
the semigroup is compact;
- (ii)
as is continuous and bounded by construction, the map is continuous and bounded.
Indeed, in such a framework, it seems possible to use the methods of [7, Prop. 3] (see also [22]), passing through the use of the so-called Skorohod representation theorem, to construct martingale solutions to (6.25); hence, to construct optimal feedback controls in the weak formulation.
Remark 6.2**.**
In the specific case we are handling, where the diffusion term is just additive in the equation, a way to construct the solution in the original probability space might consist in constructing a pathwise solution dealing with a parameterized family of deterministic problems with parameter (see [2], [9, Sections 14.2 and 15.2], [19], [40]). Once this is done, the problem is to prove that the family of solutions constructed by admits an adapted selection. The existence of a selection measurable with respect to can be obtained using measurable selection theorems (see again [2]); proving that this selection is also adapted is a problematic task, which is still open. In the case when one knows ex ante that the pathwise solution is unique for a.e. , then F. Flandoli (personal communication) showed us how to accomplish this task. Unfortunately, in our case, the uniqueness of the solutions of the deterministic equations for a.e. only holds when the properties of the coefficients allow to find directly mild solutions to SDE (6.25).
6.2 Stochastic optimal control with delay in the control variable
Here we consider an infinite horizon version of a control problem studied in [29, 30]. Consider the following linear controlled one dimensional SDE:
[TABLE]
where
- •
is a standard one dimensional Brownian motion;
- •
, ;
- •
represents the maximum delay the control takes to affect the system;
- •
is a (real-valued) function weighting the aftereffects of the control on the system; we consider here the case of distributed delay, i.e. when .
The initial data are the initial state and the past history of the control. The control takes values in a closed subset and belongs to (defined by (3.5) with ).
Such kind of equations (even in a deterministic framework) have been used to model the effect of advertising on the sales of a product [27, 28, 17], the effect of investments with time to build on growth [13, 1], to model optimal portfolio problems with execution delay [3], to model the interaction of drugs with tumor cells [35, p. 17].
Denoting by the unique solution to (6.26), the goal of the problem is to minimize, over all control strategies in , the following objective functional
[TABLE]
where and are measurable and bounded from below. It is important to note that here and do not depend on the past of the state and/or control. This is a very common feature of many applied problems.
A standard way to approach these delayed control problems, introduced in [47] for the deterministic case and extended to the stochastic case in [27], is to reformulate them as equivalent infinite dimensional control problems without delay191919It must be noted that, under suitable restrictions on the data, one can treat (stochastic) optimal control problems with delay avoiding to look at them as infinite dimensional systems (see [18]). However, this is possible only in very special cases, leaving out a lot of of concrete applications.. The details are given in [29] for the finite horizon case, which is completely similar to the infinite horizon case, with the obvious changes (see also [17] for the infinite horizon case in a deterministic framework with a different embedding space). Consider the Hilbert space , set , and assume, without loss of generality, . The state equation (6.26) is rephrased in as a linear SDE with state variable as follows:
[TABLE]
where
[TABLE]
[TABLE]
and the initial datum is defined as
[TABLE]
It is well known that is the generator of a -semigroup of linear bounded operators on . Note that the infinite dimensional datum depends on the “initial past” of the control. It turns out that , so (6.27) is rewritten as
[TABLE]
Setting , the HJB equation associated to the minimization of (6.29) is
[TABLE]
Notice that , where the latter symbol denotes the directional derivative along the direction . So, the nice feature of the equation above is that the nonlinearity on the gradient only involves the directional derivative . Note also that here we do not have the so called structural condition ; this prevents the use of techniques based on Backward SDEs (see, e.g., [21]) to tackle the problem.
Now we check if the assumptions of our main result Theorem 5.6 are verified. First of all, it is easy to check that Assumption 3.1 and Assumption A.5 hold. The third assumption, i.e. the existence of a mild solution to (6.30) needs to be discussed.
In [29], the authors study a finite horizon optimal control problem with the same state equation (6.26) and a similar objective functional. Exploiting only partial smoothing properties of the transition semigroup associated to the state equation (6.28) with null control, the authors are able to provide, under suitable reasonable assumptions on the data, existence and uniqueness results for the parabolic HJB equation associated to the control problem.
We believe that the approach of [29] can be adapted to our infinite horizon case, getting a mild solution to HJB (6.30). Then, to apply our theory one should prove that such function is Lipschitz continuous on compact sets, which enables to apply Proposition A.6 to get . To get this goal one can proceed as in [29] by assuming more regularity on the data of the problem. More precisely, assuming that and that the Hamiltonian is differentiable with Lipschitz continuous derivative, [29] proves that the mild solution . This fact, in particular, implies the required Lipschitz continuity of . In [30] the authors also provide a verification theorem for their finite horizon problem. They use an approximation procedure of the solution of the HJB equation, which our results allow to avoid here.
Appendix A Appendix
Recall that, given , the pseudo-inverse is defined as the operator that associates to each the element of having minimum norm.202020Existence and uniqueness of such an element follows from the fact that is a closed operator and applying the results of [11, Sec. II.4.29, p. 74]). Note that is bounded, so it can be extended to a bounded operator .
Lemma A.1**.**
We have
[TABLE]
Proof.
Assume first that . In this case . Then, using Remark 2.4, we write
[TABLE]
If , we can take a sequence converging to . Considering (A.1) on and passing to the limit the claim follows taking into account that is bounded.
Assumption A.2**.**
The operator is such that for every
- (i)
*there exists such that ; *
- (ii)
, as .
Remark A.3**.**
Note that for every and . So, in view of the fact that is bounded, Assumption A.2 is verified, in particular, if , and, for sufficiently small ,
[TABLE]
This applies, e.g., to the case when is dissipative and generates an analytic semigroup, and with and (see the example of Section 6.1).
Proposition A.4**.**
Let Assumption A.2 holds. Then .
Proof.
Fix , and let be as in Assumption A.2(i). Noting that for every , by Assumption A.2(i) we can write
[TABLE]
Moreover, by Assumption A.2(ii), we have
[TABLE]
Fix now . Using (A.2) we write
[TABLE]
Mean value theorem applied to the function yields (see also Remark 2.4)
[TABLE]
Hence, (A) rewrites as
[TABLE]
Moreover, we can estimate
[TABLE]
Now we are going to take the limit for in (A). To this purpose, we observe that, as and \big{\{}z(t)\big{\}}_{t\in(0,\varepsilon)} is compact in , we have
[TABLE]
By definition of (see (A.2)), we have . Hence, (A.7) provides
[TABLE]
Hence, combining (A.3), (A.6) and (A.8), we get
[TABLE]
Moreover, (A.3) and the continuity of the maps and entails
[TABLE]
Combining (A), (A.9), (A.10), and Lemma A.1, the claim follows.
Assumption A.5**.**
.
Proposition A.6**.**
Let Assumption A.5 hold and let be Lipschitz continuous on compact sets. Then .
Proof.
Let . Observe that, as , we have , for every , and
[TABLE]
Let . We can split
[TABLE]
The set \bigg{\{}z(t)+\displaystyle\int_{0}^{t}{e^{sA}G}k\,ds\bigg{\}}_{t\in(0,1)}\bigcup\,\bigg{\{}z(t)+tGk\bigg{\}}_{t\in(0,1)}\subset K is precompact. Hence, by Lipschitz continuity of on compact sets, we have for some independent of
[TABLE]
We let now in (A). Combining with (A.13) and (A.11) we get
[TABLE]
provided that the limit in the right hand side above exists, as we are going to show. We write
[TABLE]
By the equalities above and considering that , we have
[TABLE]
and the claim follows from (A.14).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Aid, S. Federico, H. Pham, and B. Villeneuve. Explicit investment rules with time-to-build and uncertainty. J. Econom. Dynam. Control , 51:240–256, 2015.
- 2[2] A. Bensoussan and R. Temam. Équations stochastiques du type Navier-Stokes. J. Funct. Anal. , 13(2):195–222, 1973.
- 3[3] B. Bruder and H. Pham. Impulse control on finite horizon with execution delay. Stoch. Proc. Appl. , 119(3):1436–1469, 2009.
- 4[4] S. Cerrai. A Hille-Yosida theorem for weakly continuous semigroups. Semigroup Forum , 49(3):349–367, 1994.
- 5[5] S. Cerrai. Stationary Hamilton-Jacobi equations in Hilbert spaces and applications to a stochastic optimal control problem. SIAM J. Control Optim. , 40(3):824–852, 2001.
- 6[6] S. Cerrai and F. Gozzi. Strong solutions of Cauchy problems associated to weakly continuous semigroups. Differential Integral Equations , 8(3):465–486, 1995.
- 7[7] A. Chojnowska-Michalik and B. Gołdys. Existence, uniqueness and invariant measures for stochastic semilinear equations on Hilbert spaces. Probab. Theory Related Fields , 102(3):331–356, 1995.
- 8[8] G. Da Prato and A. Lunardi. On the Ornstein-Uhlenbeck operator in spaces of continuous functions. J. Funct. Anal. , 131(1):94–114, 1995.
