A nonlinear Bismut-Elworthy formula for HJB equations with quadratic Hamiltonian in Banach spaces
Davide Addona, Elena Bandini, Federica Masiero

TL;DR
This paper develops a nonlinear Bismut-Elworthy formula for quadratic growth BSDEs in Banach spaces, enabling solutions to complex HJB equations and stochastic control problems in high-dimensional settings.
Contribution
It extends the Bismut-Elworthy formula to cases with unbounded pseudo-inverse diffusion operators and quadratic growth generators in Banach spaces.
Findings
Derived a Bismut-Elworthy formula for quadratic BSDEs in Banach spaces.
Applied the formula to solve semilinear Kolmogorov equations with quadratic growth.
Addressed stochastic control problems with quadratic cost functions.
Abstract
We consider a Backward Stochastic Differential Equation (BSDE for short) in a Markovian framework for the pair of processes , with generator with quadratic growth with respect to . The forward equation is an evolution equation in an abstract Banach space. We prove an analogue of the Bismut-Elworty formula when the diffusion operator has a pseudo-inverse not necessarily bounded and when the generator has quadratic growth with respect to . In particular, our model covers the case of the heat equation in space dimension greater than or equal to 2. We apply these results to solve semilinear Kolmogorov equations for the unknown , with nonlinear term with quadratic growth with respect to and final condition only bounded and continuous, and to solve stochastic optimal control problems with quadratic growth.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A nonlinear Bismut-Elworthy formula for HJB equations with quadratic Hamiltonian in Banach spaces
Davide ADDONA [email protected] Dipartimento di Matematica e Applicazioni, Università di Milano-Bicocca, Milano, Italy
Elena BANDINI [email protected] Dipartimento di Matematica e Applicazioni, Università di Milano-Bicocca, Milano, Italy
Federica MASIERO [email protected] Dipartimento di Matematica e Applicazioni, Università di Milano-Bicocca, Milano, Italy
Abstract
We consider a Backward Stochastic Differential Equation (BSDE for short) in a Markovian framework for the pair of processes , with generator with quadratic growth with respect to . The forward equation is an evolution equation in an abstract Banach space. We prove an analogue of the Bismut-Elworty formula when the diffusion operator has a pseudo-inverse not necessarily bounded and when the generator has quadratic growth with respect to . In particular, our model covers the case of the heat equation in space dimension greater than or equal to 2. We apply these results to solve semilinear Kolmogorov equations for the unknown , with nonlinear term with quadratic growth with respect to and final condition only bounded and continuous, and to solve stochastic optimal control problems with quadratic growth.
Keywords: Stochastic heat equation in and dimensions, nonlinear Bismut-Elworthy formula, quadratic Backward Stochastic Differential Equation, Hamilton Jacobi Bellman equation.
MSC 2010: 60H10; 60H30; 93E20; 35Q93.
1 Introduction
In this paper we deal with Markovian BSDEs whose generator has quadratic growth with respect to , and we generalize to this framework the Bismut-Elworthy type formula introduced in [10], where the Lispchitz case was studied. More precisely, our BSDE is related to a forward stochastic differential equation of the form
[TABLE]
where is a Banach space which is continuously and densely embedded in a real and separable Hilbert space . The operator is the generator of a contraction analytic semigroup in , which turns out to be strongly continuous or analytic in , and is a cylindrical Wiener process in . We assume that the stochastic convolution
[TABLE]
is well defined as a Gaussian process in , and that it admits an -continuous version.
The presence of the diffusion operator in (1.1) allows us to deal with stochastic heat equations in and space dimensions, while stochastic heat equations in one space dimension can be considered without any regularization of the white noise, that is in the case with . Moreover, we consider dissipative maps in (1.1) in order to have more generality in the structure of the equation. Notice that, under this latter assumption, is well defined only on the Banach space , while it is not even defined on the whole Hilbert space ; this is a natural situation arising in many evolution equations, see e.g. [6] and [4].
The solution of equation (1.1) will be denoted by , or also by , to stress the dependence on the initial conditions, and the transition semigroup related to will be denoted by
[TABLE]
At least formally, the generator of is the second order differential operator
[TABLE]
This is the link with the solution, in mild sense, of the semilinear Kolmogorov equation in (see e.g. [5]):
[TABLE]
We recall that by mild solution of equation (1.2) we mean a bounded and continuous function , once Gâteaux differentiable with respect to , and satisfying the integral equality
[TABLE]
Second order differential equations are a widely studied topic in the literature, see e.g. [5]. In the case of only locally Lipschitz continuous, we cite [15], [22], [20] and also [21], where in particular the quadratic case is studied with datum only continuous. We also mention the monograph [3], where semilinear Kolmogorov equations related to forward equations of reaction diffusion type more general than the one considered here are studied, but requiring Lipschitz continuity of the final datum.
We will consider equation (1.2) under the assumptions that the final datum is bounded and continuous, and that has quadratic growth with respect to the derivative . In order to prove existence and uniqueness of a mild solution of the form (1.3) for the Kolmogorov equation (1.2), we aim at representing this mild solution in terms of a Markovian BSDE of the form
[TABLE]
We recall that, in order to solve partial differential equations by means of BSDEs, one of the crucial tasks is the identification of with the derivative of taken in the directions of the diffusion operator. In this regard, we refer to the seminal paper [23] for the finite dimensional case, and to [11] for the infinite dimensional extension in Hilbert spaces: in both papers the driver is Lipschitz continuous in and in , and and are differentiable. We also mention [19], where an extension to the Banach space case is studied with the same assumptions of Lipschitz continuity and differentiability on the data.
In the present paper we do not make differentiability assumptions on the coefficients: thank to a variant of the nonlinear Bismut-Elworthy formula for BSDEs introduced in [10], we are still able to prove that the solution of the BSDE (1.4) gives the mild solution of the Kolmogorov equation (1.2). Bismut-Elworthy formulas for the transition semigroup of equations of type (1.1) with invertible diffusion operator are a classical topic in the literature, see e.g. [5]. In [3] the case of an operator like the one in (1.1), with pseudo-inverse which is not necessarily bounded, is also considered. According to these classical Bismut formulas, for every , , and for every bounded and continuous real function defined on , one has
[TABLE]
where ( being the general diffusion operator)
[TABLE]
In [10] a nonlinear Bismut-Elworthy formula for the process solution of the BSDE (1.4) is proved when is Lipschitz continuous with respect to and the process takes its values in a Hilbert space . According to this formula, for , for every direction ,
[TABLE]
Formula (1.6) is used in [10] to solve a semilinear Kolmogorov equation of the form of (1.2). When the Hamiltonian function is Lipschitz continuous with respect to the derivative of , semilinear Kolmogorov equations of the type of (1.2) can be solved also by using the estimates coming from the classical Bismut formulas (1.5) and by a fixed point argument, see e.g. [3], [5], [14]. In the quadratic case this procedure does not work anymore: for this reason, nonlinear versions of Bismut-Elworthy formulas, that give an alternative way to solve equations like (1.2), are particularly interesting in such a framework. In [21], a nonlinear version of the Bismut-Elworthy formula has been provided and has been applied to semilinear Kolmogorov equations of the type of (1.2), with quadratic hamiltonian, and in a Hilbert space.
In the present paper, we generalize (1.6) to the Banach space framework, and to the case of diffusion operator that has unbounded pseudo-inverse operator. In this context, the nonlinear Bismut formula (1.6) has its own independent interest, and moreover it allows to solve the Kolmogorov equation with Hamiltonian function quadratic with respect to . We first provide an analogous of the nonlinear Bismut formula given in [10] in the case of Banach space framework and Lispchitz continuous generator. Then, we prove a nonlinear Bismut formula in the quadratic case when and are differentiable. To this end, denoted by a solution to the Markovian BSDE (1.4) and assuming that and are differentiable, the two main ingredients are the identification
[TABLE]
and an a priori estimate on of the form ( being a constant depending on )
[TABLE]
which is obtained with techniques similar to the ones used in [21], see also [7] and [26]. Both (1.7) and (1.8) are new in the Banach space framework and in the case of quadratic generator with respect to . Finally, differentiability assumptions are removed by an approximation procedure, obtained by suitably generalizing the one introduced in [25].
Our results can be applied to a stochastic optimal control problem consisting in minimizing a cost functional of the form
[TABLE]
over all the admissible controls taking values in and not necessarily bounded. Here has quadratic growth with respect to , and is the solution of the controlled state equation
[TABLE]
with or . The aim of this latter part of the work is to characterize the value function as the solution of the associated Hamilton Jacobi Bellman (HJB in the following) equation, and to provide a feedback law for optimal controls. If , namely when the controls affect the system only through the noise (the so called structure condition holds true), the optimal control problem (1.9) can be completely solved, see Theorem 6.10. When , the optimal control problem can be completely solved by restricting ourselves to the class of more regular controls taking values in , see Theorem 6.15. In the general case of and -valued controls, we are able to provide an “-optimal solution” of the problem in the sense that the value function can be approximated by a sequence of functions which are solutions of approximating HJB equations, and we can obtain an -optimal control in feedback form, see Theorem 6.25.
The paper is organized as follows: in Section 2 we fix the notations and we give the results on the forward process. In Section 3 we introduce the forward backward system: here the main results are the identification (1.7) of with , which is new in the case of quadratic with respect to and in the Banach space framework, and the a priori estimate (1.8) on not involving derivatives of the coefficients of the BSDE. In Section 4 we give the nonlinear Bismut formula (1.6) in the Banach space and with Lipschitz continuous with respect to , then in Section 5 we extend formula (1.6) to the case of quadratic with respect to . In both Sections 4 and 5, the Bismut formula is applied to solve the corresponding semilinear Kolmogorov equation (1.2). Finally in Section 6 we apply the previous results to solve the stochastic optimal control problem (1.9).
2 Notations and preliminary results on the forward process
We assume that is a real and separable Banach space which admits a Schauder basis, and that is continuously and densely embedded in a real and separable Hilbert space . and are respectively endowed with the norms and . We fix a complete probability space endowed with a filtration satisfying the usual conditions.
We list below some notations that are used in the paper. Let be a given Banach space endowed with the norm . For any and any , we set
- •
the space of -valued measurable functions defined on , normed by
[TABLE]
- •
the space of adapted processes , defined on and with values in , normed by
[TABLE]
- •
(resp. ) the space of all adapted processes , continuous on (resp. on ) and with values in , normed by
[TABLE]
If we simply write .
- •
the space of all predictable processes with values in normed by
[TABLE]
If we simply write .
We denote by the space of all bounded linear operators from to , endowed with the usual operator norm. denotes the dual space of , and denotes the duality between and .
We say that a function belongs to the class if is continuous and Gâteaux differentiable on and if the gradient is strongly continuous. If we simply write . We say that is in if is continuous and Gâteaux differentiable with respect to every and the gradient is strongly continuous. For more details on this classes of Gâteaux differentiable functions see [11, Section 2.2].
2.1 The forward equation
We are given the Markov process in (also denoted to stress the dependence on the initial conditions) solution to the equation
[TABLE]
where is a cylindrical Wiener process with values in , see e.g. [6] for details on cylindrical Wiener processes in infinite dimensions. From now on will be the natural filtration generated by the Wiener process and augmented in the usual way.
We assume the following on the coefficients of equation (2.1).
Hypothesis 2.1**.**
* is a linear operator which generates a contraction analytic semigroup on the Hilbert space and there exist such that for any and any . Further, the restriction of to generates a contraction (or analytic) semigroup on .* 2. 2.
The stochastic convolution
[TABLE]
admits an -continuous version, and, for any , (when we write instead of ). 3. 3.
* is a measurable and dissipative map, and .* 4. 4.
The restriction of to is a map from to which is measurable and dissipative (where no confusion is possible, we simply write instead of ). and is Fréchet differentiable. Further, there exist , and for any an element , such that, for any , ,
[TABLE] 5. 5.
.
By Hypothesis 2.1-1. and the Kuratowski theorem, see e.g. [24], Chapter I, Theorem 3.9, it follows that is a Borel set in .
Remark 2.2**.**
Since by Hypothesis 2.1-3.-4. is differentiable and dissipative, we get
[TABLE]
In particular, from the Hahn-Banach theorem, there exists such that , and therefore . Further, from [6, Appendix D] we have
[TABLE]
Remark 2.3**.**
Since generates a contraction semigroup on , then is dissipative, and for any we have , , see Example D.8 in [6].
We now give an example of spaces and and of operator satisfying Hypothesis 2.1-1.-2.
Example 2.4**.**
Let with , be an open bounded set, and . Further, let be the realization in of the operator
[TABLE]
with boundary conditions , where and
[TABLE]
where is the normal vector to the boundary of . As shown for example in [17], satisfies Hypothesis 2.1- Moreover, [3, Lemma 6.1.2] with shows that Hypothesis 2.1- is satisfied with this choice of , and .
In the following proposition we collect important results on the solution of the forward equation (2.1). We recall that, given and , a mild solution to (2.1) is an adapted process which satisfies
[TABLE]
Proposition 2.5**.**
Let Hypothesis 2.1 hold true. Then the following hold.
- (i)
For any , , the problem (2.1) admits a unique mild solution , for any . If generates a strongly continuous semigroup on , then the process is also continuous up to . Moreover, there exists a positive constant such that, for any ,
[TABLE]
where
[TABLE]
- (ii)
For any , , the mild solution to (2.1) is Gâteaux differentiable as a map from to , and
[TABLE]
Moreover, is Gâteaux differentiable as a map from to , and
[TABLE]
- (iii)
For any , and ,
[TABLE]
Proof.
Item can be proved arguing as in [6, Theorem 7.13].
The first part of and inequality (2.5) follow from [19, Propositions 3.10 & 3.13]. We claim that
[TABLE]
If the claim is true, since is densely embedded into , by approximation we immediately deduce (2.6) for any . In order to prove (2.8), we consider and the approximating processes , , where . Then, is a strict solution to
[TABLE]
The dissipativity of and implies , which gives . Letting we get (2.8).
It remains to prove . To this end, we recall that (see e.g. [19]), for any , the process is a mild solution to
[TABLE]
and therefore
[TABLE]
Let and let be an approximating sequence of in . If we replace to in (2.12), from we deduce that the left-hand side of (2.12) and the first term in the right-hand side of (2.12) converge respectively to and to , as . As far as the integral in the right-hand side of (2.12) is considered, with replaced by , again from we infer that
[TABLE]
as . Thanks to Hypothesis 2.1-, estimate (2.4) and (2.6), we can apply the dominated convergence theorem and therefore
[TABLE]
as , which gives (2.7).
Now we show that for any and any , the process belongs to a.e. in and -a.s., and satisfies useful estimates.
Proposition 2.6**.**
Let Hypothesis 2.1 holds true, and let , and . Then, , a.e. in and -a.s., and there exists a positive constant such that, for any ,
[TABLE]
Proof.
Let . We prove (2.13) for , the case can be proved by analogous computations.
We first assume that . Let be a strict solution to (2.11), otherwise we can approximate it by smooth processes, as in the proof of Proposition 2.5, item (ii). The dissipativity of in gives
[TABLE]
for any . Integrating between [math] and we get
[TABLE]
Since for any , from (2.6) we deduce that for any . Thus (2.13) holds for , and any .
Let us now consider . From interpolation estimates (see e.g. [17, Section 2.2])
[TABLE]
By replacing by in (2.14), we get
[TABLE]
with . We conclude that (2.13) holds for , and any .
Let us now consider , and let be an approximating sequence of in . Then, from (2.6), for any we get
[TABLE]
Since (2.13) holds for any , it follows that is a Cauchy sequence in , and therefore there exists a process such that in . Since is a bounded operator on , it follows that
[TABLE]
in . Therefore, also by (2.15), a.e. in and -a.s., which means that a.e. in and -a.s., and a.e. in and -a.s. In particular, we get
[TABLE]
Again, by applying interpolation estimates we see that (2.13) holds for , and any .
We end this section by giving pointwise estimates of . In particular, we improve the result of Proposition 2.6, by obtaining that belongs to for any , -a.s.
Proposition 2.7**.**
Let Hypothesis 2.1 holds true and let , and . Then, for any and ,
[TABLE]
and if in addition , then (2.19) gives
[TABLE]
where .
Proof.
We prove estimate (2.18), then (2.19) follows from analogous arguments. Fix , and let us consider . We recall that generates an analytic semigroup on and therefore belongs to for any and any , and for any and some positive constant . This means that for any and, recalling (2.7),
[TABLE]
From Hypothesis 2.1-4. and (2.6) we deduce that
[TABLE]
for some positive constant independent of . Then, for any ,
[TABLE]
for some positive constant independent of . Further, if , then
[TABLE]
Taking the expectation in (2.18) and (2.19) we get respectively (2.16) and (2.17).
3 The forward-backward system
We consider the following forward-backward system of stochastic differential equations (FBSDE for short) for the unknown (also denoted by to stress the dependence on the initial conditions and ): for given and ,
[TABLE]
The second equation is of backward type for the unknown and depends on the Markov process . Under suitable assumptions on the coefficients (the so-called generator of the BSDE) and we look for a solution consisting of a pair of processes . More precisely, we will assume that is Lipschitz continuous with respect to and locally Lipschitz continuous and with quadratic growth with respect to , as stated below.
Hypothesis 3.1**.**
The functions and in (3.1) satisfy the following.
- (i)
* is continuous, and there exists a nonnegative constant such that for every .*
- (ii)
* is measurable and, for every fixed , the map is continuous. Moreover, there exist nonnegative constants and such that*
[TABLE]
for every , , and .
Theorem 3.2**.**
Assume that Hypotheses 2.1 and 3.1 hold true, and for any , let be a solution to the FBSDE (3.1). Then, there exists a unique solution of the Markovian BSDE in (3.1) such that
[TABLE]
where is a constant that may depend on . Moreover, setting ,
[TABLE]
and there exists a Borel function such that
[TABLE]
Proof.
The first part of the result substantially follows from [16]. Identities (3.2)-(3.3) are a consequence of the Markov property of , see for instance Theorem 4.1 in [8] or the proof of Theorem 5.1 in [12].
We recall some further estimates for the solution of the forward-backward system (3.1). In particular, , for any . The corresponding proof can be found e.g. in [21].
Proposition 3.3**.**
Assume that Hypotheses 2.1 and 3.1 hold true, and for any , let be a solution to the FBSDE (3.1). Then, for all ,
[TABLE]
where is a constant that may depend on .
At this point, we aim at proving a stability result for the BSDE when the final datum and the generator are approximated by sequences of Fréchet differentiable functions , converging pointwise respectively to and , and such that, for all , , , ,
[TABLE]
To provide such approximations we extend the result in [25] valid for Hilbert spaces: by using Schauder basis, the approximation performed in that paper can be achieved also in Banach spaces, along the lines of what is done in [18]. We start by introducing the following objects.
Definition 3.4**.**
- i)
Denote by the normalized Schauder basis in and by an orthonormal basis of . For any , we define the projections and as follows:
[TABLE]
for any and with and , .
- ii)
We consider nonnegative smooth kernels and , , such that
[TABLE]
- iii)
For any , we set for any , and
[TABLE]
It is not hard to prove the following lemma.
Lemma 3.5**.**
Le and satisfy Hypothesis 3.1. Then the following hold.
- (i)
For any , the function in (3.6) is Fréchet differentiable, satisfies estimate (3.4), and
[TABLE]
- (ii)
For any , the function in (3.7) is Fréchet differentiable with respect to , satisfies estimates (3.4)-(3.5), and
[TABLE]
We can now give a stability result for the Markovian BSDE in (3.1) related to a forward process taking values in the Banach space , when the final datum and the generator are approximated respectively by the sequences and . Notice that a similar result is proved in [21], where the forward process takes its values in a Hilbert space : there the final datum and the generator are approximated in the norm of the uniform convergence by means of their inf-sup convolutions. Clearly, the following result holds true if we approximate only or .
Proposition 3.6**.**
Assume that Hypotheses 2.1 and 3.1 hold true. For any , let be a solution to the FBSDE (3.1). Let be the solution of the BSDE in the forward-backward system
[TABLE]
that is, the FBSDE (3.1) with final datum equal to in (3.6) in place of , and with generator in (3.7) in place of . Then, for all , the unique solution of the Markovian BSDE in (3.1) is such that
[TABLE]
Proof.
Thanks to (3.4), (3.5) and to Proposition 3.3, the pair of processes is bounded in , uniformly with respect to . The BSDE satisfied by the pair of the difference processes is
[TABLE]
Writing the previous equation in the integral form, we get
[TABLE]
where in the last passage we have used that
[TABLE]
which, by the Girsanov Theorem (see, e.g., [6, Theorem 10.14]), is a cylindrical Wiener process under an equivalent probability measure . Taking the -conditional expectation , we get
[TABLE]
By taking the absolute value, the expectation and by applying the Gronwall lemma, we deduce that, for all , in as , with respect to the probability measure and also with respect to the original probability measure.
For what concerns the estimate of , by applying the Itô formula to we get
[TABLE]
Let us consider the right-hand side of the above inequality. Thanks to estimates (3.4), (3.5) and to the boundedness of and in , the first two terms converge to [math] as by the dominated convergence theorem. For what concerns the third term, by applying Hôlder’s inequality with conjugate exponents, we get
[TABLE]
as . The stability result for follows, and we can pass to the case of general in a usual way.
We now state a result on differentiability for the solution of a Markovian BSDE with generator with quadratic growth, with respect to the initial datum .
Proposition 3.7**.**
Assume that Hypotheses 2.1 and 3.1 hold true, and for any , let be a solution to the FBSDE (3.1). Assume moreover that is Gâteaux differentiable with bounded derivative, and that is Gâteaux differentiable with respect to , and . Then the triple of processes is Gâteaux differentiable as a map from with values in and, for any ,
[TABLE]
Moreover, there exists a constant , only dependent on , such that
[TABLE]
Proof.
In the case of a Markovian BSDE with generator quadratic with respect to and related to a forward process taking values in a Hilbert space, the result is given in Theorem 4.5 of [2]. Since in Proposition 2.5 we have proved the differentiability of with respect to , the same conclusions hold when the forward process takes values in the Banach space , namely
[TABLE]
The stronger estimate (3.15) comes from Proposition 2.5, estimate (2.6).
3.1 Identification of and a priori estimates on
We now prove an a priori estimate on depending only on the -norm of the final datum. The novelty towards [21] is that we work in a Banach space and the pseudo-inverse of the diffusion operator is the unbounded operator . In order to get this estimate and also for the subsequent results of the paper, it will be crucial to prove the identification
[TABLE]
which is new in the Banach space framework and in the case of quadratic generator with respect to . We have to make the following assumption:
Hypothesis 3.8**.**
There exists a Banach space dense in such that and is continuous.
Remark 3.9**.**
Notice that if is a bounded open domain with smooth boundary, and is the Laplace operator in dimension with Dirichlet boundary conditions, then we can take and all the requirements of Hypothesis 3.8 are verified.
Theorem 3.10**.**
Assume that Hypotheses 2.1, 3.1 and 3.8 hold true, that is Gâteaux differentiable with bounded derivative, and that is Gâteaux differentiable with respect to , and . For any , let be the solution to the FBSDE (3.1). Then the triple of processes is Gâteaux differentiable as a map from with values in . Moreover, setting , then, -a.s.,
[TABLE]
Proof.
The differentiability properties of and the identification formula (3.16) directly follow respectively from Proposition 3.7 and formula (3.2) in Theorem 3.2.
Let us now prove identification formula (3.17) for . Since we are in a Banach space framework, we will follow the lines of the proof of Theorem 3.17 in [19]. However, a substantial difference with respect to [19] is that here we deal with a generator with quadratic growth with respect to , instead of Lipschitz continuous. Fix . By the definition of the function , we can write
[TABLE]
where we have used the notation . Notice that towards [19] we do not have , but we only know that for any . As in [19], we define a family of predictable processes with real values in the following way:
[TABLE]
We will briefly write , where by we mean the trajectory of up to time .
Let us set for . From now on we fix , and small enough such that . We also identify with its dual , and we write for . Multiplying both sides of (3.18), with replaced by , by and taking the expectation, we get
[TABLE]
It is immediate that
[TABLE]
so (3.19) simplifies in
[TABLE]
By dividing both sides of the previous equality by and letting , we get
[TABLE]
We will prove that
[TABLE]
If (3.21) and (3.22) hold, then, by (3.20), for every , for almost every . By the arbitrariness of , we would have, for almost every , Z^{t,x}_{\sigma}\varsigma=\nabla_{x}\big{(}v(\sigma,X^{t,x}_{\sigma})\big{)}(-A)^{-\alpha}\varsigma, -a.s. for all , and the formula (3.17) would follow.
Let us thus show that (3.21) and (3.22) hold true. We start by proving (3.21). One proceeds as in [1], following also [19]. In particular, for , we define
[TABLE]
where depends on the trajectories of up to time , and the dependence is given by the definition of . The process is defined as the solution of (3.23), which is not considered as a stochastic differential equation, as specified in [1, p. 476]. Equation (3.23) can be solved step by step in each interval
[TABLE]
is well defined for every , see [19] for more details. Moreover, is a function of the trajectories of up to time , that is, , and we can write
[TABLE]
Now we define a probability measure such that
[TABLE]
By the Girsanov Theorem, under , is a cylindrical Wiener process in . By this construction of , it is also clear that for every , is pathwise differentiable with respect to and , see also [1, p. 476].
By (3.18), the random varaible is square integrable and
[TABLE]
Therefore, by the Cauchy–Schwarz inequality the expectation of is well defined. We claim that
[TABLE]
As a matter of fact,
[TABLE]
where in the last passage we have used the dominated convergence theorem being bounded.
Now notice that, in , is a mild solution to the equation
[TABLE]
On the other hand, in , we consider the process which is a mild solution to the equation
[TABLE]
Then the process under and the process under have the same law, so (3.24) yields
[TABLE]
Let us set and , -a.s. for any . Arguing as in [19], one can prove that
[TABLE]
Formula (3.26) in turn allows to show that
[TABLE]
so that formula (3.25) gives
[TABLE]
By (3.27) we have
[TABLE]
so (3.21) is proved.
It remains to prove (3.22). Recalling identifications (3.2)-(3.3), we have
[TABLE]
which is the analogous of formula (3.25) with in place of . Now we notice that
[TABLE]
where we have used the notation
[TABLE]
[TABLE]
On the other hand, the pair of processes is solution to the FBSDE
[TABLE]
Moreover, taking into account (3.30) and the linearity of the BSDE (3.36), we get that the pair satisfies the estimates
[TABLE]
By Hypothesis 3.1,
[TABLE]
Therefore, collecting (3.29)-(3.30), (3.37)-(3.38) and (3.39), (3.1) gives
[TABLE]
which goes to zero as goes to zero. This shows that (3.22) holds true and concludes the proof.
Corollary 3.11**.**
Under the assumptions of Theorem 3.10 we have
[TABLE]
where denotes an extension of the operator to the whole space . Moreover, there exists a constant , that may depend also on , and , such that
[TABLE]
Proof.
Since is dense in , by (3.17) in Theorem 3.10, for almost every and almost surely with respect to the law of , the operator extends to an operator defined on the whole , which we still denote .
Moreover, from (3.17) and by the Markov property, we get
[TABLE]
The conclusion (3.40) follows from the fact that by (3.15), where is a constant that does not depend on .
Now we use the previous result to give a priori estimates on .
Proposition 3.12**.**
Assume that Hypotheses 2.1 and 3.1 hold true, and for any , let be the solution to the FBSDE (3.1). Then there exists a positive constant only depending on , such that
[TABLE]
Proof.
In the following will denote a positive constant which may depend on , but not on , and that may vary from line to line. We fix .
We start by proving estimate (3.41). We first take and differentiable. By Proposition 3.7, the triple of processes is Gâteaux differentiable as a map from with values in , and for any , the triple of processes is solution to (3.14), and satisfies estimate (3.15).
Let us now introduce the process
[TABLE]
where is the probability measure such that is a Brownian motion in .
Let us fix . Arguing as in [21, Proposition 3.6] it follows that
[TABLE]
Therefore, is a -submartingale, which implies, thanks to identification formula (3.17), that
[TABLE]
Further, since is differentiable and Lipschitz continuous with respect to and , and is bounded (see (2.6)), we deduce that
[TABLE]
It remains to estimate . To this aim, we recall the well-known estimate
[TABLE]
for some and any . Formulas (3.45), (3.17) and (2.17) give
[TABLE]
which, together with (3.43) and (3.44), allows us to conclude that
[TABLE]
Let now fix . We notice that in this case we can write . Therefore,
[TABLE]
which provides (3.41) in the case of and differentiable.
Finally, the case and non differentiable can be obtained by approximating and with and in (3.7) and (3.6), respectively. For the proof we refer to [21, Proposition 3.6].
Let us now prove estimate (3.42). Again, at first we prove the result when and are differentiable and then we generalize it by approximation. Let us fix . For any , the submartingale property of gives
[TABLE]
Moreover, for any we split
[TABLE]
Let us evaluate separately and . Concerning , identification formula (3.17), (3.45) and (2.16) give
[TABLE]
Hence, from (3.44), (3.46) and (3.1) it follows that
[TABLE]
By applying Fubini’s theorem and (3.49), we infer that
[TABLE]
As far as is concerned, we take advantage from (3.44) and (3.1). Then, for we get
[TABLE]
Thus collecting (3.46), (3.1) and (3.51), we have
[TABLE]
so that
[TABLE]
Let us now fix , and let us consider a sequence such that as in . Taking (3.52) with replaced by and letting , it follows that
[TABLE]
Inequality (3.42) follows from (3.53) by taking and .
4 The Bismut-Elworthy formula and the semilinear Kolmogorov equation: the Lipschitz case
Recall that we deal with a process taking values in a Banach space and solution to equation (2.1), with special diffusion operator with pseudo-inverse which is not bounded.
In the present section we adequate to our framework the results in [10]. More precisely, in Subsection 4.1 we present a nonlinear version of the Bismut-Elworthy formula in the case of Lipschitz generator, which extends the one provided in [10] in the case of a process taking values in a Hilbert space, and with a bounded diffusion operator with bounded inverse. Providing the Bismut-Elworthy formula in the case of Lipschitz generator is a fundamental step in order to obtain the analogous formula in the quadratic case. Moreover, it allows us to give an existence and uniqueness result in the Banach framework for the semilinear Kolmogorov related to the process , and with coefficients and not necessarily differentiable, see Subsection 4.2.
For and we define the real valued random variables
[TABLE]
Notice that, for any , the process is well defined thanks to formula (2.13) in Proposition 2.6. In what follows we prove some useful estimates on the process .
Lemma 4.1**.**
Assume that Hypotheses 2.1 hold true. For any , let be the unique mild solution to (2.1). Then, for any and for any ,
[TABLE]
and also
[TABLE]
Proof.
We compute
[TABLE]
where in the latter inequality we have used formula (2.13) of Proposition 2.6 with . Analogously, we have
[TABLE]
4.1 The Bismut formula
We can now give a version of the Bismut-Elworthy formula in the case of Lipschitz generator and in the Banach space framework. We consider only the case of final datum and generator bounded with respect to , since we aim to treat such a model in the quadratic case. We start with the case when the coefficients are also differentiable. An analogous result is proved in [10] in the Hilbert space framework using the Malliavin calculus. Since here the process takes its values in a Banach space, we avoid the use of the Malliavin calculus, by exploiting instead techniques similar to the ones used in the proof of Theorem 3.10.
In the rest of the section we will assume the following, that substitutes Hypothesis 3.1.
Hypothesis 4.2**.**
The functions and in (3.1) satisfy the following.
- (i)
* is continuous, and there exist a nonnegative constant such that for every .*
- (ii)
* is measurable and, for every fixed , the map is continuous. Moreover, there exist nonnegative constants and such that*
[TABLE]
for every , , and .
Theorem 4.3**.**
Let Hypotheses 2.1 and 4.2 hold true, and for any , let be a solution of the forward-backward system (3.1), and let be the process defined in (4.1). Assume moreover that is Gâteaux differentiable with bounded derivative, and that is Gâteaux differentiable with respect to , and . Then for , ,
[TABLE]
Proof.
Let be a given square integrable -valued predictable process, and be a mild solution to the equation
[TABLE]
We also consider the pair of processes solution to the Markovian BSDE
[TABLE]
Arguing similarly to the proof of Theorem 3.10, we define
[TABLE]
which are solution to the forward-backward system (3.36) with . We already know (see formula (3.26) with ) that
[TABLE]
Now we want to prove a similar identification for the pair . To this aim, for any , we consider the Markovian BSDE in (3.1) on the time interval , and with initial condition given at time ; from Proposition 3.7 we know that the derivative with respect to in the direction satisfies the following BSDE, that we write in integral form: for any , -a.s.,
[TABLE]
Let us take and in (4.11), and let us integrate both sides with respect to . By inverting the order of integration where necessary, and using the Markov property, it is immediate to get
[TABLE]
By (4.6) and (4.7), together with (4.8), we can conclude that
[TABLE]
since these two pairs of processes satisfies the same BSDE. By density, arguing as in Corollary 3.11, we infer that formulas (4.8) and (4.12) hold true for any square integrable -valued predictable process . Now, let , and let us take
[TABLE]
Notice that, since -a.s., thanks to Proposition 2.6, for any , -a.s., and so
[TABLE]
which belongs to . Therefore, for all we have -a.s., where denotes the mild solution to the forward equation in (3.36) with with given by (4.13). With this choice of equalities (4.8) and (4.12) can be rewritten as
[TABLE]
Let us now set
[TABLE]
By (4.14), and can be rewritten as
[TABLE]
Notice that the right-hand sides in (4.1) and in (4.16) are nothing else (modulo a renormalization) than the terms appearing in the right-hand sides of the first two equations in (3.14). Now we aim at finding an expression for and that does not involve the derivative of and : this in turn will furnish an expression of that does not involve the derivatives of and , as in formula (4.4). To this end, let us consider the process
[TABLE]
and let us define a probability measure such that
[TABLE]
By the Girsanov theorem, under is a cylindrical Wiener process in . Arguing as in the proof of Theorem 3.10, we also notice that the process under and the process under have the same law. Therefore,
[TABLE]
By differentiating inside the expectation with respect to and changing the order of integration, we get
[TABLE]
and so, recalling (4.1),
[TABLE]
Similarly, , and this proves (4.4) when . The general case with follows by density, thanks to estimates (3.42) and (4.2).
In the next result we remove the differentiability assumption on and in Theorem 4.3.
Theorem 4.4**.**
Let Hypotheses 2.1 and 4.2 hold true, and for any , let be a solution of the forward-backward system (3.1), and let be the process defined in (4.1). Then, for , , the Bismut formula given in (4.4) holds true.
Proof.
The proof follows the same lines of the one of Theorem 3.10 in [10]. The main ingredients are formula (3.17) in Theorem 3.10 and Proposition 3.6, which provide respectively the identification of in the Banach space case and with the diffusion operator , and the stability result for the BSDE in (3.1) when the generator and the final datum are approximated by (3.7)-(3.6). We underline that approximations (3.6)-(3.7) preserve the boundedness and the growth, and are only of pointwise type. Notice that in [10], the final datum and the generator are approximated by means of their inf-sup convolutions, and so the approximation is uniform. However, thanks to the aforementioned stability properties for the BSDE, our pointwise approximations (3.7)-(3.6) are sufficient to obtain the desired result.
4.2 The semilinear Kolmogorov equation
By means of Theorem 4.4, we can give an existence and uniqueness result in the Banach framework for the semilinear Kolmogorov related to the the process , and with coefficients and not necessarily differentiable, as it is assumed in [19].
Let , be the transition semigroup related to the process solution of the forward equation (2.1), namely, for every bounded and measurable function , . We consider the following semilinear Kolmogorov equation
[TABLE]
where is the generator of the transition semigroup , that is, at least formally,
[TABLE]
We introduce the notion of mild solution of the nonlinear Kolmogorov equation (4.18), see e.g. [11].
Definition 4.5**.**
A function is a mild solution of the semilinear Kolmogorov equation (4.18) if , and
[TABLE]
Theorem 4.6**.**
Let Hypotheses 2.1 and 4.2 hold true. Then the semilinear Kolmogorov equation (4.18) has a unique mild solution given by the formula
[TABLE]
where, for any , denotes the solution to the FBSDE (3.1). In addition, we have, -a.s.,
[TABLE]
Proof.
If the data and are also differentiable, the result can be proved as in [19], Theorem 6.2. When the data are not differentiable, the Bismut formula (4.4) is still true, see Theorem 4.4, and the result can be proved arguing as in [10], Theorem 4.2.
5 The Bismut-Elworthy formula and the semilinear Kolmogorov equation: the quadratic case
We are ready to state and prove the main result of the paper, which is a nonlinear Bismut-Elworthy formula as the one in Theorem 4.4, but in the case of quadratic generator. This in particular will give an existence and uniqueness result for the Kolmogorov equation (4.18) in the quadratic case and in the Banach framework, see Theorem 5.4.
Theorem 5.1**.**
Let Hypotheses 2.1 and 3.1 hold true. For any , let be the solution of the forward-backward system (3.1) and let be the process defined in (4.1). Then, for , and ,
[TABLE]
Proof.
We split the proof into two steps: we first prove the statement when is differentiable with respect to and , and then we remove this additional assumption.
STEP . We start by considering differentiable with respect to , and . For all , let us denote by the solution of the Markovian BSDE in (3.1) with final datum equal to in (3.6) in the place of :
[TABLE]
By estimate (3.40) in Corollary 3.11, for any , there exists a constant , depending on , which is bounded for every and blows up as , and such that
[TABLE]
In particular,
[TABLE]
Therefore, the generator acts as a Lipschitz generator with respect to in the BSDE (5.2), so the Bismut-Elworthy formula stated in Theorem 4.4 holds true for the BSDE (5.2): for every ,
[TABLE]
At this point we aim at taking the limit as in (5.4).
We start by considering the right-hand side of (5.4). By the properties of the approximations together with (4.2), by the dominated convergence theorem and the pointwise convergence of to we have
[TABLE]
Therefore,
[TABLE]
In order to compute the limit of the remaining term in the right-hand side of (5.4), we will show that
[TABLE]
We notice that
[TABLE]
We start by estimating the term . We have
[TABLE]
We recall that, by estimate (3.41) in Proposition 3.12, and since , there exists a constant , not depending on , such that
[TABLE]
So, since and , for
[TABLE]
We only show the convergence of since the convergence of follows in a simpler way by the boundedness of and of (uniform in ), and by the convergence of to in . Using Hölder inequality with and , for some , together with estimate (4.3) in Lemma 4.1, we get
[TABLE]
as , since in .
Let us now estimate the term . To this end, we recall that, by Theorem 3.3, , are bounded in and , are bounded in , by a constant independent on . Moreover, by Proposition 3.6, converges to is and converges to in , for any . By using again Hölder’s inequality for some , , and estimate (4.3) in Lemma 4.1, we get
[TABLE]
as . Collecting all the previous results, we deduce that, for every ,
[TABLE]
In particular, by taking in (5.6),
[TABLE]
which shows that exists. Moreover, arguing as in the end of the proof of Theorem 4.1 in [21], we deduce that for all .
STEP . Let us now remove the differentiability assumptions on . For any , let be the function defined in (3.7). From Lemma 3.5 we know that is differentiable and it preserves the Lipschitz constant, so that
[TABLE]
Moreover, from Lemma 3.5 we have as for any , and for any
[TABLE]
for any . We consider the BSDE with generator equal to in the place of :
[TABLE]
By the first part of the proof, for any ,
[TABLE]
We aim at taking the limit as We start by considering the first term in the right-hand side of (5.9), and we will show that
[TABLE]
We start by splitting the integral above as follows:
[TABLE]
In order to estimate the term , we notice that
[TABLE]
Concerning , by (3.4)-(3.5) we can argue as for in Step , and get that .
Let us now consider the term . From Hypothesis 3.1 and formulas (3.4) and (3.5) it follows
[TABLE]
where is a positive constant depending on and . Arguing as for is Step , it is possible to prove that
[TABLE]
On the other hand, recalling that pointwise as , we get that by the dominated convergence theorem.
Let us now estimate . To this end, we notice that
[TABLE]
Arguing as for the term in Step , we deduce that as . As far as is considered, we get
[TABLE]
Arguing as for in Step it follows that
[TABLE]
Since pointwise converges to , we can again apply the dominated convergence theorem which gives as . We can thus conclude that, for every ,
[TABLE]
As in the end of Step , arguing as at the end of Theorem 4.1 in [21] we can show that, for any , .
We now state two corollaries: the former is about integral estimates of , the latter is about the identification of with without differentiability assumptions. Notice that, by means of the Bismut formula (5.1), we can also recover estimate (3.42) on .
Corollary 5.2**.**
Let . Under the assumptions of Theorem 5.1, the process belongs to , and there exists a constant depending only on such that
[TABLE]
Proof.
Integrating (5.1) between and we get
[TABLE]
We have
[TABLE]
For what concerns , we split it as
[TABLE]
From (4.3) and Proposition 3.3 we have
[TABLE]
On the other hand, we consider the function under the integral sign in and we split it as follows:
[TABLE]
We argue as in the proof of Theorem 5.1, Step . In particular, arguing as for the estimate of we infer that for some positive constant . On the other hand, as far as is considered, arguing as in the estimate of , we get that for some positive constant . Hence,
[TABLE]
and this concludes the proof.
In the following we prove that the identification of with the directional derivative of remains true also when and are not differentiable.
Corollary 5.3**.**
Under the assumptions of Theorem 5.1, for every ,
[TABLE]
Proof.
Let and be respectively approximated by and in (3.6) and (3.7), and let be the solution of the BSDE with final datum and generator . By Theorem 4.6 we already know that . On the other hand, we have shown in Theorem 5.1 that is differentiable and that , a.e. and a.s., as . Moreover, by computing the joint quadratic variation between the process , and , it turns out that
[TABLE]
By taking a subsequence (that for simplicity we call again ) and letting in both sides, from Proposition 3.6 we get
[TABLE]
which gives formula (5.12).
Using Theorem 3.10, we can give an existence and uniqueness result for the Kolmogorov equation (4.18) and we can provide a Feynman-Kac formula in the quadratic case and in the Banach framework.
Theorem 5.4**.**
Let Hypotheses 2.1 and 3.1 hold true. Then there exists a unique mild solution of the semilinear Kolmogorov equation (4.18) given by the formula
[TABLE]
where is the solution to the FBSDE (3.1), and -a.s.,
[TABLE]
In particular,
[TABLE]
If in addition is Gâteaux differentiable with bounded derivative, and is Gâteaux differentiable with respect to , and , then
[TABLE]
Proof.
For the first part without differentiability assumptions on and , it is enough to apply Theorem 5.1 and Corollary 5.3 to get existence of the solution, as well as the estimate for . The uniqueness follows from the uniqueness of the solution of the related BSDE. The estimate for is a direct consequence of Proposition 3.12. The second part of the result can be proved in a standard way by means of Proposition 3.7 and the identification of proved in Theorem 3.10, see e.g. the proof of Theorem 6.2 in [11].
6 A quadratic optimal control problem
In this section we deal with the controlled state equation
[TABLE]
where or , and is the control process belonging to a suitable space of -valued functions. We will study the optimal control problem associated to equation (6.3) with cost functional defined by
[TABLE]
that we are going to minimize over all admissible controls. We define the value function of the optimal control problem as
[TABLE]
For any , we introduce the spaces of admissible control processes
[TABLE]
where is endowed with the norm
[TABLE]
We first prove some results about well posedness of the controlled equation (6.3). The main novelty towards Section 2 and the known results in the literature is that the controls are not necessarily bounded, together with the fact that evolves in a Banach space E.
Beside Hypothesis 2.1 we assume the following.
Hypothesis 6.1**.**
There exists such that with continuous embedding.
Remark 6.2**.**
Let be an operator satisfying Hypothesis 2.1-(i). If Hypothesis 6.1 holds true, then we have the following.
- (i)
For any and , and there exists a positive constant such that
[TABLE]
- (ii)
For any and , there exists a positive constant such that
[TABLE]
- (iii)
For any and , there exists a positive constant such that
[TABLE]
Remark 6.3**.**
Hypothesis 6.1 may be replaced by the weaker condition in Remark 6.2-(i). However, this condition would not imply Remark 6.2-(ii)-(iii).
Example 6.4**.**
Let be a bounded domain with smooth boundary. Set , , and let be the Laplace operator with Dirichlet boundary conditions. Then, Hypothesis 6.1 is satisfied with .
We will deal with mild solutions to (6.3), namely adapted processes such that
[TABLE]
for any , -a.s. For any , , we set
[TABLE]
Lemma 6.5**.**
*Let be an operator satisfying Hypothesis 2.1-(i), and assume that Hypothesis 6.1 holds true for some positive constant . Let , and set be the conjugate exponent of , i.e., . Then the following hold. *
- (i)
Case and .
For any , for any , -a.s., and there exists a positive constant such that
[TABLE]
- (ii)
Case and .
For any , for any , -a.s., and there exists a positive constant such that
[TABLE]
- (iii)
Case and .
*For any , for any , -a.s., and satisfies estimate (6.10) for some positive constant . *
Proof.
Let us prove item , items and follow from similar arguments. From Hypothesis 6.1, we have
[TABLE]
Therefore,
[TABLE]
Thanks to Lemma 6.5, arguing as in [6, Theorem 7.11] we deduce the following result, which is the counterpart of Proposition 2.5-(i) for the controlled equation.
Proposition 6.6**.**
Let Hypothesis 2.1 holds true, and assume that Hypothesis 6.1 holds true for some positive constant . Let , , and set be the conjugate exponent of . Then the following hold.
- (i)
Case , .
For any and , there exists a unique mild solution to (6.3) belonging to . Moreover, there exists a positive constant such that, for any ,
[TABLE]
- (ii)
Case , .
For any and , there exists a unique mild solution to (6.3) belonging to . Moreover, there exists a positive constant such that, for any ,
[TABLE]
- (iii)
Case , .
For any and , there exists a unique mild solution to (6.3) belonging to . Moreover, there exists a positive constant such that, for any ,
[TABLE]
Proof.
We show item , the proof of items and being analogous. Since by Lemma 6.5 the convolution defined in (6.9) is a well defined -valued process for any , it is possible to argue as in [6, Theorem 7.11]. Therefore, by applying the fixed point theorem we infer that for any , and , there exists a unique mild solution to (6.3) with replaced by its Yosida approximations , , such that satisfies (6.11). Further, the sequence converges as to the mild solution to (6.3). In particular, estimate (6.11) holds true also for .
6.1 The structure condition: the case
In this section we deal with control processes , and with the controlled equation
[TABLE]
satisfying the so called structure condition: the control affects the system only through the noise.
We make the following assumptions on the cost functional (6.4).
Hypothesis 6.7**.**
Let and be two measurable functions satisfying the following properties.
(i)
* is continuous and bounded.*
(ii)
For all , , the function is bounded and continuous from onto . For all , , the function is continuous from onto . Further, there exist positive constants such that, for all , , ,
[TABLE]
(iii)
There exists a positive constant such that, for all , , ,
[TABLE]
Remark 6.8**.**
Under Hypothesis 6.7-(ii), it is easy to see that there exist positive constants such that
[TABLE]
We introduce the Hamiltonian function
[TABLE]
Arguing as in [13, Lemma 3.1] we deduce an analogous result.
Lemma 6.9**.**
Let Hypotheses 6.7 be satisfied. Then, the function in (6.19) is Borel measurable, and there exists a positive constant such that
[TABLE]
Further, if the minimum in (6.19) is attained, it is attained in a ball of radius , i.e.,
[TABLE]
Finally, there exists a positive constant such that, for any , ,
[TABLE]
The HJB equation associated to the control problem (6.5), related to the controlled state equation (6.16), is given by
[TABLE]
where is defined in (6.19). The HJB equation (6.23) turns out to be a semilinear Kolmogorv equation as (4.18), with and satisfying Hypotehsis 3.1. So by Theorem 5.4 its mild solution can be represented in terms of the solution of the forward-backward system (3.1).
In the following Theorem we state and prove the fundamental relation, and we characterize the optimal control with a feedback law.
Theorem 6.10**.**
Let Hypotheses 2.1, 6.7 hold true, and assume that Hypothesis 6.1 holds true with a constant such that . Let be the mild solution of (6.16), be the value function of the control problem (6.5), and be the mild solution of the HJB equation (6.23). Then, for any and , the so called fundamental relation holds true:
[TABLE]
In particular, , for all . Moreover, if there exists a measurable function satisfying
[TABLE]
then
[TABLE]
and, thanks to (6.21), the process defined by
[TABLE]
belongs to and it is optimal.
Proof.
The proof is standard and follows the same lines of [13, Proposition 4.1]. We notice that, by Proposition 6.6-, problem (6.16) admits a unique mild solution for any . Further, for any , we introduce the family of stopping times defined by
[TABLE]
Then we proceed as in [13, Proposition 4.1], by applying the Girsanov Theorem and using the fact that satisfies Hypothesis 3.1-(ii), and that the pair of processes , solution to the Markovian BSDE in (3.1), are identified respectively with the solution of the HJB equation (6.23) and with its directional derivative . Namely, by Theorems 5.1 and 5.4, and .
6.2 The case with a special running cost
In the present section we deal with control processes , and with the controlled equation
[TABLE]
The controlled equation (6.26) has a different structure towards (6.16) considered in Subsection 6.1, so the problem is different, and we need different assumptions on the cost functional (6.4).
Hypothesis 6.11**.**
Let and be two measurable functions satisfying the following properties.
(i)
* is continuous and bounded.*
(ii)
For all , , the function is bounded and continuous from onto . For all , , the function is continuous from onto . Further, there exists positive constants such that, for all , and ,
[TABLE]
(iii)
There exists a positive constant such that, for any , , ,
[TABLE]
Remark 6.12**.**
Condition (6.28) in Hypothesis 6.11 implies that, if does not take values in , then . In particular, , so we can limit ourselves to consider here the space of admissible controls .
Remark 6.13**.**
Under Hypothesis 6.11-(ii), there exist positive constants such that, for any , , , we have .
We introduce the Hamiltonian function
[TABLE]
Arguing again as in [13, Lemma 3.1], we infer the following properties of .
Lemma 6.14**.**
Let Hypotheses 6.11 be satisfied. Then, the function in (6.29) is Borel measurable and there exists a positive constant such that
[TABLE]
Further, if the minimum in (6.29) is attained, it is attained in a ball of radius , i.e.,
[TABLE]
Finally, for any , , , there exists a positive constant such that
[TABLE]
The HJB equation associated to the control problem (6.5), related to the controlled state equation (6.26), is given by
[TABLE]
where is defined in (6.29). Again, the HJB equation (6.33) turns out to be a semilinear Kolmogorv equation as (4.18), with and satisfying Hypotehsis 3.1. So by Theorem 5.4 its mild solution can be represented in terms of the solution of the forward-backward system
[TABLE]
which is nothing else than the forward-backward system (3.1) with instead of .
As in Subsection 6.1, in the following Theorem we state and prove the fundamental relation, and we characterize the optimal control with a feedback law.
Theorem 6.15**.**
Let Hypotheses 2.1, 6.11 hold true, and assume that Hypothesis 6.1 holds true with a constant such that . Let be the mild solution of (6.26), be the value function of the control problem (6.5), and be the mild solution of the HJB equation (6.33). Then, for any and ,
[TABLE]
In particular, , for all , . Moreover, if there exists a measurable function satisfying
[TABLE]
then
[TABLE]
and, thanks to (6.31), the process
[TABLE]
belongs to and it is optimal.
Proof.
Notice that by Proposition 6.6-, for any there exists a unique mild solution to (6.26) which satisfies (6.12). The proof is similar to the one of Theorem 6.10. The main difference consists in the fact that, for any given , we introduce a family of stopping times depending on the norm :
[TABLE]
Then, we set , , and we introduce the process
[TABLE]
Afterwards, we apply the Girsanov Theorem: writing in (6.26), we get that is mild solution to
[TABLE]
By (6.32) in Lemma 6.14, we see that Hypothesis 3.1-(ii) is verified by . We conclude by arguing again as in [13, Proposition 4.1] and in Theorem 6.10.
6.3 The case with a general running cost
In this subsection we deal with the general controlled equation (6.26) under Hypothesis 6.7 on the coefficients of the cost functional, and we consider control processes . Unlike the two cases just treated, in this framework the HJB equation would not have the structure of equation (4.18 ) since the Hamiltonian function would depend on , not only on the directional derivative , see e.g. [9], formula (6.67) and the discussion related to formulas (4.278)-(4.279). Up to our knowledge, when in only continuous, the well posedness of such an equation is an open problem: in [3] an equation of this type is solved in mild sense with Lipschitz type assumptions on the final datum .
For this reason, we will not end up identifying the value function (6.5) with the solution of the HJB equation, but instead we will approximate it. The following result will be used in the aforementioned approximation of the value function.
Proposition 6.16**.**
Assume that Hypothesis 2.1 holds true. Let , and . Then,
[TABLE]
where and are respectively the mild solutions to (6.26) with control and .
Proof.
Let us set and let us assume that is a strict solution to
[TABLE]
otherwise we can use an approximation argument as in the proof of Proposition 2.5(ii). Then, the non-positivity of , the dissipativity of , the Cauchy-Schwartz inequality and the Young inequality give
[TABLE]
Integrating between and and applying the Gronwall Lemma, we get
[TABLE]
and we immediately deduce (6.36).
Thanks to Proposition 6.16 we deduce that, up to a subsequence, we can approximate in by means of mild solutions of problem (6.26), with replaced by , where satisfies in . In the following Proposition we prove that a similar approximation holds true in .
Proposition 6.17**.**
Let Hypothesis 2.1 holds true. Let , , and set be the conjugate exponent of . Assume that Hypothesis 6.1 holds true for some positive constant such that . Let and be such that in . Then, for any ,
[TABLE]
where be such that - a.s. for a.e. .
Proof.
As usual, we limit ourselves to consider the case . For any , let us set , where and are mild solutions to (6.26) with initial datum and control processes and , respectively. Further, let us denote by the subset of such that and on for a.e. . Then, for any
[TABLE]
which gives
[TABLE]
Let us estimate and separately. As far as is concerned, from the boundedness of on , Hypothesis 2.1- and (6.13), it follows that
[TABLE]
on . Further, from (6.6) it follows that
[TABLE]
on , for any . Since is continuous on , from (6.36) we infer that on as for any . The dominated convergence theorem implies that as on .
Concerning , from (6.6) and arguing as above we get
[TABLE]
on . This concludes the proof.
6.3.1 The approximate optimal control problem
We will consider the Hamiltonian function in (6.29) under Hypothesis 6.7. This prevents us to obtain directly estimates as those in Lemmas 6.9 and 6.14, since we don’t have the structure condition and the assumptions on are not sufficient to bound the term . For this reason, for any we introduce the function defined by
[TABLE]
Lemma 6.18**.**
Let be an operator satisfying Hypothesis 2.1-(i). Then the function in (6.38) satisfies the following conditions: for any ,
[TABLE]
Proof.
The first inequality directly comes from (6.38). On the other hand, for any , and , by Remark 6.8 we have
[TABLE]
For any , we introduce the approximate Hamiltonian function
[TABLE]
Estimates in Lemma 6.18 give the following result, which is analogous to Lemma 6.14.
Lemma 6.19**.**
Let Hypothesis 6.7 be satisfied, and let be an operator satisfying Hypothesis 2.1-(i). Then, for any , the function in (6.39) is Borel measurable, and there exists a positive constant such that
[TABLE]
Further, if the minimum in (6.39) is attained, it is attained in a ball of radius , i.e.,
[TABLE]
In particular, there exists a positive constant such that, for any , ,
[TABLE]
For any , we introduce the approximate cost functional defined by
[TABLE]
and the associated approximated optimal control problem
[TABLE]
The HJB equation associated to the control problem (6.43), related to the controlled state equation (6.26), is given by
[TABLE]
where is defined in (6.39). The HJB equation (6.44) is the analogous of (6.33) in Section 6.2. So again by Theorem 5.4, its solution can be represented in terms of the solution of the forward-backward system
[TABLE]
which is nothing else than the forward-backward system (6.34) with instead of .
We consider the following assumptions.
Hypothesis 6.20**.**
For any , there exists a measurable function satisfying
[TABLE]
We state now the analogous of Theorem 6.15 for the approximate optimal control problems (6.43).
Theorem 6.21**.**
Let Hypotheses 2.1, 6.7 hold true, and assume that Hypothesis 6.1 holds true with . Let be the solution of equation (6.26) and for any , let be the function defined in (6.43), and be the mild solution of the HJB equation (6.44). Then, for any and ,
[TABLE]
where is the solution to (6.49). In particular, , for all . Finally, if Hypothesis 6.20 holds true, then
[TABLE]
and, thanks to (6.31), the process
[TABLE]
belongs to and it is optimal.
6.3.2 A characterization of the value function
In the present section we show that the value function of the optimal control problem (6.5) can be approximated by the sequence of mild solutions to (6.44), that are identified with the approximated value functions , see formula (6.51) in Theorem 6.21. As a byproduct, we deduce that the sequence defined in (6.52) is a minimizing sequence for (6.5), and it is a bounded sequence in .
We start by introducing the Yosida approximations of , namely a suitable sequence which converges to in . Since for any , this would allow to approximate in terms of .
Definition 6.22**.**
For any ,
- (i)
we denote by any admissible control such that .
- (ii)
we denote by the Yosida approximations of , i.e.,
[TABLE]
Lemma 6.23**.**
Let be an operator satisfying Hypothesis 2.1-(i). Let and , , with , be the processes introduced in Definition 6.22. Then, for any , and ,
[TABLE]
for some positive constants , not depending neither on nor on . In particular, , -a.s., a.e. in as , and
[TABLE]
Proof.
Estimate (6.53) directly follows from the properties of . Further, the fact that -a.s., a.e. in as follows from the properties of Yosida approximations. Then, convergence (6.55) follows from the dominated convergence theorem, Finally, it easily follows that
[TABLE]
for any and any , where is the same positive constant as in (6.53). Interpolation estimates give (6.54).
Proposition 6.24**.**
Let Hypotheses 2.1, 6.7, 6.1 hold true. Let and let , , with , be the processes introduced in Definition 6.22, and let , be respectively the cost functionals in (6.4), (6.42). Then for any we have
[TABLE]
Proof.
Since pointwise converges to , a.e. in , -a.s., from (6.55) in Lemma 6.23 and Proposition 6.17 it follows that -a.s. in as for any . By dominated convergence theorem we deduce that
[TABLE]
To estimate the convergence of the approximate running cost in (6.38), we consider separately the two terms in (6.38). We stress that
[TABLE]
Arguing as above, from Hypothesis 6.7-(iii) and (6.37) we get
[TABLE]
Further, from (6.17) in Hypothesis 6.7-(ii) and (6.53) we infer that
[TABLE]
for any , -a.s. By dominated convergence theorem we get
[TABLE]
Moreover, the continuity of with respect to and the dominated convergence theorem give
[TABLE]
Finally, since , from (6.54) with we have
[TABLE]
and this concludes the proof.
The following theorem constitutes the main result of the section.
Theorem 6.25**.**
Let Hypotheses 2.1, 6.7, 6.20 hold true, and assume that Hypothesis 6.1 holds true with . For any , let and denote respectively the process in (6.52) and the mild solution to (6.44). Let , be respectively the functions in (6.5), (6.4). Then, for any ,
[TABLE]
Moreover, is bounded in .
Proof.
Let and . For any , let , be the processes introduced in Definition 6.22. By Proposition 6.24, there exists such that , for any which in turn gives
[TABLE]
Notice that, from the definitions of , in (6.5), (6.43), it follows that
[TABLE]
Further, from (6.54) with we get that , and therefore the definition of implies
[TABLE]
Then, collecting (6.57), (6.58) and (6.59), for any . Hence, , and the arbitrariness of gives
[TABLE]
Then the first equality in (6.56) follows from (6.60), recalling that, by Theorem 6.21, for any .
On the other hand, since for any ,
[TABLE]
so that, taking into account (6.60), the second equality in (6.56) follows.
Finally, let us prove that is bounded in . Assume by contradiction that there exists a subsequence such that for any . Then,
[TABLE]
On the other hand, since is nonnegative and satisfies (6.28),
[TABLE]
Therefore , which contradicts (6.56).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. M. Bismut. Martingales, the Malliavin calculus and hypoellipticity under general Hörmander’s conditions. Z. Wahrsch. Verw. Gebiete , 56:469–505, 1981.
- 2[2] P. Briand and F. Confortola. BSD Es with stochastic lipschitz condition and quadratic PDE s in Hilbert spaces. Stochastic Process. Appl. , 118(5):818–838, 2008.
- 3[3] S. Cerrai. Second order PDE’s in finite and infinite dimension, A probabilistic approach , volume 1762 of Lecture Notes in Mathematics . Springer-Verlag, Berlin, 2001.
- 4[4] G. Da Prato and J. Zabczyk. Ergodicity for infinite-dimensional systems , volume 229 of London Mathematical Society Lecture Note Series . Cambridge University Press, Cambridge, 1996.
- 5[5] G. Da Prato and J. Zabczyk. Second order partial differential equations in Hilbert spaces. , volume 293 of London Mathematical Society Note Series . Cambridge University Press, Cambridge, 2002.
- 6[6] G. Da Prato and J. Zabczyk. Stochastic equations in infinite dimensions , volume 152 of Encyclopedia of Mathematics and its Applications . Cambridge University Press, Cambridge, second edition, 2014.
- 7[7] F. Delbaen, Y. Hu, and X. Bao. Backward SD Es with superquadratic growth. Probab. Theory Related Fields , 150(1-2):145–192, 2011.
- 8[8] S. El Karoui, N. Peng and M. C. Quenez. Backward stochastic differential equations in finance. Mathematical finance , 7(1):1–71, 1997.
