Fr\'echet differentiability of mild solutions to SPDEs with respect to the initial datum
Carlo Marinelli, Luca Scarpa

TL;DR
This paper proves high-order Fréchet differentiability of mild solutions to jump-diffusion SPDEs in Hilbert spaces, enabling the construction of classical solutions to related non-local Kolmogorov equations.
Contribution
It establishes n-th order Fréchet differentiability of solutions with respect to initial data for a broad class of jump-diffusion SPDEs, extending previous differentiability results.
Findings
Proved well-posedness of the SPDEs in the mild sense.
Established first-order Gâteaux differentiability of solutions.
Demonstrated higher-order Fréchet differentiability of solutions.
Abstract
We establish n-th order Fr\'echet differentiability with respect to the initial datum of mild solutions to a class of jump-diffusions in Hilbert spaces. In particular, the coefficients are Lipschitz continuous, but their derivatives of order higher than one can grow polynomially, and the (multiplicative) noise sources are a cylindrical Wiener process and a quasi-left-continuous integer-valued random measure. As preliminary steps, we prove well-posedness in the mild sense for this class of equations, as well as first-order G\^ateaux differentiability of their solutions with respect to the initial datum, extending previous results in several ways. The differentiability results obtained here are a fundamental step to construct classical solutions to non-local Kolmogorov equations with sufficiently regular coefficients by probabilistic means.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Fréchet differentiability of mild solutions to SPDEs with
respect to the initial datum
Carlo Marinelli1 and Luca Scarpa2
1Department of Mathematics, University College London, Gower Street, London WC1E 6BT, UK.
2Fakultät für Mathematik, Universität Wien, Oskar-Morgenstern-Platz 1, 1090 Wien, Austria.
(August 2, 2019)
Abstract
We establish -th order Fréchet differentiability with respect to the initial datum of mild solutions to a class of jump-diffusions in Hilbert spaces. In particular, the coefficients are Lipschitz continuous, but their derivatives of order higher than one can grow polynomially, and the (multiplicative) noise sources are a cylindrical Wiener process and a quasi-left-continuous integer-valued random measure. As preliminary steps, we prove well-posedness in the mild sense for this class of equations, as well as first-order Gâteaux differentiability of their solutions with respect to the initial datum, extending previous results by Marinelli, Prévôt, and Röckner in several ways. The differentiability results obtained here are a fundamental step to construct classical solutions to non-local Kolmogorov equations with sufficiently regular coefficients by probabilistic means.
1 Introduction
Our goal is to obtain existence and uniqueness of mild solutions, and, especially, their differentiability with respect to the initial datum, to a class of stochastic evolution equations on Hilbert spaces of the form
[TABLE]
Here is a linear -accretive operator, is a cylindrical Wiener process, is a compensated integer-valued quasi-left-continuous random measure, and the coefficients , , satisfy suitable measurability and Lipschitz continuity conditions. Precise assumptions on the data of the problem are stated in Sections 2.1 and 3 below.
The results extend (and partially supersede) those obtained in [15] in several ways: (a) well-posedness is established here in much greater generality, in particular allowing to be a quite general random measure, rather than just a compensated Poisson measure as in [15]. Moreover, using a more precise maximal estimate for stochastic convolutions, solutions are no longer needed to be sought in spaces of processes with finite second moment (yet more general well-posedness results are going to appear in [14]); (b) the sufficient conditions on the coefficients of (1.1) for the differentiability of its solution with respect to the initial datum are the natural ones. For instance, roughly speaking, Fréchet differentiability of , , and imply Fréchet differentiability of the solution map , while in [15] a condition on , , and was needed. In fact, the proof in [15] was based on an implicit function theorem with parameters, for which the assumption seems indispensable, while here we use a direct approach based on the definition of derivative; (c) we study the -th order differentiability of the solution map for arbitrary natural , instead of considering only first and second-order differentiability as in [15]. In this regard it is worth mentioning that we just assume that the derivatives of , , and of order higher than one satisfy a polynomial growth condition. While this assumption causes non-trivial technical difficulties, it is more natural than much more restrictive boundedness conditions that are often found in the literature: a possible example of coefficients with nonbounded higher derivatives is given in Example 6.1 below.
There are several reasons to study the differentiability of solutions to stochastic equations in infinite dimensions with respect to the initial datum (or, more generally, with respect to parameters), among which the probabilistic construction of solutions to Kolmogorov equations is our main motivation. This vast and mature field of investigation is still very active, especially regarding stochastic equations with additive Wiener noise: see, e.g., [12] for classical results in the finite-dimensional case, [9] for basic results in the Hilbertian setting, and [4, 6, 7, 20] for accounts of more recent developments. On the other hand, the case of equations with discontinuous noise, for which the associated Kolmogorov equations are of non-local type, is much less investigated, especially in the infinite-dimensional setting (see [15] for simple results and [19] for a special case). As an application of the above-mentioned differentiability results, we shall construct, in a forthcoming work, classical solutions to non-local Kolmogorov equations with sufficiently regular coefficients. As is well known, such results are essential to consider Kolmogorov equations motivated by applications, that usually have less regular coefficients. In fact, a typical approach is, roughly speaking, to regularize the coefficients of the equation, thus obtaining a family of approximating Kolmogorov equations that are sufficiently simple to have classical solutions, and to obtain a solution to the original problem passing to the limit, in an appropriate sense, with respect to the regularization parameter. In this spirit, our ultimate goal is the extension of the results in [18] to non-local Kolmogorov equations associated to stochastic evolution equations with jumps in a generalized variational setting as considered in [17].
Since the literature on the problem at hand is very large, it is not easy to provide an accurate comparison of our results with existing ones, apart of the remarks already made. We should nonetheless mention the recent work [2], which considers a problem analogous to ours, but without discontinuous noise term and with coefficients with bounded derivatives of all orders. Here, the authors exploit the smoothing property of an analytic semigroup and study differentiability in negative order spaces.
The remaining text is organized as follows: in §2, after fixing some notation, we recall a characterization of Gâteaux and Fréchet differentiability, as well as some maximal estimates for deterministic and stochastic convolutions, all of which are essential tools. Well-posedness of (1.1), i.e. existence and uniqueness of a mild solution and its continuous dependence on the initial datum, is proved in §3. The remaining sections are devoted to differentiability properties of the mild solution to (1.1) with respect to the initial datum: first-order Gâteaux and Fréchet differentiability are treated in §4 and §5, respectively, and -th order Fréchet differentiability is considered in §6.
Acknowledgements. A large part of the work for this paper was done during several visits of the first-named author to the Interdiszplinäres Zentrum für Komplexe Systeme (IZKS) at the University of Bonn, Germany, and a visit to the University of Vienna, Austria. The warm hospitality of his hosts (S. Albeverio and U. Stefanelli, respectively) and the good working conditions are gratefully acknowledged. The second-named author is funded by Vienna Science and Technology Fund (WWTF) through Project MA14-009. The authors are indebted to G. Luise for contributing to a preliminary draft.
2 Preliminaries
2.1 Notation
The spaces of linear bounded operators from a Banach space to a further Banach space will be denoted by , and stands for the space of Hilbert-Schmidt operators from to if and are Hilbert spaces. The closed ball of radius in will be denoted by .
All stochastic elements will be defined on a fixed filtered probability space , with the filtration complete and right-continuous, and a fixed final time. Moreover, will always denote a fixed real separable Hilbert space with norm . For any and , we shall use the notation for the space of adapted càdlàg -valued processes such that
[TABLE]
and we set . We recall that these are Banach spaces if , and quasi-Banach spaces if . In the latter case the triangle inequality is reversed, but one has
[TABLE]
to which we shall also refer, with a harmless abuse of terminology, as the triangle inequality. Moreover, is a complete metric space for every when endowed with the distance
[TABLE]
as it follows from the inequality , which holds true for every , and . For brevity we shall write . Entirely analogously, endowed with the distance
[TABLE]
is a complete metric space for every .
Let be a real separable Hilbert space and a cylindrical Wiener process on . Let be a Blackwell measurable space and an integer-valued quasi-left-continuous random measure on , independent of , with dual predictable projetion (compensator) , and . We recall that the assumption on as a Blackwell space is usually required in the literature on random measures (see [11, §1a]), and it ensures for example that is separable and generated by a countable algebra. We also recall that the quasi-left-continuity of implies that the random measure is non-atomic (see, e.g., [11, Corollary 1.19, p. 70]). A map will be called predictable if it is -measurable, where stands for the predictable -algebra of (the target space is always assumed to be endowed with the Borel -algebra). Moreover, for any such predictable map , we set, for any , ,
[TABLE]
and
[TABLE]
where the infima are taken with respect to -measurable maps , only. One may actually show that as well as are (quasi-)Banach space and that
[TABLE]
For a proof of this statement, as well as of other properties of such mixed-norm spaces involving random measures (even in a more general setting), we refer to [10]. For us, however, it is enough to know that they are quasi-normed spaces, and the “norms” just introduced on spaces where the underlying measure is random is only a convenient notation. We shall also need to consider spaces where is replaced by , with , and the corresponding notation will be self-explanatory.
We shall use standard notation of stochastic calculus: we write, for instance, and to denote the maximal function and the left-limit function of a càdlàg function , respectively. Further notation related to deterministic and stochastic convolutions, as well as to different notions of derivative for maps between infinite-dimensional spaces, will be introduced where they first appear. For any , we use the notation to indicate that there exists a constant such that . If depends on some further quantities that we need to keep track of we shall indicate them in a subscript. We use the classical notation and for and , respectively.
2.2 Notions of derivative
Let , be Banach spaces, and be a subspace of . A function is Gâteaux differentiable at along if there exists a continuous linear map such that
[TABLE]
The linear map , which is necessarily unique, will be denoted by and is called the Gâteaux derivative of at (along the subspace , if ). If and is also Lipschitz continuous with Lipschitz constant , it easily follows from the definition that : indeed, for all we have
[TABLE]
The map is Fréchet differentiable at along the subspace if there exists a continuous linear map such that
[TABLE]
The (unique) map will be denoted by and is called the Fréchet derivative of at (along the subspace , in case ). It is well known that Fréchet differentiability implies Gâteaux differentiability, while the converse is not true. We shall often use the following characterization of Fréchet differentiability, of which we include a proof for the convenience of the reader.
Lemma 2.1**.**
A map is Fréchet differentiable at with if and only if for each bounded set one has
[TABLE]
uniformly with respect to .
Proof.
Let be Fréchet differentiable at with , and set . Then as . Let be a bounded set and a real number such that is included in the ball of of radius centered at zero. For any there exists such that for every with . Therefore, for any such that , one has and
[TABLE]
i.e. as uniformly with respect to . Let us now prove the converse implication: assume that (2.1) holds for every , uniformly with respect to , and that, by contradiction, is not Fréchet differentiable at , i.e. that does not converge to zero as . In particular, there exists a sequence converging to zero such that does not converge to zero. We claim that it cannot happen that
[TABLE]
as . In fact, setting , , and , this would imply that \varepsilon_{n}^{-1}\bigl{(}\phi(x_{0}+\varepsilon_{n}h_{n})-\varphi(x_{0})-\varepsilon_{n}Lh_{n}\bigr{)} converges to zero as , which is equivalent to . ∎
By a simple scaling argument it is evident that it is sufficient to consider as bounded subset the unit ball in . One can thus say that is Fréchet differentiable at along a subspace if there exists a continuous linear map such that
[TABLE]
For a comprehensive treatment of differential calculus for functions between topological vector spaces we refer to [1] for basic results in the case of Banach spaces, and to [3, 5] for the general case.
2.3 Estimates for deterministic and stochastic convolutions
Throughout this section stands for a strongly continuous linear semigroup of contractions on , and for its generator. Clearly, is necessarily a linear maximal monotone operator.
Here and in the following we shall use to denote convolution of and an -valued measurable function on , defined as
[TABLE]
under the minimal assumption that for all in a set of interest, usually a bounded interval of .
The following estimate for convolutions is trivial, but sufficient for our purposes.
Lemma 2.2**.**
For every and for every measurable adapted process such that , it holds that and
[TABLE]
Proof.
Minkowski’s inequality and contractivity of immediately yield
[TABLE]
We shall also need estimates for stochastic convolutions with respect to the cylindrical Wiener process , for which we shall always use the following notation: for any -valued process , the stochastic convolution is the process defined as
[TABLE]
under a stochastic integrability assumption on . There is an extensive literature on maximal estimates for stochastic convolutions, mostly obtained through the so-called factorization method by Da Prato, Kwapień, and Zabczyk [8], which requires to generate a holomorphic semigroup. The following estimate instead requires to be maximal monotone and can be proved by relatively elementary techniques of stochastic calculus (see, e.g., [13] for a proof in a more general context).
Proposition 2.3**.**
For every and for every progressively measurable, the stochastic convolution admits a modification in and
[TABLE]
Finally, a key role is played by the following maximal estimate for stochastic convolutions with respect to the compensated random measure . For a predictable -valued process , the stochastic convolution of with respect to will be denote by and defined as
[TABLE]
under a stochastic integrability assumption on with respect to .
Lemma 2.4**.**
For every and for every , the stochastic convolution admits a càdlàg modification and
[TABLE]
A proof can be found in [16]. A generalization of this inequality to -valued processes will appear in [14].
3 Well-posedness
This section is devoted to the proof of well-posedness of equation (1.1). We show existence and uniqueness of a mild solution, as well as its continuous dependence on the initial datum, in spaces of processes with finite moments of order . Although only the case is needed in the following sections on differentiability of the solution with respect to the initial datum, the general case is necessary to deal with initial data or driving random measures admitting finite moments of order strictly less than one. An example is given by -stable random measures with .
The following assumptions (A0)–(A4) on the coefficients and the initial datum of (1.1) are in force throughout the paper.
- (A0)
The initial datum is an -measurable random variable with values in ;
- (A1)
is a linear maximal monotone operator on , and is the strongly continuous semigroup of contractions generated by on ;
- (A2)
The function is such that is measurable and adapted for every , and there exists a constant such that
[TABLE]
for all , , and ;
- (A3)
The function is such that is progressively measurable for all , and there exists a constant such that
[TABLE]
for all , , and ;
- (A4)
The function is such that is -measurable for all . Moreover,
- (i)
if or , then there exists a -measurable function such that
[TABLE]
for all , , and ;
- (ii)
if , then there exist functions , , satisfying the same measurability properties of , with , and -measurable functions , such that, for ,
[TABLE]
for all , , and .
Further assumptions will be made when needed.
The concept of solution to (1.1) we shall work with is the following.
Definition 3.1**.**
An -valued adapted càdlàg process is a mild solution to (1.1) if
- (i)
for all -a.s.;
- (ii)
for all -a.s.;
- (iii)
there exists such that for all ;
- (iv)
one has
[TABLE]
as an identity in the sense of modifications.
In order to formulate the well-posedness result in the mild sense for (1.1), it is convenient to introduce an assumption depending on a parameter :
- (A5p)
Setting if , there exists a continuous increasing function , with , such that
[TABLE]
Theorem 3.2**.**
Let and (A5p)* be satisfied. For any , equation (1.1) admits a unique mild solution such that , with implicit constant independent of . Moreover, the solution map is Lipschitz continuous from to .*
Proof.
We are going to use a fixed-point argument in the metric space , with sufficiently small. By a classical patching argument, this will imply existence and uniqueness of a solution in . Let be the map formally defined on as
[TABLE]
Let us show that is in fact well defined on and that its image is contained in : one has
[TABLE]
where \big{\lVert}S(\cdot)u_{0}\big{\rVert}_{\mathbb{S}^{p}}\leq\lVert u_{0}\rVert_{L^{p}(\Omega;H)} by contractivity of the semigroup ; the elementary lemma 2.2 and linear growth of imply
[TABLE]
similarly, proposition 2.3 yields
[TABLE]
finally, it follows by proposition 2.4 that \big{\lVert}S\diamond_{\mu}G(u_{-})\big{\rVert}^{p}_{\mathbb{S}^{p}}\lesssim\big{\lVert}G(u_{-})\big{\rVert}^{p}_{\mathbb{G}^{p}}, where, if ,
[TABLE]
and, similarly, if ,
[TABLE]
Analogous arguments show that that is a contraction of , with to be chosen later. In fact, one has, with a slightly simplified notation,
[TABLE]
Let us estimate the three terms separately. The Lipschitz continuity of , , and yields
[TABLE]
so that
[TABLE]
Since is continuous with , it follows that there exists and a constant , which depends on , such that
[TABLE]
hence, by the Banach-Caccioppoli contraction principle, for any there exists a fixed point of the contraction , which is thus the unique solution in to (1.1). Choosing such that , with , and repeating the same argument on each interval , with , a unique solution to (1.1) can be constructed on the whole interval . Furthermore, for any , by (3.1)-(3.5), the unique solution satisfies
[TABLE]
where the implicit constant is independent of . Hence, there is small enough such that
[TABLE]
Performing now a patching argument as above on yields the desired estimate
[TABLE]
The argument to show the Lipschitz-continuity of is similar: let , , and , be the unique solutions to (1.1) with initial datum and , respectively. Using a patching argument as above, it suffices to show that is Lipschitz continuous on . To this purpose, One has
[TABLE]
where is a positive constant (that depends on ). Rearranging terms and performing a patching argument as above immediately yields the Lipschitz continuity of . ∎
Remark 3.3*.*
It immediately follows from the Lipschitz continuity of the solution map that one also has, in the same notation used above,
[TABLE]
with implicit constant depending on and .
4 Gâteaux differentiability of the solution map
In the previous section we have shown that the solution map is Lipschitz continuous from to . We are now going to show that Gâteaux differentiability of the coefficients of (1.1) implies Gâteaux differentiability of the solution map. For some applications (e.g. to study Kolmogorov equations associated to stochastic PDEs) it is sufficient to consider non-random initial data and to consider first-order derivatives as linear maps from to , i.e., roughly speaking, to consider only non-random directions of differentiability. However, the more general case of random initial data and random directions of differentiability considered here as well as in the next sections is conceptually not more difficult and, apart of being interesting in its own right because treated at the natural level of generality, it is necessary to study, for instance, higher-order stability issues of stochastic models with respect to perturbations of the initial datum.
We shall make the following additional assumption, which is assumed to hold throughout this section.
- (G1)
The maps and are Gâteaux differentiable for all , and the maps
[TABLE]
are Gâteaux differentiable for all .
The Gâteaux derivatives of , and (in their -valued argument) are denoted by
[TABLE]
Recalling that and are Lipschitz continuous in their -valued argument, uniformly over , we infer that
[TABLE]
for all , , and . Similarly, the Lipschitz continuity of implies, if , that
[TABLE]
and, if , that
[TABLE]
for all , , , and .
We begin with two general results that will be extensively used in the sequel. The first lemma is an immediate corollary of the well-posedness results.
Lemma 4.1**.**
Under the assumptions of Theorem 3.2, let be the unique mild solution to (1.1) with initial condition . For any , the linear stochastic evolution equation
[TABLE]
admits a unique mild solution that depends continuously on the initial datum .
Proof.
The linear maps and are bounded, uniformly over , hence, a fortiori, Lipschitz continuous. Analogously, the linear map has norm (and, a fortiori, Lipschitz constant) bounded by (with if ) on . Theorem 3.2 thus implies that, for any , (4.1) admits a unique mild solution , which depends continuously on . ∎
Note that, since the equation for is linear, it is immediate that the map is linear and continuous from to .
The next lemma will play a crucial role both in the proof of the Gâteaux differentiability of the solution map in this section, as well as in the proof of its Fréchet differentiability in the next section, taking into account Lemma 2.1.
Lemma 4.2**.**
Under the assumptions of Theorem 3.2, let and , the the unique mild solutions to (1.1) with initial conditions and , respectively. Moreover, let be the unique mild solution to (4.1) with initial condition . One has
[TABLE]
Proof.
Let , and consider the evolution equation
[TABLE]
One easily sees that it admits a unique mild solution , which coincides with the restriction of to . In particular, for any ,
[TABLE]
A completly analogous flow property holds for and . Then one has, by the triangle inequality,
[TABLE]
where, by abuse of notation, the (deterministic and stochastic) convolutions are defined on , in accordance to (4.2), and . We are going to estimate , and separately. To simplify the notation, let us set, for a generic mapping ,
[TABLE]
(with obvious modifications if and are replaced by and ), and note that
[TABLE]
(the formal operators and clearly depend also on , but we do not need to explicitly denote this fact). Recalling the elementary estimate of Lemma 2.2, one has
[TABLE]
where, by the Lipschitz continuity of ,
[TABLE]
The terms and can be handled similarly, thanks to the maximal inequalities of §2.3:
[TABLE]
where
[TABLE]
and
[TABLE]
where
[TABLE]
Recalling that is continuous with , these estimates imply that for every there exists such that, for any with , one has
[TABLE]
Fixing then sufficiently small and rearranging the terms yields
[TABLE]
where the implicit constant depends on and , , are “supported” on . Let be a subdivision of the interval such that for all . Then we have, for every , with obvious meaning of the notation,
[TABLE]
where
[TABLE]
Backward recursion thus yields
[TABLE]
where the first summand on the right-hand side is zero. To conclude the proof it suffices to show that
[TABLE]
for every . We shall show that this is true for , as both other cases are entirely similar (in fact slightly simpler): it is enough to observe that, for any satisfying suitable measurability conditions and for any , the obvious inequality
[TABLE]
implies , hence
[TABLE]
The main result of this section is the following. Note that, since the (standard) definition of Gâteaux derivative requires a Banach space framework, we shall confine ourself to the case .
Theorem 4.3**.**
Let and (A5p)* be satisfied. Then the solution map of (1.1) is Gâteaux differentiable from to , and its Gâteaux derivative at is , where is the unique mild solution to (4.1).*
Proof.
By Lemma 4.2, it is enough to show that
[TABLE]
converges to zero as tends to zero. By assumption (G1) it immediately follows that, as ,
[TABLE]
for a.a. . Moreover, recalling that the operator norms of and are bounded by the Lipschitz constants of and , respectively, the triangle inequality yields
[TABLE]
for a.a. . Since , the right-hand side belongs to as well as to , hence the first two terms in (4.3) converge to zero as by the dominated convergence theorem. Similarly, setting if , one has
[TABLE]
where the implicit constant is equal to for , and to for . Since
[TABLE]
as , as well as
[TABLE]
for all , -almost surely, for both and , one has, thanks to (A5p) and the dominated convergence theorem, recalling that ,
[TABLE]
-a.s. as . A further application of the dominated convergence theorem hence yields that the third term in (4.3) converges to zero as , thus completing the proof. ∎
5 Fréchet differentiability of the solution map
We are going to show that the Fréchet differentiability of the coefficients of (1.1) implies the Fréchet differentiability of the solution map. We shall work under the following assumption, that is assumed to hold throughout this section.
- (F)
The maps and are Fréchet differentiable for all , and the maps
[TABLE]
are Fréchet differentiable for all .
The Fréchet derivatives of and (in their -valued argument), denoted by
[TABLE]
satisfy the boundedness properties
[TABLE]
for all (see § 2.2). Similarly, and in complete analogy to the previous section, the Lipschitz continuity assumptions on , and imply that,
[TABLE]
The main result of this section is the following theorem, which states that the solution map is Fréchet differentiable along subspaces of vectors with finite higher moments.
Theorem 5.1**.**
Let . If (A5p)* and (A5q*)* hold, then the solution map of (1.1) is Fréchet differentiable from to along and its Fréchet derivative at is the map , where is the unique mild solution to the stochastic evolution equation*
[TABLE]
Proof.
For any , equation (5.1) admits a unique mild solution , as it follows immediately by the boundedness properties of the Fréchet derivatives of , and , and by hypothesis (A5q). Therefore the map is well defined from to , and it is obviously linear and continuous. To prove that this map is the Fréchet derivative of the solution map , thanks to the characterization of Fréchet differentiability of Lemma 2.1, it is enough to show that
[TABLE]
uniformly over belonging to bounded subsets of . By Lemma 4.2, for this it suffices to show that each term in (4.3) converges to zero uniformly with respect to belonging to the unit ball of . Since , it is evident that if belongs to then belongs to , where . Hence, denoting by , , the terms appearing in (4.3), by homogeneity
[TABLE]
Hence it suffices to show that , and converge to zero uniformly with respect to bounded in . That is, we need to show that, for any and , there exists such that implies for all and . For any measurable , one clearly has
[TABLE]
where, by the Lipschitz continuity of ,
[TABLE]
The set is bounded in , with , hence uniformly integrable on . In particular, for any there exists such that, for any with , one has
[TABLE]
hence . Let be arbitrary but fixed. Markov’s inequality yields, for any ,
[TABLE]
Therefore there exists such that, setting , one has . It is important to note that depends on , but not on , while depends on . The Fréchet differentiability hypothesis on amounts to saying that, for any and ,
[TABLE]
In particular, one has
[TABLE]
for a.a. , where, by the Lipschitz continuity of ,
[TABLE]
for a.a. . Therefore, by the dominated convergence theorem,
[TABLE]
that is, for any there exists depending only on and such that
[TABLE]
for all such that . It remains to observe that
[TABLE]
for a.a. to get that for all such that . Since depends only on , we conclude that there exists such that for all .
Let us now consider the term : the argument is similar to the one just carried out, so we provide slightly less detail. We have to show that converges to [math] uniformly with respect to . For any measurable , one has, with obvious meaning of the notation,
[TABLE]
where, by the Lipschitz-continuity of ,
[TABLE]
Choosing as before, using the uniform integrability of the family combined with the Markov inequality, we infer that for any there exists such that . The Fréchet differentiability of implies that, for any ,
[TABLE]
in , where, by the Lipschitz continuity of ,
[TABLE]
for a.a. . Hence, the dominated convergence theorem yields
[TABLE]
that is, for any there exists depending only on and such that
[TABLE]
for all such that , from which also for all such that . Hence, there exists such that for all with .
The convergence to zero of as , uniformly with respect to , while still similar to the above arguments, is slightly more delicate as random measures are involved. As already shown in the proof of Theorem 4.3, one has, recalling that Fréchet differentiability implies Gâteaux differentiability,
[TABLE]
as . We need to show that the convergence holds uniformly over bounded in . Let and . For any measurable , the Lipschitz continuity assumptions on and (A5p) imply, setting if , that
[TABLE]
As the set is bounded in , hence uniformly integrable, for any there exists (by Markov’s inequality) such that, choosing as before, we have
[TABLE]
On one has, possibly outside a set of -measure zero, for both and ,
[TABLE]
where the right-hand side converges to zero by the characterization of Fréchet differentiability of Lemma 2.1, and is bounded by for all . Since and -a.s. in , the dominated convergence theorem and (A5p) yield
[TABLE]
as , uniformly with respect to . Proceeding exactly as in the case of , we conclude that there exists such that for all .
We have thus shown that in , uniformly over in any bounded subset of , as claimed. ∎
6 Fréchet differentiability of higher order
In this section we show that the -th order Fréchet differentiability of the coefficients of (1.1), in a suitable sense, implies the -th order Fréchet differentiability of the solution map. We shall work under the following assumptions, that are stated in terms of the parameter , :
- (Fn)
The maps and are times Fréchet differentiable for all , and the maps , , , are times Fréchet differentiable for all . Moreover, there exists a constant such that, for every ,
[TABLE]
for all , and
[TABLE]
We also stipulate that (F1) is simply hypothesis (F) of the previous section. It would be possible to replace the functions , and with different ones, thus reaching a bit more generality, but it does not seem to be worth the (mostly notational) effort.
Example 6.1*.*
Let us give an explicit example where assumption (Fn) is satisfied with a suitable choice of and not for . We shall consider for simplicity and concentrate only on : typical examples for and can be produced following the same argument. Let , where is a smooth bounded domain, and consider the function
[TABLE]
It is not difficult to check that , is Lipschitz-continuous (hence ), and
[TABLE]
However, the derivatives are not bounded in for any . Furthermore, let us fix , and define the operator
[TABLE]
Clearly, is well-defined, Lipschitz-continuous and linearly bounded, so that (A2) is satisfied. Moreover, using the fact that it a standard matter to check that is Fréchet-differentiable, and its derivative is given by
[TABLE]
so that also assumption (F) is satisfied. Note in particular that the first derivative is also bounded in thanks to the Lipschitz-continuity of . Furthermore, using the fact that a direct computation shows that for every , with , is Fréchet-differentiable -times and
[TABLE]
For every , by the Hölder inequality and the properties of and we have that
[TABLE]
so that assumption (Fn) is satisfied for every with the choice . However, note that the higher-order derivatives of are not bounded in because of the choice of the function : hence, coefficients in this form cannot be treated using available results in literature (as for example [15]). On the other hand, these are nonetheless included in our analysis.
In the following we shall write, for compactness of notation, in place of . If (identified with the solution map , which is well defined if assumption (A5p) holds) is times Fréchet differentiable along , we have
[TABLE]
Under the assumptions of Theorem 5.1, is once Fréchet differentiable and satisfies the equation
[TABLE]
where is the identity map. This equation has to be interpreted in the sense that, for any , , setting , one has
[TABLE]
Note that by Lemma 4.1 this equation admits a unique solution also for , and that . However, if belongs only to , we can no longer claim that is the Fréchet derivative of , as Theorem 5.1 does not necessarily apply.
We are now going to introduce a system of equations, indexed by , that are formally expected to be satisfied by , , if they exist. For any , the equation for can be written as
[TABLE]
where , and are the formal -th Fréchet derivatives of , and , respectively, excluding the terms involving the (formal) derivative of of order . More precisely, assume that , and are Banach spaces and , are times Fréchet differentiable. The chain rule implies that there exists a function such that
[TABLE]
We set \Phi_{n}:=\tilde{\Phi}^{B}_{n}\bigl{(}{u}^{(1)},{u}^{(2)},\ldots,{u}^{(n-1)}\bigr{)}. The definition of and is, mutatis mutandis, identical.
The concept of solution for equation (6.1) is intended as in the case of the first order derivative equation, i.e. in the sense of testing against arbitrary directions. More precisely, we shall say that
[TABLE]
is a solution to (6.1) if, for any
[TABLE]
the process satisfies
[TABLE]
Let us show some properties of the coefficients , and . We are going to use some algebraic properties of the “representing” map . In particular, although a (kind of) explicit expression for can be written in terms of a variant of the Faà di Bruno formula (as it was done for example in [2]), for our purposes it suffices to know that is a sum of terms of the form
[TABLE]
with , , for all . Moreover, since is an -linear map on with values in (with being the cartesian product of by itself -times), one has that, for any , is a sum of terms of the form
[TABLE]
where , and is an element of the permutation group of . We shall also need the following identities, that we write already in the specific form needed later, although they are obviously a consequence of the definition of :
[TABLE]
where we have written, as customary, in place of . We are going to write, for the convenience of the reader, the first three formal derivatives of and the expressions for (the corresponding calculations for , , , and are entirely analogous). One has
[TABLE]
where we have used Schwarz’s theorem on the symmetry of higher-order continuous Fréchet derivatives.
The first result that we present concerns the existence and uniqueness of solutions to equation (6.1) in the sense specified above. More precisely, we show in the next proposition that equation (6.1) admits a unique solution , belonging to \mathscr{L}_{n}\bigl{(}\mathbb{L}^{p_{1}},\ldots,\mathbb{L}^{p_{n}};\mathbb{S}^{p}\bigr{)}. Note that to study differentiability we shall restrict to the case (see Remark 6.3 below). However, since well-posedness for linear stochastic equations for multilinear maps such as (6.1) could be interesting in its own right, we shall provide a general result considering arbitrary .
Proposition 6.2**.**
Let and be such that and
[TABLE]
Assume that
- (i)
hypothesis **(Fn*)** is satsfied;*
- (ii)
hypothesis **(A5r*)** holds for all .*
Then (6.1) admits a unique solution
[TABLE]
Proof.
First of all, let us explain why , if it exists, must be -linear (in the algebraic sense). Since is indeed a linear map, we can use induction as follows: assuming that is -linear for all , with , we are going to show that is -linear. The inductive assumption and the functional form of , , and imply that they are -linear. Considering the equation
[TABLE]
assuming that a solution exists for every , , it suffices to show that the map is -linear, which is immediate.
Let us focus now on existence. We are going to reason by induction on the order of (formal) derivation . The claim is certainly true for : Theorem 4.3 implies, thanks for assumption (ii), that for every , hence also for every , as then is contractively embedded in . Let us now assume the the claim is true for all , and consider with , for , such that
[TABLE]
In order to control the norm of it is enough to estimate
[TABLE]
In fact, recalling that , and are bounded linear operators (in the same sense as in the proofs of Theorems 4.3 and 5.1), one has, for any , omitting the indication of the arguments for simplicity of notation,
[TABLE]
where the implicit constant does not depend on (and also not on ). We proceed now as in the proof of Lemma 4.2: choosing sufficiently small and partitioning in intervals of lenght not exceeding , it follows from that
[TABLE]
as claimed. Let us consider the second term on the right-hand side of the previous inequality (the first one can be handled in a completely similar way). As already seen, the generic term in is of the form
[TABLE]
where , , , and is an element of the permutation group of . Since implies
[TABLE]
one has
[TABLE]
so that setting
[TABLE]
it holds
[TABLE]
Assumption (Fn) now implies
[TABLE]
which yields, thanks to the estimate ,
[TABLE]
where the implicit constant depends also on . Here and in the following we write, for simplicity of notation, for any càdlàg function . Hölder’s inequality yields
[TABLE]
where, as before, . It follows by the definition of and the inductive assumption that
[TABLE]
hence, recalling that \big{\lVert}u^{*m}\big{\rVert}_{\mathbb{L}^{p_{0}}}=\big{\lVert}u\big{\rVert}^{m}_{\mathbb{S}^{mp_{0}}}\lesssim 1+\big{\lVert}u_{0}\big{\rVert}^{m}_{\mathbb{L}^{mp_{0}}} by Theorem 3.2,
[TABLE]
Estimating the norm of is similar: using the same notation used thus far, the generic term in is of the type
[TABLE]
and hypothesis (Fn) implies
[TABLE]
for all , -a.s., for both and (we can identify again and with depending on the value of , and similarly for and ). This yields, after standard computations already detailed more than once,
[TABLE]
It hence follows by the inductive assumption, as before, that
[TABLE]
Since were arbitrary, we have proved that implies
[TABLE]
thus completing the induction argument by arbitrariness of . ∎
Remark 6.3*.*
If , condition (6.3) becomes
[TABLE]
which implies and , hence if . In particular, if , then , i.e. must be bounded almost surely. If , then will also be finite, and strictly larger than if . Furthermore, if , then implies . In fact, for this to be true it suffices that , which is equivalent to . But since , we can simply choose , which yields, excluding the case ,
[TABLE]
or, equivalently, .
We repeat, however, that even under these conditions we cannot yet claim that identifies the -th Fréchet derivative of . In fact, we shall prove that satisfies the equation for when “tested” on , with satisfying a strictly stronger constraint than just .
Before considering Fréchet differentiability of -th order, we need some preparations. The following two lemmata are used to apply the theorem on the Fréchet differentiability of the composition of two Fréchet differentiable functions.
By the assumptions (A2), (A3) and (A4), it follows immediately that the superposition operators associated to , and on , i.e. , can be considered as maps, denoted by the same symbols for simplicity,
[TABLE]
Lemma 6.4**.**
Let , , , and satisfy
[TABLE]
If hypothesis (Fn)* is satisfied, then , and are -times Fréchet differentiable in along , with*
[TABLE]
for all .
Proof.
We proceed by induction on , and we treat only the third term, as all other cases are analogous (in fact slightly simpler). If , the proof is exactly the same as the corresponding one of Theorem 5.1. In particular, one has
[TABLE]
hence, given and with ,
[TABLE]
as , uniformly over belonging to bounded subsets of . Assuming now that the statement is true for , let us show that it also holds for . By the inductive hypothesis we thus have
[TABLE]
Let and . The -th Fréchet derivatives
[TABLE]
exists for all , hence, setting , , one has, as ,
[TABLE]
for all , -a.s., uniformly with respect to in bounded sets of . For any , the fundamental theorem of calculus yields
[TABLE]
hence, since is arbitrary,
[TABLE]
for any , where, as already done before, if . The left-hand side of (6.5) is thus dominated for all , -a.s., modulo a constant, by the same expression appearing on the right-hand side of the previous inequality. This implies
[TABLE]
where, by Hölder’s inequality,
[TABLE]
In fact, these three inequalities follow from
[TABLE]
respectively, all of which are immediate consequences of the assumptions. The dominated convergence theorem thus yields
[TABLE]
as . It remains to show that the convergence is uniform with respect to bounded in . To this end, we proceed as in the case : for every measurable , the computations just carried out yield
[TABLE]
where the implicit constant depends on . Since are bounded in and , the product is bounded in . Therefore, as by assumption, it follows that \bigl{(}v_{1}^{*}\,\cdots\,v_{k+1}^{*}\bigr{)}^{p} is uniformly integrable. Similarly, defining by
[TABLE]
Hölder’s inequality yields
[TABLE]
where the right-hand side is finite by assumption. Since , \bigl{(}u^{*m}v_{1}^{*}\,\cdots\,v_{k+1}^{*}\bigr{)}^{p} is uniformly integrable. Finally, defining by
[TABLE]
Hölder’s inequality yields, recalling that ,
[TABLE]
hence \bigl{(}v_{1}^{*}\,\cdots\,v_{j}^{*}v_{j+1}^{*(m+1)}\bigr{)}^{p} is also uniformly integrable. One can now choose the set and proceed exactly as in the proof of Theorem 5.1 for the case to conclude. ∎
The previous lemma implies, in particular, that
[TABLE]
are times Fréchet differentiable for every . Indeed, for any such , one has , implying in particular that . Setting now , one has , , and .
In fact, if one has , from which and , hence . If one has , from which , and , hence . The assertion follows then from Lemma 6.4.
We can now state the main result of this section, as well as of the whole paper.
Theorem 6.5**.**
Let ,
[TABLE]
Assume (Fn)* and (A5r*)* for all . Then the solution map is times Fréchet differentiable. Moreover, is the unique mild solution to*
[TABLE]
Note that this equation is nothing else than (6.1), and must be interpreted as the latter, i.e. in the sense of testing against an -tuple of vectors in . Moreover, the initial condition of the equation is the identity map if , and zero if .
Proof.
We shall assume, for simplicity, that , as the argument in the general case , is entirely analogous. We are going to argue by induction on . The statement is true for by Theorem 5.1. Now we assume that the statement is true for all , , and we prove it for . Let , with . Thanks to Proposition 6.2 and the remarks following its proof, the equation
[TABLE]
admits a unique mild solution , because
[TABLE]
We are going to show that in . Let : for brevity, we shall use the notation and . One has
[TABLE]
where, by the inductive hypothesis, and in . We need to prove that the left-hand side of (6.6) converges to zero as in uniformly over belonging to bounded sets of . Thanks to (6.2), one has
[TABLE]
and we claim that
[TABLE]
in as , uniformly over belonging to bounded subsets of . In fact, all terms in are of the form
[TABLE]
with and , . Now, let be such that (which is possible because ). Then is times Fréchet differentiable by Lemma 6.4. Moreover, by the inductive hypothesis applied to , we have that is times Fréchet differentiable along if
[TABLE]
Therefore, if satisfies this condition, each term of the form (6.8) is Fréchet differentiable along by the theorem on the Fréchet differentiability of composite functions (see for example [1, Prop. 1.4]). Hence, (6.7) is indeed true, and the expression within parentheses in the first term on the right-hand side of (6.6) converges to
[TABLE]
in as , uniformly over belonging to bounded subsets of . Let us now consider the second term on the right-hand side of (6.6). One has, recalling that for every by inductive hypothesis,
[TABLE]
where the second term on the right-hand side converges to
[TABLE]
in as , uniformly over bounded in , because again everything depends only on derivatives of order at most and we can apply the usual criteria on Fréchet differentiability of multilinear maps and composite functions. Note that this term cancels out with the corresponding one obtained previously. Going back then to (6.6), testing by an arbitrary element , and using Lemma 2.4, we infer that
[TABLE]
Taking supremum over bounded in and using the Lipschitz-continuity of , we infer that, for every ,
[TABLE]
By the continuity of we can choose sufficiently small such that, after rearranging terms,
[TABLE]
Using the same argument leading to (6.4) in the proof of Proposition 6.2, a classical patching argument yields then
[TABLE]
on the whole time interval . Taking into account the remarks made above we have that
[TABLE]
We conclude that the left-hand side of converges to zero in as , uniformly with respect to belonging to any bounded subset of , as required. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Ambrosetti and G. Prodi, A primer of nonlinear analysis , Cambridge University Press, Cambridge, 1995. MR 1336591 (96a:58019)
- 2[2] A. Andersson, A. Jentzen, R. Kurniawan, and T. Welti, On the differentiability of solutions of stochastic evolution equations with respect to their initial values , Nonlinear Anal. 162 (2017), 128–161. MR 3695960
- 3[3] V. I. Averbukh and O. G. Smolyanov, Differentiation theory in linear topological spaces , Uspekhi Mat. Nauk 22 (1967), no. 6 (138), 201–260. MR 0223886
- 4[4] V. I. Bogachev, N. V. Krylov, M. Röckner, and S. V. Shaposhnikov, Fokker-Planck-Kolmogorov equations , American Mathematical Society, Providence, RI, 2015. MR 3443169
- 5[5] V. I. Bogachev and O. G. Smolyanov, Topological vector spaces and their applications , Springer, Cham, 2017. MR 3616849
- 6[6] S. Cerrai, Second order PDE’s in finite and infinite dimension , Lecture Notes in Mathematics, vol. 1762, Springer-Verlag, Berlin, 2001. MR 2002 j:35327
- 7[7] G. Da Prato, Kolmogorov equations for stochastic PD Es , Birkhäuser Verlag, Basel, 2004. MR 2111320 (2005 m:60002)
- 8[8] G. Da Prato, S. Kwapień, and J. Zabczyk, Regularity of solutions of linear stochastic equations in Hilbert spaces , Stochastics 23 (1987), no. 1, 1–23. MR 920798 (89b:60148)
