Fr\'echet differentiability of mild solutions to SPDEs with respect to   the initial datum

Carlo Marinelli; Luca Scarpa

arXiv:1812.09949·math.PR·December 11, 2020

Fr\'echet differentiability of mild solutions to SPDEs with respect to the initial datum

Carlo Marinelli, Luca Scarpa

PDF

TL;DR

This paper proves high-order Fréchet differentiability of mild solutions to jump-diffusion SPDEs in Hilbert spaces, enabling the construction of classical solutions to related non-local Kolmogorov equations.

Contribution

It establishes n-th order Fréchet differentiability of solutions with respect to initial data for a broad class of jump-diffusion SPDEs, extending previous differentiability results.

Findings

01

Proved well-posedness of the SPDEs in the mild sense.

02

Established first-order Gâteaux differentiability of solutions.

03

Demonstrated higher-order Fréchet differentiability of solutions.

Abstract

We establish n-th order Fr\'echet differentiability with respect to the initial datum of mild solutions to a class of jump-diffusions in Hilbert spaces. In particular, the coefficients are Lipschitz continuous, but their derivatives of order higher than one can grow polynomially, and the (multiplicative) noise sources are a cylindrical Wiener process and a quasi-left-continuous integer-valued random measure. As preliminary steps, we prove well-posedness in the mild sense for this class of equations, as well as first-order G\^ateaux differentiability of their solutions with respect to the initial datum, extending previous results in several ways. The differentiability results obtained here are a fundamental step to construct classical solutions to non-local Kolmogorov equations with sufficiently regular coefficients by probabilistic means.

Equations505

⎩ ⎨ ⎧ d u (t) + A u (t) d t = f (t, u (t)) d t + B (t, u (t)) d W (t) + \int_{Z} G (t, z, u (t -)) \overset{μ}{ˉ} (d t, d z), u (0) = u_{0} .

⎩ ⎨ ⎧ d u (t) + A u (t) d t = f (t, u (t)) d t + B (t, u (t)) d W (t) + \int_{Z} G (t, z, u (t -)) \overset{μ}{ˉ} (d t, d z), u (0) = u_{0} .

\big{\lVert}Y\big{\rVert}_{\mathbb{S}^{p}(t_{0},t_{1})}:=\Bigl{(}\mathop{{}\mathbb{E}}\nolimits\sup_{t\in[t_{0},t_{1}]}\lVert Y(t)\rVert^{p}\Bigr{)}^{1/p}<+\infty,

\big{\lVert}Y\big{\rVert}_{\mathbb{S}^{p}(t_{0},t_{1})}:=\Bigl{(}\mathop{{}\mathbb{E}}\nolimits\sup_{t\in[t_{0},t_{1}]}\lVert Y(t)\rVert^{p}\Bigr{)}^{1/p}<+\infty,

\big{\lVert}Y_{1}+Y_{1}\big{\rVert}_{\mathbb{S}^{p}(t_{0},t_{1})}\leq 2^{1/p}\bigl{(}\big{\lVert}Y_{1}\big{\rVert}_{\mathbb{S}^{p}(t_{0},t_{1})}+\big{\lVert}Y_{2}\big{\rVert}_{\mathbb{S}^{p}(t_{0},t_{1})}\bigr{)},

\big{\lVert}Y_{1}+Y_{1}\big{\rVert}_{\mathbb{S}^{p}(t_{0},t_{1})}\leq 2^{1/p}\bigl{(}\big{\lVert}Y_{1}\big{\rVert}_{\mathbb{S}^{p}(t_{0},t_{1})}+\big{\lVert}Y_{2}\big{\rVert}_{\mathbb{S}^{p}(t_{0},t_{1})}\bigr{)},

d_{p,t_{0},t_{1}}(Y_{1},Y_{2}):=\big{\lVert}Y_{1}-Y_{2}\big{\rVert}_{\mathbb{S}^{p}(t_{0},t_{1})}^{1\wedge p},

d_{p,t_{0},t_{1}}(Y_{1},Y_{2}):=\big{\lVert}Y_{1}-Y_{2}\big{\rVert}_{\mathbb{S}^{p}(t_{0},t_{1})}^{1\wedge p},

(Y_{1},Y_{2})\mapsto\big{\lVert}Y_{1}-Y_{2}\big{\rVert}_{L^{p}(\Omega;H)}^{1\wedge p}

(Y_{1},Y_{2})\mapsto\big{\lVert}Y_{1}-Y_{2}\big{\rVert}_{L^{p}(\Omega;H)}^{1\wedge p}

\big{\lVert}g\big{\rVert}_{L^{q}(\nu;H)}:=\biggl{(}\int_{\mathopen{]}0,T\mathclose{]}\times Z}\lVert g\rVert^{q}\,d\nu\biggr{)}^{1/q},\qquad\big{\lVert}g\big{\rVert}_{L^{p}(\Omega;L^{q}(\nu;H))}:=\biggl{(}\mathop{{}\mathbb{E}}\nolimits\biggl{(}\int_{\mathopen{]}0,T\mathclose{]}\times Z}\lVert g\rVert^{q}\,d\nu\biggr{)}^{p/q}\biggr{)}^{1/p}

\big{\lVert}g\big{\rVert}_{L^{q}(\nu;H)}:=\biggl{(}\int_{\mathopen{]}0,T\mathclose{]}\times Z}\lVert g\rVert^{q}\,d\nu\biggr{)}^{1/q},\qquad\big{\lVert}g\big{\rVert}_{L^{p}(\Omega;L^{q}(\nu;H))}:=\biggl{(}\mathop{{}\mathbb{E}}\nolimits\biggl{(}\int_{\mathopen{]}0,T\mathclose{]}\times Z}\lVert g\rVert^{q}\,d\nu\biggr{)}^{p/q}\biggr{)}^{1/p}

\big{\lVert}g\big{\rVert}_{\mathbb{G}^{p}}:=\begin{cases}\big{\lVert}g\big{\rVert}_{L^{p}(\Omega;L^{2}(\nu;H))}\qquad&\text{if }p\in\mathopen{]}0,1],\\[4.0pt] \displaystyle\inf_{g_{1}+g_{2}=g}\bigl{(}\big{\lVert}g_{1}\big{\rVert}_{L^{p}(\Omega;L^{2}(\nu;H))}+\big{\lVert}g_{2}\big{\rVert}_{L^{p}(\Omega;L^{p}(\nu;H))}\bigr{)}&\text{if }p\in\mathopen{]}1,2\mathclose{[},\\[4.0pt] \big{\lVert}g\big{\rVert}_{L^{p}(\Omega;L^{2}(\nu;H))}+\big{\lVert}g\big{\rVert}_{L^{p}(\Omega;L^{p}(\nu;H))}&\text{if }p\in[2,\infty\mathclose{[},\end{cases}

\big{\lVert}g\big{\rVert}_{\mathbb{G}^{p}}:=\begin{cases}\big{\lVert}g\big{\rVert}_{L^{p}(\Omega;L^{2}(\nu;H))}\qquad&\text{if }p\in\mathopen{]}0,1],\\[4.0pt] \displaystyle\inf_{g_{1}+g_{2}=g}\bigl{(}\big{\lVert}g_{1}\big{\rVert}_{L^{p}(\Omega;L^{2}(\nu;H))}+\big{\lVert}g_{2}\big{\rVert}_{L^{p}(\Omega;L^{p}(\nu;H))}\bigr{)}&\text{if }p\in\mathopen{]}1,2\mathclose{[},\\[4.0pt] \big{\lVert}g\big{\rVert}_{L^{p}(\Omega;L^{2}(\nu;H))}+\big{\lVert}g\big{\rVert}_{L^{p}(\Omega;L^{p}(\nu;H))}&\text{if }p\in[2,\infty\mathclose{[},\end{cases}

G^{p} = ⎩ ⎨ ⎧ L^{p} (Ω; L^{2} (ν; H)), L^{p} (Ω; L^{2} (ν; H)) + L^{p} (Ω; L^{p} (ν; H)), L^{p} (Ω; L^{2} (ν; H)) \cap L^{p} (Ω; L^{p} (ν; H)), p \in] 0, 1], p \in [1, 2], p \in [2, \infty [.

G^{p} = ⎩ ⎨ ⎧ L^{p} (Ω; L^{2} (ν; H)), L^{p} (Ω; L^{2} (ν; H)) + L^{p} (Ω; L^{p} (ν; H)), L^{p} (Ω; L^{2} (ν; H)) \cap L^{p} (Ω; L^{p} (ν; H)), p \in] 0, 1], p \in [1, 2], p \in [2, \infty [.

ε \to 0 lim \frac{ϕ ( x _{0} + ε h ) - ϕ ( x _{0} )}{ε} = L h \forall h \in G .

ε \to 0 lim \frac{ϕ ( x _{0} + ε h ) - ϕ ( x _{0} )}{ε} = L h \forall h \in G .

∥ D_{G} ϕ (x_{0}) h ∥_{F} = ε \to 0 lim \frac{∥ ϕ ( x _{0} + ε h ) - ϕ ( x _{0} ) ∥ _{F}}{ε} \leq L_{ϕ} \frac{∥ x _{0} + ε h - x _{0} ∥ _{E}}{ε} = L_{ϕ} ∥ h ∥_{E} .

∥ D_{G} ϕ (x_{0}) h ∥_{F} = ε \to 0 lim \frac{∥ ϕ ( x _{0} + ε h ) - ϕ ( x _{0} ) ∥ _{F}}{ε} \leq L_{ϕ} \frac{∥ x _{0} + ε h - x _{0} ∥ _{E}}{ε} = L_{ϕ} ∥ h ∥_{E} .

h \to 0 lim \frac{ϕ ( x _{0} + h ) - ϕ ( x _{0} ) - L h}{∥ h ∥ _{G}} = 0.

h \to 0 lim \frac{ϕ ( x _{0} + h ) - ϕ ( x _{0} ) - L h}{∥ h ∥ _{G}} = 0.

ε \to 0 lim \frac{ϕ ( x _{0} + ε h ) - ϕ ( x _{0} ) - ε L h}{ε} = 0

ε \to 0 lim \frac{ϕ ( x _{0} + ε h ) - ϕ ( x _{0} ) - ε L h}{ε} = 0

∥ R (ε h)∥ \leq η \frac{∥ ε h ∥}{M} = η ∣ ε ∣ \frac{∥ h ∥}{M} \leq η ∣ ε ∣,

∥ R (ε h)∥ \leq η \frac{∥ ε h ∥}{M} = η ∣ ε ∣ \frac{∥ h ∥}{M} \leq η ∣ ε ∣,

h \in B sup \frac{ϕ ( x _{0} + ε h ) - ϕ ( x _{0} ) - ε L h}{ε} ⟶ 0

h \in B sup \frac{ϕ ( x _{0} + ε h ) - ϕ ( x _{0} ) - ε L h}{ε} ⟶ 0

ε \to 0 lim \frac{ϕ ( x _{0} + ε h ) - ϕ ( x _{0} ) - ε L h}{ε} = 0 uniformly on {h \in G : ∥ h ∥_{G} \leq 1.}

ε \to 0 lim \frac{ϕ ( x _{0} + ε h ) - ϕ ( x _{0} ) - ε L h}{ε} = 0 uniformly on {h \in G : ∥ h ∥_{G} \leq 1.}

S * g : R_{+} ∋ t ⟼ \int_{0}^{t} S (t - s) g (s) d s,

S * g : R_{+} ∋ t ⟼ \int_{0}^{t} S (t - s) g (s) d s,

\big{\lVert}S\ast\phi\big{\rVert}_{\mathbb{S}^{p}(0,T)}\leq\big{\lVert}\phi\big{\rVert}_{L^{p}(\Omega;L^{1}(0,T;H))}.

\big{\lVert}S\ast\phi\big{\rVert}_{\mathbb{S}^{p}(0,T)}\leq\big{\lVert}\phi\big{\rVert}_{L^{p}(\Omega;L^{1}(0,T;H))}.

\mathop{{}\mathbb{E}}\nolimits\sup_{t\leq T}\bigg{\lVert}\int_{0}^{t}S(t-s)\phi(s)\,ds\bigg{\rVert}^{p}\leq\mathop{{}\mathbb{E}}\nolimits\biggl{(}\sup_{t\leq T}\int_{0}^{t}\big{\lVert}S(t-s)\phi(s)\big{\rVert}\,ds\biggr{)}^{p}\leq\mathop{{}\mathbb{E}}\nolimits\biggl{(}\int_{0}^{T}\big{\lVert}\phi(s)\big{\rVert}\,ds\biggr{)}^{p}.\qed

\mathop{{}\mathbb{E}}\nolimits\sup_{t\leq T}\bigg{\lVert}\int_{0}^{t}S(t-s)\phi(s)\,ds\bigg{\rVert}^{p}\leq\mathop{{}\mathbb{E}}\nolimits\biggl{(}\sup_{t\leq T}\int_{0}^{t}\big{\lVert}S(t-s)\phi(s)\big{\rVert}\,ds\biggr{)}^{p}\leq\mathop{{}\mathbb{E}}\nolimits\biggl{(}\int_{0}^{T}\big{\lVert}\phi(s)\big{\rVert}\,ds\biggr{)}^{p}.\qed

S ⋄ G (t) := \int_{0}^{t} S (t - s) G (s) d W (s), t \geq 0,

S ⋄ G (t) := \int_{0}^{t} S (t - s) G (s) d W (s), t \geq 0,

\big{\lVert}S\diamond G\big{\rVert}_{\mathbb{S}^{p}(0,T)}\lesssim_{p}\big{\lVert}G\big{\rVert}_{L^{p}(\Omega;L^{2}(0,T;\mathscr{L}^{2}(K,H)))}.

\big{\lVert}S\diamond G\big{\rVert}_{\mathbb{S}^{p}(0,T)}\lesssim_{p}\big{\lVert}G\big{\rVert}_{L^{p}(\Omega;L^{2}(0,T;\mathscr{L}^{2}(K,H)))}.

S ⋄_{μ} g (t) := \int_{] 0, t]} \int_{Z} S (t - s) g (s, z) \overset{μ}{ˉ} (d s, d z), t \geq 0,

S ⋄_{μ} g (t) := \int_{] 0, t]} \int_{Z} S (t - s) g (s, z) \overset{μ}{ˉ} (d s, d z), t \geq 0,

\big{\lVert}S\diamond_{\mu}g\big{\rVert}_{\mathbb{S}^{p}}\lesssim\big{\lVert}g\big{\rVert}_{\mathbb{G}^{p}}.

\big{\lVert}S\diamond_{\mu}g\big{\rVert}_{\mathbb{S}^{p}}\lesssim\big{\lVert}g\big{\rVert}_{\mathbb{G}^{p}}.

∥ f (ω, t, x)∥

∥ f (ω, t, x)∥

∥ f (ω, t, x) - f (ω, t, y)∥

\displaystyle\big{\lVert}B(\omega,t,x)\big{\rVert}_{\mathscr{L}^{2}(K,H)}

\displaystyle\big{\lVert}B(\omega,t,x)\big{\rVert}_{\mathscr{L}^{2}(K,H)}

\displaystyle\big{\lVert}B(\omega,t,x)-B(\omega,t,y)\big{\rVert}_{\mathscr{L}^{2}(K,H)}

\displaystyle\big{\lVert}G(\omega,t,z,x)-G(\omega,t,z,y)\big{\rVert}

\displaystyle\big{\lVert}G(\omega,t,z,x)-G(\omega,t,z,y)\big{\rVert}

\displaystyle\big{\lVert}G(\omega,t,z,x)\big{\rVert}

\displaystyle\big{\lVert}G_{j}(\omega,t,z,x)-G_{j}(\omega,t,z,y)\big{\rVert}

\displaystyle\big{\lVert}G_{j}(\omega,t,z,x)-G_{j}(\omega,t,z,y)\big{\rVert}

\displaystyle\big{\lVert}G_{j}(\omega,t,z,x)\big{\rVert}

u = S (\cdot) u_{0} + S * f (u) + S ⋄ B (u) + S ⋄_{μ} G (u_{-})

u = S (\cdot) u_{0} + S * f (u) + S ⋄ B (u) + S ⋄_{μ} G (u_{-})

1_{\{p>1\}}\biggl{(}\int_{Z\times[t_{0},t_{1}]}g_{1}^{p}(\omega,s,z)\,d\nu\biggr{)}^{1/p}+\biggl{(}\int_{Z\times[t_{0},t_{1}]}g_{2}^{2}(\omega,s,z)\,d\nu\biggr{)}^{1/2}\leq\kappa(t_{1}-t_{0})\qquad\forall\omega\in\Omega.

1_{\{p>1\}}\biggl{(}\int_{Z\times[t_{0},t_{1}]}g_{1}^{p}(\omega,s,z)\,d\nu\biggr{)}^{1/p}+\biggl{(}\int_{Z\times[t_{0},t_{1}]}g_{2}^{2}(\omega,s,z)\,d\nu\biggr{)}^{1/2}\leq\kappa(t_{1}-t_{0})\qquad\forall\omega\in\Omega.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Fréchet differentiability of mild solutions to SPDEs with

respect to the initial datum

Carlo Marinelli1 and Luca Scarpa2

1Department of Mathematics, University College London, Gower Street, London WC1E 6BT, UK.

2Fakultät für Mathematik, Universität Wien, Oskar-Morgenstern-Platz 1, 1090 Wien, Austria.

(August 2, 2019)

Abstract

We establish $n$ -th order Fréchet differentiability with respect to the initial datum of mild solutions to a class of jump-diffusions in Hilbert spaces. In particular, the coefficients are Lipschitz continuous, but their derivatives of order higher than one can grow polynomially, and the (multiplicative) noise sources are a cylindrical Wiener process and a quasi-left-continuous integer-valued random measure. As preliminary steps, we prove well-posedness in the mild sense for this class of equations, as well as first-order Gâteaux differentiability of their solutions with respect to the initial datum, extending previous results by Marinelli, Prévôt, and Röckner in several ways. The differentiability results obtained here are a fundamental step to construct classical solutions to non-local Kolmogorov equations with sufficiently regular coefficients by probabilistic means.

1 Introduction

Our goal is to obtain existence and uniqueness of mild solutions, and, especially, their differentiability with respect to the initial datum, to a class of stochastic evolution equations on Hilbert spaces of the form

[TABLE]

Here $A$ is a linear $m$ -accretive operator, $W$ is a cylindrical Wiener process, $\bar{\mu}$ is a compensated integer-valued quasi-left-continuous random measure, and the coefficients $f$ , $B$ , $G$ satisfy suitable measurability and Lipschitz continuity conditions. Precise assumptions on the data of the problem are stated in Sections 2.1 and 3 below.

The results extend (and partially supersede) those obtained in [15] in several ways: (a) well-posedness is established here in much greater generality, in particular allowing $\bar{\mu}$ to be a quite general random measure, rather than just a compensated Poisson measure as in [15]. Moreover, using a more precise maximal estimate for stochastic convolutions, solutions are no longer needed to be sought in spaces of processes with finite second moment (yet more general well-posedness results are going to appear in [14]); (b) the sufficient conditions on the coefficients of (1.1) for the differentiability of its solution with respect to the initial datum are the natural ones. For instance, roughly speaking, Fréchet differentiability of $f$ , $B$ , and $G$ imply Fréchet differentiability of the solution map $u_{0}\mapsto u$ , while in [15] a $C^{1}$ condition on $f$ , $B$ , and $G$ was needed. In fact, the proof in [15] was based on an implicit function theorem with parameters, for which the $C^{1}$ assumption seems indispensable, while here we use a direct approach based on the definition of derivative; (c) we study the $n$ -th order differentiability of the solution map for arbitrary natural $n$ , instead of considering only first and second-order differentiability as in [15]. In this regard it is worth mentioning that we just assume that the derivatives of $f$ , $B$ , and $G$ of order higher than one satisfy a polynomial growth condition. While this assumption causes non-trivial technical difficulties, it is more natural than much more restrictive boundedness conditions that are often found in the literature: a possible example of coefficients with nonbounded higher derivatives is given in Example 6.1 below.

There are several reasons to study the differentiability of solutions to stochastic equations in infinite dimensions with respect to the initial datum (or, more generally, with respect to parameters), among which the probabilistic construction of solutions to Kolmogorov equations is our main motivation. This vast and mature field of investigation is still very active, especially regarding stochastic equations with additive Wiener noise: see, e.g., [12] for classical results in the finite-dimensional case, [9] for basic results in the Hilbertian setting, and [4, 6, 7, 20] for accounts of more recent developments. On the other hand, the case of equations with discontinuous noise, for which the associated Kolmogorov equations are of non-local type, is much less investigated, especially in the infinite-dimensional setting (see [15] for simple results and [19] for a special case). As an application of the above-mentioned differentiability results, we shall construct, in a forthcoming work, classical solutions to non-local Kolmogorov equations with sufficiently regular coefficients. As is well known, such results are essential to consider Kolmogorov equations motivated by applications, that usually have less regular coefficients. In fact, a typical approach is, roughly speaking, to regularize the coefficients of the equation, thus obtaining a family of approximating Kolmogorov equations that are sufficiently simple to have classical solutions, and to obtain a solution to the original problem passing to the limit, in an appropriate sense, with respect to the regularization parameter. In this spirit, our ultimate goal is the extension of the results in [18] to non-local Kolmogorov equations associated to stochastic evolution equations with jumps in a generalized variational setting as considered in [17].

Since the literature on the problem at hand is very large, it is not easy to provide an accurate comparison of our results with existing ones, apart of the remarks already made. We should nonetheless mention the recent work [2], which considers a problem analogous to ours, but without discontinuous noise term and with coefficients with bounded derivatives of all orders. Here, the authors exploit the smoothing property of an analytic semigroup and study differentiability in negative order spaces.

The remaining text is organized as follows: in §2, after fixing some notation, we recall a characterization of Gâteaux and Fréchet differentiability, as well as some maximal estimates for deterministic and stochastic convolutions, all of which are essential tools. Well-posedness of (1.1), i.e. existence and uniqueness of a mild solution and its continuous dependence on the initial datum, is proved in §3. The remaining sections are devoted to differentiability properties of the mild solution to (1.1) with respect to the initial datum: first-order Gâteaux and Fréchet differentiability are treated in §4 and §5, respectively, and $n$ -th order Fréchet differentiability is considered in §6.

Acknowledgements. A large part of the work for this paper was done during several visits of the first-named author to the Interdiszplinäres Zentrum für Komplexe Systeme (IZKS) at the University of Bonn, Germany, and a visit to the University of Vienna, Austria. The warm hospitality of his hosts (S. Albeverio and U. Stefanelli, respectively) and the good working conditions are gratefully acknowledged. The second-named author is funded by Vienna Science and Technology Fund (WWTF) through Project MA14-009. The authors are indebted to G. Luise for contributing to a preliminary draft.

2 Preliminaries

2.1 Notation

The spaces of linear bounded operators from a Banach space $E$ to a further Banach space $F$ will be denoted by $\mathscr{L}(E,F)$ , and $\mathscr{L}^{2}(E,F)$ stands for the space of Hilbert-Schmidt operators from $E$ to $F$ if $E$ and $F$ are Hilbert spaces. The closed ball of radius $r>0$ in $E$ will be denoted by $B_{r}(E)$ .

All stochastic elements will be defined on a fixed filtered probability space $(\Omega,\mathcal{F},\mathbb{F},\mathbb{P})$ , with the filtration $\mathbb{F}:=(\mathcal{F}_{t})_{t\in[0,T]}$ complete and right-continuous, and $T>0$ a fixed final time. Moreover, $H$ will always denote a fixed real separable Hilbert space with norm $\lVert\cdot\rVert$ . For any $p>0$ and $[t_{0},t_{1}]\subseteq[0,T]$ , we shall use the notation $\mathbb{S}^{p}(t_{0},t_{1})$ for the space of adapted càdlàg $H$ -valued processes $Y$ such that

[TABLE]

and we set $\mathbb{S}^{p}:=\mathbb{S}^{p}(0,T)$ . We recall that these are Banach spaces if $p\geq 1$ , and quasi-Banach spaces if $p\in\mathopen{]}0,1\mathclose{[}$ . In the latter case the triangle inequality is reversed, but one has

[TABLE]

to which we shall also refer, with a harmless abuse of terminology, as the triangle inequality. Moreover, $\mathbb{S}^{p}(t_{0},t_{1})$ is a complete metric space for every $p>0$ when endowed with the distance

[TABLE]

as it follows from the inequality $\lvert x+y\rvert^{p}\leq\lvert x\rvert^{p}+\lvert y\rvert^{p}$ , which holds true for every $x$ , $y\in\mathbb{R}$ and $p\in\mathopen{]}0,1\mathclose{[}$ . For brevity we shall write $d_{p}:=d_{p,0,T}$ . Entirely analogously, $L^{p}(\Omega;H)$ endowed with the distance

[TABLE]

is a complete metric space for every $p>0$ .

Let $K$ be a real separable Hilbert space and $W$ a cylindrical Wiener process on $K$ . Let $(Z,\mathscr{Z})$ be a Blackwell measurable space and $\mu$ an integer-valued quasi-left-continuous random measure on $Z\times[0,T]$ , independent of $W$ , with dual predictable projetion (compensator) $\nu$ , and $\bar{\mu}:=\mu-\nu$ . We recall that the assumption on $(Z,\mathscr{Z})$ as a Blackwell space is usually required in the literature on random measures (see [11, §1a]), and it ensures for example that $\mathscr{Z}$ is separable and generated by a countable algebra. We also recall that the quasi-left-continuity of $\mu$ implies that the random measure $\nu$ is non-atomic (see, e.g., [11, Corollary 1.19, p. 70]). A map $g:\Omega\times[0,T]\times Z\to H$ will be called predictable if it is $\mathscr{P}\otimes\mathscr{Z}$ -measurable, where $\mathscr{P}$ stands for the predictable $\sigma$ -algebra of $\mathbb{F}$ (the target space $H$ is always assumed to be endowed with the Borel $\sigma$ -algebra). Moreover, for any such predictable map $g$ , we set, for any $p$ , $q\in\mathopen{]}0,\infty\mathclose{[}$ ,

[TABLE]

and

[TABLE]

where the infima are taken with respect to $\mathscr{P}\otimes\mathscr{Z}$ -measurable maps $g_{1}$ , $g_{2}$ only. One may actually show that $L^{p}(\Omega;L^{q}(\nu;H))$ as well as $\mathbb{G}^{p}$ are (quasi-)Banach space and that

[TABLE]

For a proof of this statement, as well as of other properties of such mixed-norm $L^{p}$ spaces involving random measures (even in a more general setting), we refer to [10]. For us, however, it is enough to know that they are quasi-normed spaces, and the “norms” just introduced on spaces where the underlying measure is random is only a convenient notation. We shall also need to consider spaces where $\mathopen{]}0,T\mathclose{]}\times Z$ is replaced by $\mathopen{]}t_{0},t_{1}\mathclose{]}\times Z$ , with $0\leq t_{0}\leq t_{1}\leq T$ , and the corresponding notation will be self-explanatory.

We shall use standard notation of stochastic calculus: we write, for instance, $f^{*}$ and $f_{-}$ to denote the maximal function and the left-limit function of a càdlàg function $f$ , respectively. Further notation related to deterministic and stochastic convolutions, as well as to different notions of derivative for maps between infinite-dimensional spaces, will be introduced where they first appear. For any $a,b>0$ , we use the notation $a\lesssim b$ to indicate that there exists a constant $c>0$ such that $a\leq cb$ . If $c$ depends on some further quantities that we need to keep track of we shall indicate them in a subscript. We use the classical notation $\wedge$ and $\vee$ for $\min$ and $\max$ , respectively.

2.2 Notions of derivative

Let $E$ , $F$ be Banach spaces, and $G$ be a subspace of $E$ . A function $\phi:E\to F$ is Gâteaux differentiable at $x_{0}\in E$ along $G$ if there exists a continuous linear map $L\in\mathscr{L}(G,F)$ such that

[TABLE]

The linear map $L$ , which is necessarily unique, will be denoted by $D_{\mathcal{G}}\phi(x_{0})$ and is called the Gâteaux derivative of $\phi$ at $x_{0}$ (along the subspace $G$ , if $G\neq E$ ). If $G=E$ and $\phi$ is also Lipschitz continuous with Lipschitz constant $L_{\phi}$ , it easily follows from the definition that $\lVert D_{\mathcal{G}}\phi(x_{0})\rVert_{\mathscr{L}(E,F)}\leq L_{\phi}$ : indeed, for all $h\in E$ we have

[TABLE]

The map $\phi$ is Fréchet differentiable at $x_{0}\in E$ along the subspace $G$ if there exists a continuous linear map $L\in\mathscr{L}(G,F)$ such that

[TABLE]

The (unique) map $L$ will be denoted by $D\phi(x_{0})$ and is called the Fréchet derivative of $\phi$ at $x_{0}$ (along the subspace $G$ , in case $G\neq E$ ). It is well known that Fréchet differentiability implies Gâteaux differentiability, while the converse is not true. We shall often use the following characterization of Fréchet differentiability, of which we include a proof for the convenience of the reader.

Lemma 2.1.

A map $\phi:E\to F$ is Fréchet differentiable at $x_{0}\in E$ with $D\phi(x_{0})=L$ if and only if for each bounded set $B\subset E$ one has

[TABLE]

uniformly with respect to $h\in B$ .

Proof.

Let $\phi$ be Fréchet differentiable at $x_{0}$ with $D\phi(x_{0})=L$ , and set $R(h):=\phi(x_{0}+h)-\phi(x_{0})-Lh$ . Then $R(h)/\lVert h\rVert\to 0$ as $h\to 0$ . Let $B$ be a bounded set and $M$ a real number such that $B$ is included in the ball of $E$ of radius $M$ centered at zero. For any $\eta>0$ there exists $\delta>0$ such that $\lVert R(h)\rVert/\lVert h\rVert\leq\eta/M$ for every $h$ with $\lVert h\rVert\leq\delta$ . Therefore, for any $\varepsilon$ such that $\lvert\varepsilon\rvert\leq\delta/M$ , one has $\lVert\varepsilon h\rVert\leq\delta$ and

[TABLE]

i.e. $\lVert R(\varepsilon h)\rVert/\lvert\varepsilon\rvert\to 0$ as $\varepsilon\to 0$ uniformly with respect to $h\in B$ . Let us now prove the converse implication: assume that (2.1) holds for every $B$ , uniformly with respect to $h\in B$ , and that, by contradiction, $\phi$ is not Fréchet differentiable at $x_{0}$ , i.e. that $R(h)/\lVert h\rVert$ does not converge to zero as $h\to 0$ . In particular, there exists a sequence $(k_{n})\subset E\setminus\{0\}$ converging to zero such that $R(k_{n})/\lVert k_{n}\rVert$ does not converge to zero. We claim that it cannot happen that

[TABLE]

as $\varepsilon\to 0$ . In fact, setting $\varepsilon_{n}:=\lVert k_{n}\rVert$ , $h_{n}=k_{n}/\lVert k_{n}\rVert$ , and $B:=(h_{n})$ , this would imply that $\varepsilon_{n}^{-1}\bigl{(}\phi(x_{0}+\varepsilon_{n}h_{n})-\varphi(x_{0})-\varepsilon_{n}Lh_{n}\bigr{)}$ converges to zero as $n\to\infty$ , which is equivalent to $R(k_{n})/\lVert k_{n}\rVert\to 0$ . ∎

By a simple scaling argument it is evident that it is sufficient to consider as bounded subset $B$ the unit ball in $E$ . One can thus say that $\phi:E\to F$ is Fréchet differentiable at $x_{0}\in E$ along a subspace $G\subseteq E$ if there exists a continuous linear map $L:G\to F$ such that

[TABLE]

For a comprehensive treatment of differential calculus for functions between topological vector spaces we refer to [1] for basic results in the case of Banach spaces, and to [3, 5] for the general case.

2.3 Estimates for deterministic and stochastic convolutions

Throughout this section $S$ stands for a strongly continuous linear semigroup of contractions on $H$ , and $-A$ for its generator. Clearly, $A$ is necessarily a linear maximal monotone operator.

Here and in the following we shall use $S\ast g$ to denote convolution of $S$ and an $H$ -valued measurable function $g$ on $\mathbb{R}_{+}$ , defined as

[TABLE]

under the minimal assumption that $S(t-\cdot)g\in L^{1}(0,t;H)$ for all $t$ in a set of interest, usually a bounded interval of $\mathbb{R}_{+}$ .

The following estimate for convolutions is trivial, but sufficient for our purposes.

Lemma 2.2.

For every $p>0$ and for every measurable adapted process $\phi:\Omega\times[0,T]\to H$ such that $\phi\in L^{p}(\Omega;L^{1}(0,T;H))$ , it holds that $S\ast\phi\in\mathbb{S}^{p}(0,T)$ and

[TABLE]

Proof.

Minkowski’s inequality and contractivity of $S$ immediately yield

[TABLE]

We shall also need estimates for stochastic convolutions with respect to the cylindrical Wiener process $W$ , for which we shall always use the following notation: for any $\mathscr{L}^{2}(K,H)$ -valued process $G$ , the stochastic convolution $S\diamond G$ is the process defined as

[TABLE]

under a stochastic integrability assumption on $S(t-\cdot)G$ . There is an extensive literature on maximal estimates for stochastic convolutions, mostly obtained through the so-called factorization method by Da Prato, Kwapień, and Zabczyk [8], which requires $-A$ to generate a holomorphic semigroup. The following estimate instead requires $A$ to be maximal monotone and can be proved by relatively elementary techniques of stochastic calculus (see, e.g., [13] for a proof in a more general context).

Proposition 2.3.

For every $p>0$ and for every $G\in L^{p}(\Omega;L^{2}(0,T;\mathscr{L}^{2}(K,H)))$ progressively measurable, the stochastic convolution $S\diamond G$ admits a modification in $\mathbb{S}^{p}(0,T)$ and

[TABLE]

Finally, a key role is played by the following maximal estimate for stochastic convolutions with respect to the compensated random measure $\bar{\mu}$ . For a predictable $H$ -valued process $g$ , the stochastic convolution of $g$ with respect to $\bar{\mu}$ will be denote by $S\diamond_{\mu}g$ and defined as

[TABLE]

under a stochastic integrability assumption on $S(t-\cdot)g$ with respect to $\bar{\mu}$ .

Lemma 2.4.

For every $p>0$ and for every $g\in\mathbb{G}^{p}$ , the stochastic convolution $S\diamond_{\mu}g$ admits a càdlàg modification and

[TABLE]

A proof can be found in [16]. A generalization of this inequality to $L^{q}$ -valued processes will appear in [14].

3 Well-posedness

This section is devoted to the proof of well-posedness of equation (1.1). We show existence and uniqueness of a mild solution, as well as its continuous dependence on the initial datum, in spaces of processes with finite moments of order $p\in\mathopen{]}0,+\infty\mathclose{[}$ . Although only the case $p\geq 1$ is needed in the following sections on differentiability of the solution with respect to the initial datum, the general case $p>0$ is necessary to deal with initial data or driving random measures admitting finite moments of order strictly less than one. An example is given by $\alpha$ -stable random measures with $\alpha<1$ .

The following assumptions (A0)–(A4) on the coefficients and the initial datum of (1.1) are in force throughout the paper.

(A0)

The initial datum $u_{0}$ is an $\mathscr{F}_{0}$ -measurable random variable with values in $H$ ;

(A1)

$A$ is a linear maximal monotone operator on $H$ , and $S$ is the strongly continuous semigroup of contractions generated by $-A$ on $H$ ;

(A2)

The function $f:\Omega\times[0,T]\times H\to H$ is such that $f(\cdot,\cdot,x)$ is measurable and adapted for every $x\in H$ , and there exists a constant $C_{f}>0$ such that

[TABLE]

for all $\omega\in\Omega$ , $t\in[0,T]$ , and $x,\,y\in H$ ;

(A3)

The function $B:\Omega\times[0,T]\times H\to\mathscr{L}^{2}(K,H)$ is such that $B(\cdot,\cdot,x)$ is progressively measurable for all $x\in H$ , and there exists a constant $C_{B}>0$ such that

[TABLE]

for all $\omega\in\Omega$ , $t\in[0,T]$ , and $x,\,y\in H$ ;

(A4)

The function $G:\Omega\times[0,T]\times Z\times H\to H$ is such that $G(\cdot,\cdot,\cdot,x)$ is $\mathscr{P}\otimes\mathscr{Z}$ -measurable for all $x\in H$ . Moreover,

(i)

if $p\leq 1$ or $p\geq 2$ , then there exists a $\mathscr{P}\otimes\mathscr{Z}$ -measurable function $g:\Omega\times[0,T]\times Z\to\mathbb{R}$ such that

[TABLE]

for all $\omega\in\Omega$ , $t\in[0,T]$ , $z\in Z$ and $x,\,y\in H$ ;

(ii)

if $1<p<2$ , then there exist functions $G_{1}$ , $G_{2}:\Omega\times[0,T]\times Z\times H\to H$ , satisfying the same measurability properties of $G$ , with $G=G_{1}+G_{2}$ , and $\mathscr{P}\otimes\mathscr{Z}$ -measurable functions $g_{1}$ , $g_{2}:\Omega\times[0,T]\times Z\to\mathbb{R}$ such that, for $j\in\{1,2\}$ ,

[TABLE]

for all $\omega\in\Omega$ , $t\in[0,T]$ , $z\in Z$ and $x,\,y\in H$ .

Further assumptions will be made when needed.

The concept of solution to (1.1) we shall work with is the following.

Definition 3.1.

An $H$ -valued adapted càdlàg process $u$ is a mild solution to (1.1) if

(i)

$S(t-\cdot)f(u)\in L^{1}(0,t;H)$ for all $t\in[0,T]$ $\mathbb{P}$ -a.s.;

(ii)

$S(t-\cdot)B(u)\in L^{2}(0,t;\mathscr{L}^{2}(K,H))$ for all $t\in[0,T]$ $\mathbb{P}$ -a.s.;

(iii)

there exists $p>0$ such that $S(t-\cdot)G(u_{-})\in\mathbb{G}_{p}(0,t)$ for all $t\in[0,T]$ ;

(iv)

one has

[TABLE]

as an identity in the sense of modifications.

In order to formulate the well-posedness result in the mild sense for (1.1), it is convenient to introduce an assumption depending on a parameter $p\in\mathopen{]}0,\infty\mathclose{[}$ :

(A5p)

Setting $g_{1}:=g_{2}:=g/2$ if $p\not\in\mathopen{]}1,2\mathclose{[}$ , there exists a continuous increasing function $\kappa:\mathbb{R}_{+}\to\mathbb{R}_{+}$ , with $\kappa(0)=0$ , such that

[TABLE]

Theorem 3.2.

Let $p>0$ and (A5p)* be satisfied. For any $u_{0}\in L^{p}(\Omega;H)$ , equation (1.1) admits a unique mild solution $u\in\mathbb{S}^{p}$ such that $\lVert u\rVert_{\mathbb{S}^{p}}\lesssim 1+\lVert u_{0}\rVert_{L^{p}(\Omega;H)}$ , with implicit constant independent of $u_{0}$ . Moreover, the solution map $u_{0}\mapsto u$ is Lipschitz continuous from $L^{p}(\Omega;H)$ to $\mathbb{S}^{p}$ .*

Proof.

We are going to use a fixed-point argument in the metric space $(\mathbb{S}^{p}(0,T_{0}),d_{p,0,T_{0}})$ , with $T_{0}$ sufficiently small. By a classical patching argument, this will imply existence and uniqueness of a solution in $\mathbb{S}^{p}(0,T)$ . Let $\Gamma$ be the map formally defined on $L^{p}(\Omega;H)\times\mathbb{S}^{p}$ as

[TABLE]

Let us show that $\Gamma$ is in fact well defined on $L^{p}(\Omega;H)\times\mathbb{S}^{p}$ and that its image is contained in $\mathbb{S}^{p}$ : one has

[TABLE]

where $\big{\lVert}S(\cdot)u_{0}\big{\rVert}_{\mathbb{S}^{p}}\leq\lVert u_{0}\rVert_{L^{p}(\Omega;H)}$ by contractivity of the semigroup $S$ ; the elementary lemma 2.2 and linear growth of $f$ imply

[TABLE]

similarly, proposition 2.3 yields

[TABLE]

finally, it follows by proposition 2.4 that $\big{\lVert}S\diamond_{\mu}G(u_{-})\big{\rVert}^{p}_{\mathbb{S}^{p}}\lesssim\big{\lVert}G(u_{-})\big{\rVert}^{p}_{\mathbb{G}^{p}}$ , where, if $p\in\mathopen{]}0,1]\cup[2,\infty[$ ,

[TABLE]

and, similarly, if $p\in\mathopen{]}1,2\mathclose{[}$ ,

[TABLE]

Analogous arguments show that that $\Gamma(u_{0},\cdot)$ is a contraction of $\mathbb{S}^{p}(0,T_{0})$ , with $T_{0}$ to be chosen later. In fact, one has, with a slightly simplified notation,

[TABLE]

Let us estimate the three terms separately. The Lipschitz continuity of $f$ , $B$ , and $G$ yields

[TABLE]

so that

[TABLE]

Since $\kappa$ is continuous with $\kappa(0)=0$ , it follows that there exists $T_{0}>0$ and a constant $\eta\in\mathopen{]}0,1\mathclose{[}$ , which depends on $T_{0}$ , such that

[TABLE]

hence, by the Banach-Caccioppoli contraction principle, for any $u_{0}\in L^{p}(\Omega;H)$ there exists a fixed point $u$ of the contraction $\Gamma(u_{0},\cdot)$ , which is thus the unique solution in $\mathbb{S}^{p}(0,T_{0})$ to (1.1). Choosing $T_{0}$ such that $T=nT_{0}$ , with $n\in\mathbb{N}$ , and repeating the same argument on each interval $[kT_{0},(k+1)T_{0}]$ , with $k\in\{1,\ldots,n-1\}$ , a unique solution to (1.1) can be constructed on the whole interval $[0,T]$ . Furthermore, for any $u_{0}\in L^{p}(\Omega;H)$ , by (3.1)-(3.5), the unique solution $u=\Gamma(u_{0},u)\in\mathbb{S}^{p}(0,T)$ satisfies

[TABLE]

where the implicit constant is independent of $T$ . Hence, there is $T_{0}\in(0,T)$ small enough such that

[TABLE]

Performing now a patching argument as above on $[0,T_{0}],\ldots,[(n-1)T_{0},T]$ yields the desired estimate

[TABLE]

The argument to show the Lipschitz-continuity of $u_{0}\mapsto u$ is similar: let $u_{01}$ , $u_{02}\in L^{p}(\Omega;H)$ , and $u_{1}$ , $u_{2}\in\mathbb{S}^{p}(0,T)$ be the unique solutions to (1.1) with initial datum $u_{01}$ and $u_{02}$ , respectively. Using a patching argument as above, it suffices to show that $u_{0}\mapsto u$ is Lipschitz continuous on $[0,T_{0}]$ . To this purpose, One has

[TABLE]

where $\eta<1$ is a positive constant (that depends on $T_{0}$ ). Rearranging terms and performing a patching argument as above immediately yields the Lipschitz continuity of $u_{0}\mapsto u$ . ∎

*Remark 3.3**.*

It immediately follows from the Lipschitz continuity of the solution map that one also has, in the same notation used above,

[TABLE]

with implicit constant depending on $T$ and $p$ .

4 Gâteaux differentiability of the solution map

In the previous section we have shown that the solution map $u_{0}\mapsto u$ is Lipschitz continuous from $L^{p}(\Omega;H)$ to $\mathbb{S}^{p}$ . We are now going to show that Gâteaux differentiability of the coefficients of (1.1) implies Gâteaux differentiability of the solution map. For some applications (e.g. to study Kolmogorov equations associated to stochastic PDEs) it is sufficient to consider non-random initial data and to consider first-order derivatives as linear maps from $H$ to $\mathbb{S}^{p}$ , i.e., roughly speaking, to consider only non-random directions of differentiability. However, the more general case of random initial data and random directions of differentiability considered here as well as in the next sections is conceptually not more difficult and, apart of being interesting in its own right because treated at the natural level of generality, it is necessary to study, for instance, higher-order stability issues of stochastic models with respect to perturbations of the initial datum.

We shall make the following additional assumption, which is assumed to hold throughout this section.

(G1)

The maps $f(\omega,t,\cdot)$ and $B(\omega,t,\cdot)$ are Gâteaux differentiable for all $(\omega,t)\in\Omega\times[0,T]$ , and the maps

[TABLE]

are Gâteaux differentiable for all $(\omega,t,z)\in\Omega\times[0,T]\times Z$ .

The Gâteaux derivatives of $f$ , $B$ and $G$ (in their $H$ -valued argument) are denoted by

[TABLE]

Recalling that $f$ and $B$ are Lipschitz continuous in their $H$ -valued argument, uniformly over $\Omega\times[0,T]$ , we infer that

[TABLE]

for all $\omega\in\Omega$ , $t\in[0,T]$ , and $x_{0}\in H$ . Similarly, the Lipschitz continuity of $G$ implies, if $p\not\in\mathopen{]}1,2\mathclose{[}$ , that

[TABLE]

and, if $p\in\mathopen{]}1,2\mathclose{[}$ , that

[TABLE]

for all $\omega\in\Omega$ , $t\in[0,T]$ , $z\in Z$ , and $x_{0}\in H$ .

We begin with two general results that will be extensively used in the sequel. The first lemma is an immediate corollary of the well-posedness results.

Lemma 4.1.

Under the assumptions of Theorem 3.2, let $u\in\mathbb{S}^{p}$ be the unique mild solution to (1.1) with initial condition $u_{0}\in L^{p}(\Omega;H)$ . For any $h\in L^{p}(\Omega;H)$ , the linear stochastic evolution equation

[TABLE]

admits a unique mild solution $y\in\mathbb{S}^{p}$ that depends continuously on the initial datum $h$ .

Proof.

The linear maps $D_{\mathcal{G}}f(u)$ and $D_{\mathcal{G}}B(u)$ are bounded, uniformly over $\Omega\times[0,T]$ , hence, a fortiori, Lipschitz continuous. Analogously, the linear map $D_{\mathcal{G}}G(u_{-})$ has norm (and, a fortiori, Lipschitz constant) bounded by $g_{1}+g_{2}$ (with $g_{1}:=g_{2}:=g/2$ if $p\not\in\mathopen{]}1,2\mathclose{[}$ ) on $\Omega\times[0,T]\times Z$ . Theorem 3.2 thus implies that, for any $h\in L^{p}(\Omega;H)$ , (4.1) admits a unique mild solution $y\in\mathbb{S}^{p}$ , which depends continuously on $h$ . ∎

Note that, since the equation for $y$ is linear, it is immediate that the map $h\mapsto y$ is linear and continuous from $L^{p}(\Omega;H)$ to $\mathbb{S}^{p}$ .

The next lemma will play a crucial role both in the proof of the Gâteaux differentiability of the solution map in this section, as well as in the proof of its Fréchet differentiability in the next section, taking into account Lemma 2.1.

Lemma 4.2.

Under the assumptions of Theorem 3.2, let $h\in L^{p}(\Omega;H)$ and $u$ , $u_{\varepsilon}\in\mathbb{S}^{p}$ the the unique mild solutions to (1.1) with initial conditions $u_{0}$ and $u_{0}+\varepsilon h$ , respectively. Moreover, let $y\in\mathbb{S}^{p}$ be the unique mild solution to (4.1) with initial condition $h$ . One has

[TABLE]

Proof.

Let $[t_{0},t_{1}]\subset[0,T]$ , and consider the evolution equation

[TABLE]

One easily sees that it admits a unique mild solution $v$ , which coincides with the restriction of $u$ to $[t_{0},t_{1}]$ . In particular, for any $t\geq t_{0}$ ,

[TABLE]

A completly analogous flow property holds for $u_{\varepsilon}$ and $y$ . Then one has, by the triangle inequality,

[TABLE]

where, by abuse of notation, the (deterministic and stochastic) convolutions are defined on $[t_{0},t_{1}]$ , in accordance to (4.2), and $u_{\varepsilon-}:=(u_{\varepsilon})_{-}$ . We are going to estimate $I_{1}$ , $I_{2}$ and $I_{3}$ separately. To simplify the notation, let us set, for a generic mapping $\phi$ ,

[TABLE]

(with obvious modifications if $u$ and $y$ are replaced by $u_{-}$ and $y_{-}$ ), and note that

[TABLE]

(the formal operators $Q_{1,\varepsilon}$ and $Q_{2,\varepsilon}$ clearly depend also on $y$ , but we do not need to explicitly denote this fact). Recalling the elementary estimate of Lemma 2.2, one has

[TABLE]

where, by the Lipschitz continuity of $f$ ,

[TABLE]

The terms $I_{2}$ and $I_{3}$ can be handled similarly, thanks to the maximal inequalities of §2.3:

[TABLE]

where

[TABLE]

and

[TABLE]

where

[TABLE]

Recalling that $\kappa$ is continuous with $\kappa(0)=0$ , these estimates imply that for every $\sigma>0$ there exists $\delta>0$ such that, for any $t_{0}<t_{1}$ with $t_{1}-t_{0}<\delta$ , one has

[TABLE]

Fixing then $\sigma$ sufficiently small and rearranging the terms yields

[TABLE]

where the implicit constant depends on $\delta$ and $I_{12}$ , $I_{22}$ , $I_{32}$ are “supported” on $[t_{0},t_{1}]$ . Let $t_{0}=0<t_{1}<\cdots<t_{N-1}<t_{N}=T$ be a subdivision of the interval $[0,T]$ such that $t_{n}-t_{n-1}<\delta$ for all $n$ . Then we have, for every $n\in\{1,\ldots,N\}$ , with obvious meaning of the notation,

[TABLE]

where

[TABLE]

Backward recursion thus yields

[TABLE]

where the first summand on the right-hand side is zero. To conclude the proof it suffices to show that

[TABLE]

for every $j\in\{1,2,3\}$ . We shall show that this is true for $I_{32}$ , as both other cases are entirely similar (in fact slightly simpler): it is enough to observe that, for any $\phi$ satisfying suitable measurability conditions and for any $q>0$ , the obvious inequality

[TABLE]

implies $\lVert\phi\rVert_{\mathbb{G}^{p}(t_{n-1},t)}\leq\lVert\phi\rVert_{\mathbb{G}^{p}(0,T)}$ , hence

[TABLE]

The main result of this section is the following. Note that, since the (standard) definition of Gâteaux derivative requires a Banach space framework, we shall confine ourself to the case $p\in\mathopen{[}1,+\infty\mathclose{[}$ .

Theorem 4.3.

Let $p\geq 1$ and (A5p)* be satisfied. Then the solution map of (1.1) is Gâteaux differentiable from $L^{p}(\Omega;H)$ to $\mathbb{S}^{p}$ , and its Gâteaux derivative at $u_{0}$ is $(h\mapsto y)\in\mathscr{L}(L^{p}(\Omega;H),\mathbb{S}^{p})$ , where $y$ is the unique mild solution to (4.1).*

Proof.

By Lemma 4.2, it is enough to show that

[TABLE]

converges to zero as $\varepsilon$ tends to zero. By assumption (G1) it immediately follows that, as $\varepsilon\to 0$ ,

[TABLE]

for a.a. $(\omega,t)\in\Omega\times[0,T]$ . Moreover, recalling that the operator norms of $D_{\mathcal{G}}f$ and $D_{\mathcal{G}}B$ are bounded by the Lipschitz constants of $f$ and $B$ , respectively, the triangle inequality yields

[TABLE]

for a.a. $(\omega,t)$ . Since $y\in\mathbb{S}^{p}$ , the right-hand side belongs to $L^{p}(\Omega;L^{1}(0,T))$ as well as to $L^{p}(\Omega;L^{2}(0,T))$ , hence the first two terms in (4.3) converge to zero as $\varepsilon\to 0$ by the dominated convergence theorem. Similarly, setting $G_{1}:=G_{2}:=G/2$ if $p\geq 2$ , one has

[TABLE]

where the implicit constant is equal to $1$ for $p\in[1,2\mathclose{[}$ , and to $2$ for $p\geq 2$ . Since

[TABLE]

as $\varepsilon\to 0$ , as well as

[TABLE]

for all $(t,z)\in[0,T]\times Z$ , $\mathbb{P}$ -almost surely, for both $j=1$ and $j=2$ , one has, thanks to (A5p) and the dominated convergence theorem, recalling that $y\in\mathbb{S}^{p}$ ,

[TABLE]

$\mathbb{P}$ -a.s. as $\varepsilon\to 0$ . A further application of the dominated convergence theorem hence yields that the third term in (4.3) converges to zero as $\varepsilon\to 0$ , thus completing the proof. ∎

5 Fréchet differentiability of the solution map

We are going to show that the Fréchet differentiability of the coefficients of (1.1) implies the Fréchet differentiability of the solution map. We shall work under the following assumption, that is assumed to hold throughout this section.

(F)

The maps $f(\omega,t,\cdot)$ and $B(\omega,t,\cdot)$ are Fréchet differentiable for all $(\omega,t)\in\Omega\times[0,T]$ , and the maps

[TABLE]

are Fréchet differentiable for all $(\omega,t,z)\in\Omega\times[0,T]\times Z$ .

The Fréchet derivatives of $f$ and $B$ (in their $H$ -valued argument), denoted by

[TABLE]

satisfy the boundedness properties

[TABLE]

for all $(\omega,t,x)\in\Omega\times[0,T]\times H$ (see § 2.2). Similarly, and in complete analogy to the previous section, the Lipschitz continuity assumptions on $G$ , $G_{1}$ and $G_{2}$ imply that,

[TABLE]

The main result of this section is the following theorem, which states that the solution map is Fréchet differentiable along subspaces of vectors with finite higher moments.

Theorem 5.1.

Let $q>p\geq 1$ . If (A5p)* and (A5q*)* hold, then the solution map of (1.1) is Fréchet differentiable from $L^{p}(\Omega;H)$ to $\mathbb{S}^{p}$ along $L^{q}(\Omega;H)$ and its Fréchet derivative at $u_{0}\in L^{p}(\Omega;H)$ is the map $h\mapsto y\in\mathscr{L}(L^{q}(\Omega;H),\mathbb{S}^{p})$ , where $y$ is the unique mild solution to the stochastic evolution equation*

[TABLE]

Proof.

For any $h\in L^{q}(\Omega;H)$ , equation (5.1) admits a unique mild solution $y\in\mathbb{S}^{q}$ , as it follows immediately by the boundedness properties of the Fréchet derivatives of $f$ , $B$ and $G$ , and by hypothesis (A5q). Therefore the map $h\mapsto y$ is well defined from $L^{q}(\Omega;H)$ to $\mathbb{S}^{q}$ , and it is obviously linear and continuous. To prove that this map is the Fréchet derivative of the solution map $u_{0}\mapsto u$ , thanks to the characterization of Fréchet differentiability of Lemma 2.1, it is enough to show that

[TABLE]

uniformly over $h$ belonging to bounded subsets of $L^{q}(\Omega;H)$ . By Lemma 4.2, for this it suffices to show that each term in (4.3) converges to zero uniformly with respect to $h$ belonging to the unit ball of $L^{q}(\Omega;H)$ . Since $h\mapsto y\in\mathscr{L}(L^{q}(\Omega;H),\mathbb{S}^{q})$ , it is evident that if $h$ belongs to $B_{1}(L^{q}(\Omega;H))$ then $y(h)$ belongs to $B_{R}(\mathbb{S}^{q})$ , where $R:=\lVert h\mapsto y\rVert_{\mathscr{L}(L^{q}(\Omega;H),\mathbb{S}^{q})}$ . Hence, denoting by $I_{j}$ , $j=1,2,3$ , the terms appearing in (4.3), by homogeneity

[TABLE]

Hence it suffices to show that $I_{1}$ , $I_{2}$ and $I_{3}$ converge to zero uniformly with respect to $y$ bounded in $\mathbb{S}^{q}$ . That is, we need to show that, for any $R>0$ and $\vartheta>0$ , there exists $\varepsilon_{0}=\varepsilon_{0}(R,\vartheta)$ such that $\lvert\varepsilon\rvert<\varepsilon_{0}$ implies $I_{j}(y)<\vartheta$ for all $y\in B_{R}(\mathbb{S}^{q})$ and $j\in\{1,2,3\}$ . For any measurable $E\subset\Omega$ , one clearly has

[TABLE]

where, by the Lipschitz continuity of $f$ ,

[TABLE]

The set $Y:=\{(y^{*}_{T})^{p}:\,y\in B_{R}(\mathbb{S}^{q})\}$ is bounded in $L^{q/p}(\Omega)$ , with $q>p$ , hence uniformly integrable on $(\Omega,\mathscr{F},\mathbb{P})$ . In particular, for any $\vartheta>0$ there exists $\sigma>0$ such that, for any $E\in\mathscr{F}$ with $\mathbb{P}(E)<\sigma$ , one has

[TABLE]

hence $I_{1}(E)\leq\vartheta$ . Let $y\in B_{R}(\mathbb{S}^{q})$ be arbitrary but fixed. Markov’s inequality yields, for any $n>0$ ,

[TABLE]

Therefore there exists $n>0$ such that, setting $E:=\{y^{*}_{T}>n\}$ , one has $I_{1}(E)<\vartheta$ . It is important to note that $n$ depends on $R$ , but not on $y$ , while $E$ depends on $y$ . The Fréchet differentiability hypothesis on $f$ amounts to saying that, for any $x\in H$ and $n\in\mathbb{N}$ ,

[TABLE]

In particular, one has

[TABLE]

for a.a. $(\omega,t)\in E^{c}\times[0,T]$ , where, by the Lipschitz continuity of $f$ ,

[TABLE]

for a.a. $(\omega,t)\in E^{c}\times[0,T]$ . Therefore, by the dominated convergence theorem,

[TABLE]

that is, for any $\vartheta>0$ there exists $\varepsilon_{1}$ depending only on $\vartheta$ and $n$ such that

[TABLE]

for all $\varepsilon$ such that $\lvert\varepsilon\rvert<\varepsilon_{1}(\vartheta,n)$ . It remains to observe that

[TABLE]

for a.a. $(\omega,t)\in E^{c}\times[0,T]$ to get that $I_{1}(E^{c})<\vartheta$ for all $\varepsilon$ such that $\lvert\varepsilon\rvert<\varepsilon_{1}(\vartheta,n)$ . Since $n$ depends only on $R$ , we conclude that there exists $\varepsilon_{1}=\varepsilon_{1}(\vartheta,R)$ such that $I_{1}<2\vartheta$ for all $\lvert\varepsilon\rvert<\varepsilon_{1}(\vartheta,R)$ .

Let us now consider the term $I_{2}$ : the argument is similar to the one just carried out, so we provide slightly less detail. We have to show that $I_{2}$ converges to [math] uniformly with respect to $y\in B_{R}(\mathbb{S}^{q})$ . For any measurable $E\subset\Omega$ , one has, with obvious meaning of the notation,

[TABLE]

where, by the Lipschitz-continuity of $B$ ,

[TABLE]

Choosing $E$ as before, using the uniform integrability of the family $Y$ combined with the Markov inequality, we infer that for any $\vartheta>0$ there exists $n>0$ such that $I_{2}(E)<\vartheta$ . The Fréchet differentiability of $B$ implies that, for any $x\in H$ ,

[TABLE]

in $E^{c}\times[0,T]$ , where, by the Lipschitz continuity of $B$ ,

[TABLE]

for a.a. $(\omega,t)\in E^{c}\times[0,T]$ . Hence, the dominated convergence theorem yields

[TABLE]

that is, for any $\vartheta>0$ there exists $\varepsilon_{2}$ depending only on $\vartheta$ and $n$ such that

[TABLE]

for all $\varepsilon$ such that $\lvert\varepsilon\rvert<\varepsilon_{2}(\vartheta,n)$ , from which also $I_{2}(E^{c})<\vartheta$ for all $\varepsilon$ such that $\lvert\varepsilon\rvert<\varepsilon_{2}(\vartheta,n)$ . Hence, there exists $\varepsilon_{2}=\varepsilon_{2}(\vartheta,R)$ such that $I_{2}<2\vartheta$ for all $\varepsilon$ with $\lvert\varepsilon\rvert<\varepsilon_{2}(\vartheta,R)$ .

The convergence to zero of $I_{3}$ as $\varepsilon\to 0$ , uniformly with respect to $y\in B_{R}(\mathbb{S}^{q})$ , while still similar to the above arguments, is slightly more delicate as random measures are involved. As already shown in the proof of Theorem 4.3, one has, recalling that Fréchet differentiability implies Gâteaux differentiability,

[TABLE]

as $\varepsilon\to 0$ . We need to show that the convergence holds uniformly over $y$ bounded in $\mathbb{S}^{q}$ . Let $R>0$ and $y\in B_{R}(\mathbb{S}^{q})$ . For any measurable $E\in\mathscr{F}$ , the Lipschitz continuity assumptions on $G$ and (A5p) imply, setting $G_{1}:=G_{2}:=G$ if $p\geq 2$ , that

[TABLE]

As the set $\{(y^{*}_{T})^{p}:\,y\in B_{R}(\mathbb{S}^{q})\}$ is bounded in $L^{q/p}(\Omega)$ , hence uniformly integrable, for any $\vartheta>0$ there exists $n>0$ (by Markov’s inequality) such that, choosing $E:=\{y^{*}>n\}$ as before, we have

[TABLE]

On $E^{c}$ one has, possibly outside a set of $\mathbb{P}$ -measure zero, for both $j=1$ and $j=2$ ,

[TABLE]

where the right-hand side converges to zero by the characterization of Fréchet differentiability of Lemma 2.1, and is bounded by $2ng_{j}$ for all $(t,z)\in[0,T]\times Z$ . Since $g_{1}\in L^{p}(\nu)$ and $g_{2}\in L^{2}(\nu)$ $\mathbb{P}$ -a.s. in $E^{c}$ , the dominated convergence theorem and (A5p) yield

[TABLE]

as $\varepsilon\to 0$ , uniformly with respect to $y\in B_{R}(\mathbb{S}^{q})$ . Proceeding exactly as in the case of $I_{1}$ , we conclude that there exists $\varepsilon_{3}=\varepsilon_{3}(\vartheta,R)$ such that $I_{3}<2\vartheta$ for all $\lvert\varepsilon\rvert<\varepsilon_{3}$ .

We have thus shown that $\varepsilon^{-1}(u_{\varepsilon}-u-\varepsilon y)\to 0$ in $\mathbb{S}^{p}(0,T)$ , uniformly over $h$ in any bounded subset of $L^{q}(\Omega;H)$ , as claimed. ∎

6 Fréchet differentiability of higher order

In this section we show that the $n$ -th order Fréchet differentiability of the coefficients of (1.1), in a suitable sense, implies the $n$ -th order Fréchet differentiability of the solution map. We shall work under the following assumptions, that are stated in terms of the parameter $n\in\mathbb{N}$ , $n\geq 2$ :

(Fn)

The maps $f(\omega,t,\cdot)$ and $B(\omega,t,\cdot)$ are $n$ times Fréchet differentiable for all $(\omega,t)\in\Omega\times[0,T]$ , and the maps $G(\omega,t,z,\cdot)$ , $G_{i}(\omega,t,z,\cdot)$ , $i=1,2$ , are $n$ times Fréchet differentiable for all $(\omega,t,z)\in\Omega\times[0,T]\times Z$ . Moreover, there exists a constant $m\geq 0$ such that, for every $j=2,\ldots,n$ ,

[TABLE]

for all $(\omega,t,x)\in\Omega\times[0,T]\times H$ , and

[TABLE]

We also stipulate that (F1) is simply hypothesis (F) of the previous section. It would be possible to replace the functions $g$ , $g_{1}$ and $g_{2}$ with different ones, thus reaching a bit more generality, but it does not seem to be worth the (mostly notational) effort.

*Example 6.1**.*

Let us give an explicit example where assumption (Fn) is satisfied with a suitable choice of $m>0$ and not for $m=0$ . We shall consider $B=G=0$ for simplicity and concentrate only on $f$ : typical examples for $B$ and $G$ can be produced following the same argument. Let $H=L^{2}(D)$ , where $D\subset\mathbb{R}^{d}$ is a smooth bounded domain, and consider the function

[TABLE]

It is not difficult to check that $\gamma\in C^{\infty}(\mathbb{R})$ , $\gamma$ is Lipschitz-continuous (hence $\gamma^{\prime}\in C_{b}(\mathbb{R})$ ), and

[TABLE]

However, the derivatives $\gamma^{(j)}$ are not bounded in $\mathbb{R}$ for any $j\geq 2$ . Furthermore, let us fix $\mathcal{L}\in\mathscr{L}(H,L^{\infty}(D))$ , and define the operator

[TABLE]

Clearly, $f$ is well-defined, Lipschitz-continuous and linearly bounded, so that (A2) is satisfied. Moreover, using the fact that $\mathcal{L}\in\mathscr{L}(H,L^{\infty}(D))$ it a standard matter to check that $f$ is Fréchet-differentiable, and its derivative is given by

[TABLE]

so that also assumption (F) is satisfied. Note in particular that the first derivative $Df$ is also bounded in $H$ thanks to the Lipschitz-continuity of $f$ . Furthermore, using the fact that $\mathcal{L}\in\mathscr{L}(H,L^{\infty}(D))$ a direct computation shows that for every $j\in\mathbb{N}$ , with $j>1$ , $f$ is Fréchet-differentiable $j$ -times and

[TABLE]

For every $j>1$ , by the Hölder inequality and the properties of $\gamma$ and $\mathcal{L}$ we have that

[TABLE]

so that assumption (Fn) is satisfied for every $n$ with the choice $m=n-1$ . However, note that the higher-order derivatives of $f$ are not bounded in $H$ because of the choice of the function $\gamma$ : hence, coefficients $f$ in this form cannot be treated using available results in literature (as for example [15]). On the other hand, these are nonetheless included in our analysis.

In the following we shall write, for compactness of notation, $\mathbb{L}^{q}$ in place of $L^{q}(\Omega;H)$ . If $u$ (identified with the solution map $u_{0}\mapsto u:\mathbb{L}^{p}\to\mathbb{S}^{p}$ , which is well defined if assumption (A5p) holds) is $n$ times Fréchet differentiable along $\mathbb{L}^{q_{1}},\ldots,\mathbb{L}^{q_{n}}$ , we have

[TABLE]

Under the assumptions of Theorem 5.1, $u$ is once Fréchet differentiable and $v:=Du(u_{0})$ satisfies the equation

[TABLE]

where $I$ is the identity map. This equation has to be interpreted in the sense that, for any $h\in\mathbb{L}^{q}$ , $q>p$ , setting $y:=[Du(u_{0})]h$ , one has

[TABLE]

Note that by Lemma 4.1 this equation admits a unique solution $y\in\mathbb{S}^{p}$ also for $h\in\mathbb{L}^{p}$ , and that $h\mapsto y\in\mathscr{L}(\mathbb{L}^{p},\mathbb{S}^{p})$ . However, if $h$ belongs only to $\mathbb{L}^{p}$ , we can no longer claim that $h\mapsto y$ is the Fréchet derivative of $u_{0}\mapsto u$ , as Theorem 5.1 does not necessarily apply.

We are now going to introduce a system of equations, indexed by $n\geq 2$ , that are formally expected to be satisfied by $D^{j}u(u_{0})$ , $j=1,\ldots,n$ , if they exist. For any $n\geq 2$ , the equation for $u^{(n)}$ can be written as

[TABLE]

where $\Psi_{n}$ , $\Phi_{n}$ and $\Theta_{n}$ are the formal $n$ -th Fréchet derivatives of $f(u)$ , $B(u)$ and $G(u_{-})$ , respectively, excluding the terms involving the (formal) derivative of $u$ of order $n$ . More precisely, assume that $E_{1}$ , $E_{2}$ and $E_{3}$ are Banach spaces and $\phi:E_{1}\to E_{2}$ , $F:E_{2}\to E_{3}$ are $n$ times Fréchet differentiable. The chain rule implies that there exists a function $\tilde{\Phi}^{F}_{n}$ such that

[TABLE]

We set $\Phi_{n}:=\tilde{\Phi}^{B}_{n}\bigl{(}{u}^{(1)},{u}^{(2)},\ldots,{u}^{(n-1)}\bigr{)}$ . The definition of $\Psi_{n}$ and $\Theta_{n}$ is, mutatis mutandis, identical.

The concept of solution for equation (6.1) is intended as in the case of the first order derivative equation, i.e. in the sense of testing against arbitrary directions. More precisely, we shall say that

[TABLE]

is a solution to (6.1) if, for any

[TABLE]

the process ${u}^{(n)}(h_{1},\ldots,h_{n})\in\mathbb{S}^{p}$ satisfies

[TABLE]

Let us show some properties of the coefficients $\Psi_{n}$ , $\Phi_{n}$ and $\Theta_{n}$ . We are going to use some algebraic properties of the “representing” map $\tilde{\Phi}^{F}_{n}$ . In particular, although a (kind of) explicit expression for $\tilde{\Phi}_{n}^{F}$ can be written in terms of a variant of the Faà di Bruno formula (as it was done for example in [2]), for our purposes it suffices to know that $\tilde{\Phi}_{n}^{F}$ is a sum of terms of the form

[TABLE]

with $j\in\{2,\ldots,n\}$ , $\alpha_{1}+\cdots+\alpha_{j}=n$ , $\alpha_{i}\geq 1$ for all $i\in\{1,\ldots,j\}$ . Moreover, since $D^{n}[F(\phi)]$ is an $n$ -linear map on $E_{1}^{n}$ with values in $E_{3}$ (with $E_{1}^{n}$ being the cartesian product of $E_{1}$ by itself $n$ -times), one has that, for any $(h_{1},\ldots,h_{n})\in E_{1}^{n}$ , $D^{n}[F(\phi)](h_{1},\ldots,h_{n})$ is a sum of terms of the form

[TABLE]

where $A_{j}:=\alpha_{1}+\cdots+\alpha_{j-1}$ , and $\sigma$ is an element of the permutation group of $\{1,\ldots,n\}$ . We shall also need the following identities, that we write already in the specific form needed later, although they are obviously a consequence of the definition of $\tilde{\Phi}^{F}_{n}$ :

[TABLE]

where we have written, as customary, $u^{\prime}$ in place of ${u}^{(1)}$ . We are going to write, for the convenience of the reader, the first three formal derivatives of $B(u)$ and the expressions for $\Phi_{n}$ (the corresponding calculations for $f(u)$ , $G(u_{-})$ , $\Psi_{n}$ , and $\Theta_{n}$ are entirely analogous). One has

[TABLE]

where we have used Schwarz’s theorem on the symmetry of higher-order continuous Fréchet derivatives.

The first result that we present concerns the existence and uniqueness of solutions to equation (6.1) in the sense specified above. More precisely, we show in the next proposition that equation (6.1) admits a unique solution $u^{(n)}$ , belonging to $\mathscr{L}_{n}\bigl{(}\mathbb{L}^{p_{1}},\ldots,\mathbb{L}^{p_{n}};\mathbb{S}^{p}\bigr{)}$ . Note that to study differentiability we shall restrict to the case $p_{1}=\cdots=p_{n}$ (see Remark 6.3 below). However, since well-posedness for linear stochastic equations for multilinear maps such as (6.1) could be interesting in its own right, we shall provide a general result considering arbitrary $p_{1},\ldots,p_{n}$ .

Proposition 6.2.

Let $n\geq 1$ and $p,p_{0},p_{1},\ldots,p_{n}\geq 1$ be such that $u_{0}\in\mathbb{L}^{p}\cap\mathbb{L}^{mp_{0}}=\mathbb{L}^{p\vee mp_{0}}$ and

[TABLE]

Assume that

(i)

hypothesis **(Fn*)** is satsfied;*

(ii)

hypothesis **(A5r*)** holds for all $r\in[p,\max_{i\geq 1}p_{i}]\cup\{mp_{0}\}$ .*

Then (6.1) admits a unique solution

[TABLE]

Proof.

First of all, let us explain why ${u}^{(n)}$ , if it exists, must be $n$ -linear (in the algebraic sense). Since $u^{\prime}=Du$ is indeed a linear map, we can use induction as follows: assuming that ${u}^{(j)}$ is $j$ -linear for all $j<k$ , with $k\in\{2,\ldots,n\}$ , we are going to show that ${u}^{(k)}$ is $k$ -linear. The inductive assumption and the functional form of $\Psi_{k}$ , $\Phi_{k}$ , and $\Theta_{k}$ imply that they are $k$ -linear. Considering the equation

[TABLE]

assuming that a solution exists for every $(h_{1},\ldots,h_{k})\in\mathbb{L}^{q_{1}}\times\cdots\times\mathbb{L}^{q_{k}}$ , $q_{1},\ldots,q_{k}\geq 1$ , it suffices to show that the map $(h_{1},\ldots,h_{k})\mapsto v$ is $k$ -linear, which is immediate.

Let us focus now on existence. We are going to reason by induction on the order of (formal) derivation $k\in\{1,\ldots,n\}$ . The claim is certainly true for $k=1$ : Theorem 4.3 implies, thanks for assumption (ii), that $u^{\prime}\in\mathscr{L}(\mathbb{L}^{r},\mathbb{S}^{r})$ for every $r\in[p,\max_{i\geq 1}p_{i}]$ , hence also $u^{\prime}\in\mathscr{L}(\mathbb{L}^{s},\mathbb{S}^{r})$ for every $s\geq r$ , as then $\mathbb{L}^{s}$ is contractively embedded in $\mathbb{L}^{r}$ . Let us now assume the the claim is true for all $j\leq k\in\{1,\ldots,n-1\}$ , and consider $h_{j}\in\mathbb{L}^{p_{j}}$ with $p_{j}\geq p$ , for $j=1,\ldots,k+1$ , such that

[TABLE]

In order to control the $\mathbb{S}^{p}$ norm of ${u}^{(k+1)}(h_{1},\ldots,h_{k+1})$ it is enough to estimate

[TABLE]

In fact, recalling that $Df(u)$ , $DB(u)$ and $DG(u)$ are bounded linear operators (in the same sense as in the proofs of Theorems 4.3 and 5.1), one has, for any $[t_{0},t_{1}]\subseteq[0,T]$ , omitting the indication of the arguments $(h_{j})$ for simplicity of notation,

[TABLE]

where the implicit constant does not depend on $t_{1}-t_{0}$ (and also not on $k$ ). We proceed now as in the proof of Lemma 4.2: choosing $T_{0}>0$ sufficiently small and partitioning $[0,T]$ in intervals of lenght not exceeding $T_{0}$ , it follows from ${u}^{(k+1)}(0)=0$ that

[TABLE]

as claimed. Let us consider the second term on the right-hand side of the previous inequality (the first one can be handled in a completely similar way). As already seen, the generic term in $\Phi_{k+1}(h_{1},\ldots,h_{k+1})$ is of the form

[TABLE]

where $j\in\{2,\ldots,k+1\}$ , $\alpha_{1}+\cdots+\alpha_{j}=k+1$ , $\beta:=\alpha_{1}+\cdots+\alpha_{j-1}$ , and $\sigma$ is an element of the permutation group of $\{1,\ldots,k+1\}$ . Since $j\geq 2$ implies

[TABLE]

one has

[TABLE]

so that setting

[TABLE]

it holds

[TABLE]

Assumption (Fn) now implies

[TABLE]

which yields, thanks to the estimate $\lVert\cdot\rVert_{L^{2}(0,T)}\leq T^{1/2}\lVert\cdot\rVert_{L^{\infty}(0,T)}$ ,

[TABLE]

where the implicit constant depends also on $T$ . Here and in the following we write, for simplicity of notation, $\phi^{*}:=\phi^{*}_{T}$ for any càdlàg function $\phi$ . Hölder’s inequality yields

[TABLE]

where, as before, $\beta:=\alpha_{1}+\cdots+\alpha_{j-1}$ . It follows by the definition of $\tilde{p}_{1},\ldots,\tilde{p}_{j}$ and the inductive assumption that

[TABLE]

hence, recalling that $\big{\lVert}u^{*m}\big{\rVert}_{\mathbb{L}^{p_{0}}}=\big{\lVert}u\big{\rVert}^{m}_{\mathbb{S}^{mp_{0}}}\lesssim 1+\big{\lVert}u_{0}\big{\rVert}^{m}_{\mathbb{L}^{mp_{0}}}$ by Theorem 3.2,

[TABLE]

Estimating the $\mathbb{G}^{p}$ norm of $\Theta_{k+1}$ is similar: using the same notation used thus far, the generic term in $\Theta_{k+1}(h_{1},\ldots,h_{k+1})$ is of the type

[TABLE]

and hypothesis (Fn) implies

[TABLE]

for all $(t,z)\in[0,T]\times Z$ , $\mathbb{P}$ -a.s., for both $i=1$ and $i=2$ (we can identify again $g_{1}$ and $g_{2}$ with $g$ depending on the value of $p$ , and similarly for $G_{1}$ and $G_{2}$ ). This yields, after standard computations already detailed more than once,

[TABLE]

It hence follows by the inductive assumption, as before, that

[TABLE]

Since $p_{1},\ldots,p_{k+1}$ were arbitrary, we have proved that $k/p_{0}+\sum_{j=1}^{k+1}1/p_{j}\leq 1/p$ implies

[TABLE]

thus completing the induction argument by arbitrariness of $k$ . ∎

*Remark 6.3**.*

If $p_{1}=\cdots=p_{n}=q$ , condition (6.3) becomes

[TABLE]

which implies $q\geq np$ and $p_{0}\geq(n-1)p$ , hence $p_{0}\geq p$ if $n\geq 2$ . In particular, if $q=np$ , then $p_{0}=+\infty$ , i.e. $u_{0}$ must be bounded almost surely. If $q>np$ , then $p_{0}$ will also be finite, and strictly larger than $p$ if $n\geq 2$ . Furthermore, if $q>(n+nm-m)p$ , then $u_{0}\in\mathbb{L}^{q}$ implies ${u}^{(n)}\in\mathscr{L}_{n}(\mathbb{L}^{q};\mathbb{S}^{p})$ . In fact, for this to be true it suffices that $\mathbb{L}^{q}\subseteq\mathbb{L}^{mp_{0}\vee p}$ , which is equivalent to $q\geq mp_{0}\vee p$ . But since $q\geq np\geq p$ , we can simply choose $q=mp_{0}$ , which yields, excluding the case $p_{0}=+\infty$ ,

[TABLE]

or, equivalently, $q>(n+nm-m)p$ .

We repeat, however, that even under these conditions we cannot yet claim that ${u}^{(n)}$ identifies the $n$ -th Fréchet derivative of $u$ . In fact, we shall prove that $D^{n}u$ satisfies the equation for ${u}^{(n)}$ when “tested” on $(\mathbb{L}^{q})^{n}$ , with $q$ satisfying a strictly stronger constraint than just $q>(n+mn-m)p$ .

Before considering Fréchet differentiability of $n$ -th order, we need some preparations. The following two lemmata are used to apply the theorem on the Fréchet differentiability of the composition of two Fréchet differentiable functions.

By the assumptions (A2), (A3) and (A4), it follows immediately that the superposition operators associated to $f$ , $B$ and $G$ on $\mathbb{S}^{p}$ , i.e. $\phi\mapsto f(\phi),B(\phi),G(\phi_{-})$ , can be considered as maps, denoted by the same symbols for simplicity,

[TABLE]

Lemma 6.4.

Let $p\geq 1$ , $r>0$ , $q\geq 1$ , and $n\in\mathbb{N}$ satisfy

[TABLE]

If hypothesis (Fn)* is satisfied, then $f$ , $B$ and $G$ are $n$ -times Fréchet differentiable in $\mathbb{S}^{mr}\cap\mathbb{S}^{p}=\mathbb{S}^{mr\vee p}$ along $\mathbb{S}^{q}$ , with*

[TABLE]

for all $j\in\{1,\ldots,n\}$ .

Proof.

We proceed by induction on $j$ , and we treat only the third term, as all other cases are analogous (in fact slightly simpler). If $j=1$ , the proof is exactly the same as the corresponding one of Theorem 5.1. In particular, one has

[TABLE]

hence, given $v\in\mathbb{S}^{p}$ and $w\in\mathbb{S}^{q}$ with $q>1\cdot p=p$ ,

[TABLE]

as $\varepsilon\to 0$ , uniformly over $w$ belonging to bounded subsets of $\mathbb{S}^{q}$ . Assuming now that the statement is true for $j\in\{1,\ldots,n-1\}$ , let us show that it also holds for $j+1$ . By the inductive hypothesis we thus have

[TABLE]

Let $u\in\mathbb{S}^{p}$ and $v_{1},\ldots,v_{j+1}\in\mathbb{S}^{q}$ . The $(j+1)$ -th Fréchet derivatives

[TABLE]

exists for all $(\omega,t,z)\in\Omega\times[0,T]\times Z$ , hence, setting $\mathbf{v}_{k}:=(v_{1},\ldots,v_{k})$ , $k=1,\ldots,n$ , one has, as $\varepsilon\to 0$ ,

[TABLE]

for all $(\omega,t)\in[0,T]\times Z$ , $\mathbb{P}$ -a.s., uniformly with respect to $v_{j+1}$ in bounded sets of $H$ . For any $h\in H$ , the fundamental theorem of calculus yields

[TABLE]

hence, since $h$ is arbitrary,

[TABLE]

for any $\lvert\varepsilon\rvert\leq 1$ , where, as already done before, $g_{1}:=g_{2}:=g/2$ if $p\geq 2$ . The left-hand side of (6.5) is thus dominated for all $(t,z)\in[0,T]\times Z$ , $\mathbb{P}$ -a.s., modulo a constant, by the same expression appearing on the right-hand side of the previous inequality. This implies

[TABLE]

where, by Hölder’s inequality,

[TABLE]

In fact, these three inequalities follow from

[TABLE]

respectively, all of which are immediate consequences of the assumptions. The dominated convergence theorem thus yields

[TABLE]

as $\varepsilon\to 0$ . It remains to show that the convergence is uniform with respect to $v_{1},\ldots,v_{j+1}$ bounded in $\mathbb{S}^{q}$ . To this end, we proceed as in the case $j=1$ : for every measurable $E\in\mathscr{F}$ , the computations just carried out yield

[TABLE]

where the implicit constant depends on $\kappa(T)$ . Since $v_{1},\ldots,v_{k+1}$ are bounded in $\mathbb{S}^{q}$ and $k+1\leq n$ , the product $v_{1}^{*}\,\cdots\,v_{k+1}^{*}$ is bounded in $\mathbb{L}^{q/n}$ . Therefore, as $q/n>p$ by assumption, it follows that $\bigl{(}v_{1}^{*}\,\cdots\,v_{k+1}^{*}\bigr{)}^{p}$ is uniformly integrable. Similarly, defining $s$ by

[TABLE]

Hölder’s inequality yields

[TABLE]

where the right-hand side is finite by assumption. Since $s>p$ , $\bigl{(}u^{*m}v_{1}^{*}\,\cdots\,v_{k+1}^{*}\bigr{)}^{p}$ is uniformly integrable. Finally, defining $\ell$ by

[TABLE]

Hölder’s inequality yields, recalling that $j\leq n-1$ ,

[TABLE]

hence $\bigl{(}v_{1}^{*}\,\cdots\,v_{j}^{*}v_{j+1}^{*(m+1)}\bigr{)}^{p}$ is also uniformly integrable. One can now choose the set $E$ and proceed exactly as in the proof of Theorem 5.1 for the case $j=1$ to conclude. ∎

The previous lemma implies, in particular, that

[TABLE]

are $n$ times Fréchet differentiable for every $q>(m+n)p$ . Indeed, for any such $q$ , one has $\frac{m}{q}+\frac{n}{q}=\frac{m+n}{q}<\frac{1}{p}$ , implying in particular that $\frac{1}{p}-\frac{n}{q}\in(0,1)$ . Setting now $\frac{1}{r}:=(\frac{m}{q})\vee\frac{1}{2}(\frac{1}{p}-\frac{n}{q})$ , one has $r>1$ , $\frac{1}{r}+\frac{n}{q}<\frac{1}{p}$ , and $\mathbb{S}^{q}\subseteq\mathbb{S}^{p\vee mr}$ .

In fact, if $\frac{m}{q}>\frac{1}{2}(\frac{1}{p}-\frac{n}{q})$ one has $\frac{1}{r}=\frac{m}{q}$ , from which $\frac{1}{r}+\frac{n}{q}=\frac{m}{q}+\frac{n}{q}<\frac{1}{p}$ and $q=mr>p$ , hence $\mathbb{S}^{q}\subset\mathbb{S}^{p\vee mr}$ . If $\frac{m}{q}\leq\frac{1}{2}(\frac{1}{p}-\frac{n}{q})$ one has $\frac{1}{r}=\frac{1}{2}(\frac{1}{p}-\frac{n}{q})<\frac{1}{p}-\frac{n}{q}$ , from which $\frac{1}{r}+\frac{n}{q}<\frac{1}{p}$ , and $q\geq mr$ , hence $\mathbb{S}^{q}\subseteq\mathbb{S}^{p\vee mr}$ . The assertion follows then from Lemma 6.4.

We can now state the main result of this section, as well as of the whole paper.

Theorem 6.5.

Let $n\geq 1$ ,

[TABLE]

Assume (Fn)* and (A5r*)* for all $r\in[p,q]$ . Then the solution map $(u_{0}\mapsto u):\mathbb{L}^{q}\to\mathbb{S}^{p}$ is $n$ times Fréchet differentiable. Moreover, $D^{n}u(u_{0})\in\mathscr{L}_{n}(\mathbb{L}^{q};\mathbb{S}^{p})$ is the unique mild solution ${u}^{(n)}$ to*

[TABLE]

Note that this equation is nothing else than (6.1), and must be interpreted as the latter, i.e. in the sense of testing against an $n$ -tuple of vectors in $\mathbb{L}^{q}$ . Moreover, the initial condition of the equation is the identity map if $n=1$ , and zero if $n\geq 2$ .

Proof.

We shall assume, for simplicity, that $f=B=0$ , as the argument in the general case $f\neq 0$ , $B\neq 0$ is entirely analogous. We are going to argue by induction on $\ell\in\{1,\ldots,n\}$ . The statement is true for $\ell=1$ by Theorem 5.1. Now we assume that the statement is true for all $j\leq\ell-1$ , $\ell\in\{2,\ldots,n\}$ , and we prove it for $\ell$ . Let $k\in\mathbb{L}^{q}$ , with $q>\frac{(m+\ell)!}{(m+1)!}p=(m+\ell)\ldots(m+2)p$ . Thanks to Proposition 6.2 and the remarks following its proof, the equation

[TABLE]

admits a unique mild solution ${u}^{(\ell)}\in\mathscr{L}_{\ell}(\mathbb{L}^{q};\mathbb{S}^{p})$ , because

[TABLE]

We are going to show that ${u}^{(\ell)}=D^{\ell}u(u_{0})$ in $\mathscr{L}_{\ell}(\mathbb{L}^{q};\mathbb{S}^{p})$ . Let $k\in\mathbb{L}^{q}$ : for brevity, we shall use the notation ${u}^{(\ell)}(u_{0})k:={u}^{(\ell)}(u_{0})(k,\cdot,\ldots,\cdot)\in\mathscr{L}_{\ell-1}(\mathbb{L}^{p},\mathbb{S}^{p})$ and $\Theta_{\ell}(u_{0})k:=\Theta_{\ell}(u_{0})(k,\cdot,\ldots,\cdot)\in\mathscr{L}_{\ell-1}(\mathbb{L}^{p},\mathbb{S}^{p})$ . One has

[TABLE]

where, by the inductive hypothesis, ${u}^{(\ell-1)}(u_{0})=D^{\ell-1}u(u_{0})$ and ${u}^{(\ell-1)}(u_{0}+\varepsilon k)=D^{\ell-1}u(u_{0}+\varepsilon k)$ in $\mathscr{L}_{\ell-1}(\mathbb{L}^{q};\mathbb{S}^{p})$ . We need to prove that the left-hand side of (6.6) converges to zero as $\varepsilon\to 0$ in $\mathscr{L}_{\ell-1}(\mathbb{L}^{q},\mathbb{S}^{p})$ uniformly over $k$ belonging to bounded sets of $\mathbb{L}^{q}$ . Thanks to (6.2), one has

[TABLE]

and we claim that

[TABLE]

in $\mathscr{L}_{\ell-1}(\mathbb{L}^{q};\mathbb{G}^{p})$ as $\varepsilon\to 0$ , uniformly over $k$ belonging to bounded subsets of $\mathbb{L}^{q}$ . In fact, all terms in $\Theta_{\ell-1}$ are of the form

[TABLE]

with $j\leq\ell-1$ and $\alpha_{i}\geq 1$ , $\sum\alpha_{i}=\ell-1$ . Now, let $r>(\ell+m)p$ be such that $(u_{0}\mapsto u):\mathbb{L}^{r}\to\mathbb{S}^{r}$ (which is possible because $q>(\ell+m)p$ ). Then $G:\mathbb{S}^{r}\to\mathbb{G}^{p}$ is $n$ times Fréchet differentiable by Lemma 6.4. Moreover, by the inductive hypothesis applied to $(u_{0}\mapsto u)\in(\mathbb{L}^{r}\to\mathbb{S}^{r})$ , we have that $u_{0}\mapsto u$ is $\ell-1$ times Fréchet differentiable along $\mathbb{L}^{q}$ if

[TABLE]

Therefore, if $q$ satisfies this condition, each term of the form (6.8) is Fréchet differentiable along $\mathbb{L}^{q}$ by the theorem on the Fréchet differentiability of composite functions (see for example [1, Prop. 1.4]). Hence, (6.7) is indeed true, and the expression within parentheses in the first term on the right-hand side of (6.6) converges to

[TABLE]

in $\mathscr{L}_{\ell-1}(\mathbb{L}^{q};\mathbb{G}^{p})$ as $\varepsilon\to 0$ , uniformly over $k$ belonging to bounded subsets of $\mathbb{L}^{q}$ . Let us now consider the second term on the right-hand side of (6.6). One has, recalling that $v_{j}=D^{j}u$ for every $j\leq\ell-1$ by inductive hypothesis,

[TABLE]

where the second term on the right-hand side converges to

[TABLE]

in $\mathscr{L}_{\ell-1}(\mathbb{L}^{q};\mathbb{G}^{p})$ as $\varepsilon\to 0$ , uniformly over $k$ bounded in $\mathbb{L}^{q}$ , because again everything depends only on derivatives of order at most $\ell-1$ and we can apply the usual criteria on Fréchet differentiability of multilinear maps and composite functions. Note that this term cancels out with the corresponding one obtained previously. Going back then to (6.6), testing by an arbitrary element $(k_{2},\ldots,k_{\ell})\in(\mathbb{L}^{q})^{\ell-1}$ , and using Lemma 2.4, we infer that

[TABLE]

Taking supremum over $(k_{2},\ldots,k_{\ell})$ bounded in $(\mathbb{L}^{q})^{\ell-1}$ and using the Lipschitz-continuity of $G$ , we infer that, for every $T_{0}\in\mathopen{(}0,T\mathclose{]}$ ,

[TABLE]

By the continuity of $\kappa$ we can choose $T_{0}$ sufficiently small such that, after rearranging terms,

[TABLE]

Using the same argument leading to (6.4) in the proof of Proposition 6.2, a classical patching argument yields then

[TABLE]

on the whole time interval $[0,T]$ . Taking into account the remarks made above we have that

[TABLE]

We conclude that the left-hand side of converges to zero in $\mathscr{L}_{\ell-1}(\mathbb{L}^{q};\mathbb{S}^{p})$ as $\varepsilon\to 0$ , uniformly with respect to $k$ belonging to any bounded subset of $\mathbb{L}^{q}$ , as required. ∎

Bibliography20

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A. Ambrosetti and G. Prodi, A primer of nonlinear analysis , Cambridge University Press, Cambridge, 1995. MR 1336591 (96a:58019)
2[2] A. Andersson, A. Jentzen, R. Kurniawan, and T. Welti, On the differentiability of solutions of stochastic evolution equations with respect to their initial values , Nonlinear Anal. 162 (2017), 128–161. MR 3695960
3[3] V. I. Averbukh and O. G. Smolyanov, Differentiation theory in linear topological spaces , Uspekhi Mat. Nauk 22 (1967), no. 6 (138), 201–260. MR 0223886
4[4] V. I. Bogachev, N. V. Krylov, M. Röckner, and S. V. Shaposhnikov, Fokker-Planck-Kolmogorov equations , American Mathematical Society, Providence, RI, 2015. MR 3443169
5[5] V. I. Bogachev and O. G. Smolyanov, Topological vector spaces and their applications , Springer, Cham, 2017. MR 3616849
6[6] S. Cerrai, Second order PDE’s in finite and infinite dimension , Lecture Notes in Mathematics, vol. 1762, Springer-Verlag, Berlin, 2001. MR 2002 j:35327
7[7] G. Da Prato, Kolmogorov equations for stochastic PD Es , Birkhäuser Verlag, Basel, 2004. MR 2111320 (2005 m:60002)
8[8] G. Da Prato, S. Kwapień, and J. Zabczyk, Regularity of solutions of linear stochastic equations in Hilbert spaces , Stochastics 23 (1987), no. 1, 1–23. MR 920798 (89b:60148)

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Fréchet differentiability of mild solutions to SPDEs with

Abstract

1 Introduction

2 Preliminaries

2.1 Notation

2.2 Notions of derivative

Lemma 2.1**.**

Proof.

2.3 Estimates for deterministic and stochastic convolutions

Lemma 2.2**.**

Proof.

Proposition 2.3**.**

Lemma 2.4**.**

3 Well-posedness

Definition 3.1**.**

Theorem 3.2**.**

Proof.

Remark 3.3*.*

4 Gâteaux differentiability of the solution map

Lemma 4.1**.**

Proof.

Lemma 4.2**.**

Proof.

Theorem 4.3**.**

Proof.

5 Fréchet differentiability of the solution map

Theorem 5.1**.**

Proof.

6 Fréchet differentiability of higher order

Example 6.1*.*

Proposition 6.2**.**

Proof.

Remark 6.3*.*

Lemma 6.4**.**

Proof.

Theorem 6.5**.**

Proof.

Lemma 2.1.

Lemma 2.2.

Proposition 2.3.

Lemma 2.4.

Definition 3.1.

Theorem 3.2.

*Remark 3.3**.*

Lemma 4.1.

Lemma 4.2.

Theorem 4.3.

Theorem 5.1.

*Example 6.1**.*

Proposition 6.2.

*Remark 6.3**.*

Lemma 6.4.

Theorem 6.5.