Normal approximation of the solution to the stochastic heat equation with L\'evy noise
Carsten Chong, Thomas Delerue

TL;DR
This paper establishes conditions under which solutions to the stochastic heat equation with Lévy noise converge in law to Gaussian solutions, extending finite-dimensional results to infinite-dimensional settings without assuming infinite divisibility.
Contribution
It provides necessary and sufficient conditions for convergence of Lévy-driven solutions to Gaussian solutions in infinite dimensions, using martingale problem characterizations.
Findings
Derived convergence conditions based on noise variances
Extended finite-dimensional Lévy process results to infinite-dimensional case
Characterized solutions via martingale problems in Sobolev spaces
Abstract
Given a sequence of L\'evy noises, we derive necessary and sufficient conditions in terms of their variances such that the solution to the stochastic heat equation with noise converges in law to the solution to the same equation with Gaussian noise. Our results apply to both equations with additive and multiplicative noise and hence lift the findings of S. Asmussen and J. Rosi\'nski [J. Appl. Probab. 38 (2001) 482-493] and S. Cohen and J. Rosi\'nski [Bernoulli 13 (2007) 195-210] for finite-dimensional L\'evy processes to the infinite-dimensional setting without making distributional assumptions on the solutions such as infinite divisibility. One important ingredient of our proof is to characterize the solution to the limit equation by a sequence of martingale problems. To this end, it is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Financial Risk and Volatility Modeling · Complex Systems and Time Series Analysis
Normal approximation of the solution to the stochastic heat equation with Lévy noise
Carsten Chong and Thomas Delerue Institut de mathématiques, École Polytechnique Fédérale de Lausanne, Station 8, CH-1015 Lausanne, e-mail: [email protected] of Mathematical Statistics, Technical University of Munich, Boltzmannstraße 3, DE-85748 Garching, e-mail: [email protected]
Abstract
Given a sequence of Lévy noises, we derive necessary and sufficient conditions in terms of their variances such that the solution to the stochastic heat equation with noise converges in law to the solution to the same equation with Gaussian noise. Our results apply to both equations with additive and multiplicative noise and hence lift the findings of S. Asmussen and J. Rosiński [J. Appl. Probab. 38 (2001) 482–493] and S. Cohen and J. Rosiński [Bernoulli 13 (2007) 195–210] for finite-dimensional Lévy processes to the infinite-dimensional setting without making distributional assumptions on the solutions such as infinite divisibility. One important ingredient of our proof is to characterize the solution to the limit equation by a sequence of martingale problems. To this end, it is crucial to view the solution processes both as random fields and as càdlàg processes with values in a Sobolev space of negative real order.
[TABLE]
Keywords: càdlàg modification, convergence of semimartingale characteristics, functional convergence in law, Lévy space–time white noise, martingale problems, Skorokhod representation theorem, small jump approximation, Sobolev spaces of negative order, stochastic PDEs, weak limit theorems
1 Introduction
The importance of the Gaussian distribution in probability theory and its popularity in applications are manifested in the central limit theorem: The total effect of a large number of small independent contributions is approximately normal. Therefore, when physical systems governed by one or several equations are pertubed by white noise (where “white” means stationary and uncorrelated), it is frequently assumed that the noise is Gaussian.
For example, in his Saint-Flour lecture notes [27], J. B. Walsh discusses an application of parabolic stochastic PDEs to the modeling of neuron potentials. Subject to impulses arriving according to a marked Poisson point process (with mean [math] and atoms of size at time and position ), the electrical potential of the neuron, viewed as a thin cylinder of length, say, , is then well described by the stochastic cable equation
[TABLE]
with suitable boundary and initial conditions. Arguing that “the impulses are generally small, and there are many of them, so that in fact is very nearly a white noise” ([27], p. 311), the author then approximates (1.1) by
[TABLE]
where is a Gaussian space–time white noise.
But of course, the central limit theorem has limitations. In the absence of finite second moments, stable limits may arise; and if there are rare but large contributions, we may have a Poisson limit. In general, any infinitely divisible distribution can arise as a possible limit of compound Poisson laws; see Corollary 3.8 in [23]. This leads us to the following question: If we have a sequence of noises as above where the atom sizes of converge to [math] as , and if denotes the solution to (1.1) with noise (where is the variance of ), do we have convergence in distribution of to the solution of (1.2) with the Gaussian noise ? A positive answer for this normal approximation is given in Theorem 7.10 in [27]: If the atoms of are locally summable and the jump measure of satisfies
[TABLE]
for some , then converges in distribution to .
The purpose of this work is to substantially generalize this result in two aspects. Given that (1.3) is sufficient but not necessary for (see Remark 2.3 below), our first contribution is to show that the necessary and sufficient condition for the normal approximation is
[TABLE]
for all . In fact, if (resp., ) is a Lévy process (resp., Brownian motion), the same condition was found to be necessary and sufficient for in [8]. Somewhat surprisingly, in the special case of small jump approximation, that is, where and is a given Lévy measure, it was shown in [2] that condition (1.4) fails for prominent examples such as the compound Poisson or the gamma distribution. So in these cases, the small jump approximation is not true for Lévy processes and by our results, not true for stochastic PDEs, either.
Our second contribution is to consider equations with multiplicative noise. To our best knowledge, previous works on the normal approximation of stochastic PDEs with jumps have only considered the situation of additive noise; see, besides the mentioned results in [27], also [16, 26] (there is, of course, literature concerning approximation of multiplicative Gaussian white noise by smoother noises [4, 13], but these problems are very different in nature than the one considered here). The proofs in [16, 26, 27] (as well as those of [2, 8]) are based on characteristic functions and the Lévy–Khintchine formula for infinitely divisible distributions, which obviously do not generalize to the situation of multiplicative noise.
Instead, our approach will be to show that satisfies martingale problems which, assuming (1.4) only and not the stronger condition (1.3), have a limit with a unique solution. But this leads to several complications. In order to prove convergence of the associated martingales, we need some sort of uniformity in the time variable (as given, for example, by convergence in the Skorokhod topology). So taking simply the space to support the solutions and will not be sufficient. This is why we will draw upon the results of [7] and view the solution (and also ) as a càdlàg process on with values in the Sobolev space for some (see Section 2.1 for a definition). In order to show tightness in that space with the Aldous criterion [1], we will use the factorization method from [9, 22] to obtain uniform bounds in time without taking moments of order higher than two. Another subtlety that arises in the analysis of the (semi-)martingales mentioned above is that their predictable characteristics are not given by a function of the former, which distinguishes our proof from the corresponding ones for (finite- or infinite-dimensional) stochastic differential equations in [11, 19, 20].
We shall also mention that the initial motivation in [2, 8] to study the normal approximation of Lévy processes comes from numerical simulation. Indeed, for stochastic PDEs as in (1.1) with multiplicative Lévy noise, the rate of convergence of a numerical scheme obtained by removing the small jumps of the noise is slower for noises with a high intensity of small jumps; see [5]. However, the results in [17] and [25] show that in the case of SDEs, an additional Gaussian approximation of the otherwise neglected small jumps improves the rate of convergence. We leave it to future research to examine to what extent this also holds for stochastic PDEs.
The remaining paper is organized as follows. In Section 2, we first describe in detail the considered equations and recall the definition of Sobolev spaces of real order in Section 2.1 before we state our main result, Theorem 2.1, in Section 2.2. Here we also explain the main steps and ideas behind the proof, whereas the details are given in Section 3.
2 Results
2.1 Preliminaries
Let and consider on a filtered probability space that satisfies the usual conditions, for any , the stochastic heat equation on with Dirichlet boundary conditions:
[TABLE]
The function in equation (2.1) describes the multiplicative part of the noise and will be assumed to be a Lipschitz continuous function. Concerning the driving noise , we assume that is a pure-jump Lévy space–time white noise on given by
[TABLE]
for all . In this representation, is a homogeneous Poisson random measure on relative to the filtration , with intensity measure . Here is a Lévy measure on , that is, and . We refer to Chapter II in [14] for the definition of stochastic integrals with respect to Poisson random measures. Furthermore, we assume that for all ,
[TABLE]
Note that this integral is the variance of . In the special case where we have a single Poisson random measure having intensity measure , setting
[TABLE]
leads us to the case of small jump approximation considered in [2].
A predictable random field is called a mild solution to (2.1) if for all ,
[TABLE]
-almost surely, where
[TABLE]
for , is the Dirichlet Green’s function of the heat operator on .
The existence of a mild solution to (2.1) is guaranteed by Theorem 3.1 in [6] and condition (2.3) on the Lévy measure and it is, up to modifications, unique among all predictable random fields satisfying
[TABLE]
for any and .
In this paper, we want to examine when the normal approximation holds for , that is, when can be approximated in law by the mild solution to the same stochastic heat equation as above, but driven by a Gaussian space–time white noise on :
[TABLE]
The driving noise is now a centered Gaussian random field with covariance structure for any measurable sets . It is well-known (see, for example, Theorem 3.2 in [27]) that, up to modifications, equation (2.8) has a unique mild solution satisfying the corresponding bound in (2.7) for all .
Throughout this work, we will look at the mild solutions and from two different points of view. First, they are random elements in the function space as the uniform bound (2.7) shows. But then, as mentioned in the introduction, we will need stronger path regularity in the time variable for our proofs. This is why we shall consider and also as stochastic processes with values in an infinite dimensional space, which we will describe in the following.
Consider for any , the fractional Sobolev space
[TABLE]
where , , form an orthonormal basis of . This is a Hilbert space with scalar product
[TABLE]
and norm for .
The topological dual of is also a Hilbert space, whose dual norm can be expressed, by the Riesz representation theorem, as
[TABLE]
(Note that if are elements of the same -space, then will always denote the standard scalar product of that space. If is an element of a Hilbert space and an element of its topological dual, then will always denote the dual pairing of with .)
Coming back to the mild solution to (2.1), if we identify with the process where
[TABLE]
for all , then by Theorem 2.5 in [7], has a càdlàg modification in for any , which will be denoted by throughout this work. Similarly, by the identification (2.10) and Corollary 3.4 in [27], the mild solution to (2.8) has a continuous modification in for each .
2.2 Main result
We now introduce the Cartesian product
[TABLE]
with . Let denote the metric induced by the -norm on and be the Skorokhod metric on . We then equip with the product metric
[TABLE]
for any and . The main result of this paper is the following limit theorem.
Theorem 2.1**.**
Assume that is given by (2.2) with a variance that satisfies (2.3) for all . Let be the -valued mild solution to the stochastic heat equation (2.1) driven by and be its càdlàg modification in . Similarly, let be the -valued mild solution to the stochastic heat equation (2.8) driven by and be its continuous modification in .
Suppose also that the Lipschitz function in (2.1) satisfies . Then, as ,
[TABLE]
for all if and only if (1.4) holds for all .
Remark 2.2.
We can generalize Theorem 2.1 to nonzero initial conditions. Assume that in both equations (2.1) and (2.8) we now have for all , where is a bounded continuous function with . Define
[TABLE]
Then Theorem 2.1 can be shown in a completely analogous manner if we assume that there exists such that (instead of ), and this assumption is only needed for showing the necessity of (1.4). To be more precise, since the mild solution to (2.8) now satisfies
[TABLE]
-almost surely, a similar argument as in Remark 3.14 shows that for some and hence, the expectation in (3.68) is nonzero.
Remark 2.3.
Let us relate the two conditions (1.3) and (1.4) to each other. Using Hölder’s and Chebyshev’s inequalities, we see from the estimate
[TABLE]
The other implication is not true in general. For example, assume that has density
[TABLE]
where . Then for every , so condition (1.3) does not hold. But a direct calculation shows that . So given , we have for small values of , which implies (1.4) because
[TABLE]
Proof of Theorem 2.1.
We begin by showing that (1.4) implies the weak convergence (2.13). Since is a metric space, we follow the classical scheme of first showing tightness and then uniqueness of the limiting distribution.
In Theorem 3.4, we show that is tight in and in Theorem 3.7 that is tight in . By the subsequence principle, this immediately implies that the random elements are tight in . As it turns out, no assumptions on the Lévy noise other than the ones specified in (2.2) and (2.3) are needed for this tightness property.
As a consequence, we can apply Prokhorov’s theorem, which provides for any sequence with , a subsequence such that converges weakly to some distribution on as . For notational simplicity, we will assume without loss of generality that the whole sequence converges weakly.
Since is a complete separable metric space, we can further apply Skorokhod’s representation theorem (see Theorem 4.30 in [15]) and obtain random elements
[TABLE]
defined on a possibly different probability space , satisfying the following properties:
[TABLE]
We will show that
[TABLE]
which in turn implies (2.13). To do this, we first define a filtration on by setting
[TABLE]
We further define for , and ,
[TABLE]
as well as
[TABLE]
We will then show in Theorem 3.12 that, under assumption (1.4), the pair satisfies the following martingale problem. For all and , the process is a martingale with respect to . Note that we are able to obtain this property only because in (2.15), we have -wise convergence both in the Skorokhod topology and in , which is the reason why we view the solutions to (2.1) and (2.8) as pairs in .
Next, we will show in Theorem 3.13 that this martingale property in turn implies that there exists a Gaussian space–time white noise on , possibly defined on a filtered extension of such that, with probability one, the random field is equal in to the mild solution to the stochastic heat equation
[TABLE]
and such that is indistinguishable from the continuous version in of , which concludes the first part of the proof.
For the second part, under the assumption , Theorem 3.15 directly shows that (2.13) implies (1.4). ∎
3 Details of the proof
In the remainder of this work, the letter will always denote a strictly positive constant whose value may change from line to line. Furthermore, by the Lipschitz continuity of the function , there exists a positive constant that we hold fixed from now on such that for all .
3.1 Tightness
We start with three lemmas that will provide uniform bounds in for the second moments of , which will be crucial for proving tightness of in .
Lemma 3.1**.**
The family of mild solutions to (2.1) satisfies
[TABLE]
and this uniform bound only depends on the Lipschitz function .
Proof.
Using Itō’s isometry and the definition (2.5) of , we have for fixed and ,
[TABLE]
Using the Lipschitz continuity of and the elementary inequality , we then obtain for and ,
[TABLE]
Now in order to find a bound for uniformly in , and , we will use a comparison principle for deterministic Volterra equations. By (B.5) in [3], there exists a constant such that on , where
[TABLE]
is the heat kernel on . Since for all , we obtain
[TABLE]
Recall from (2.7) that is uniformly bounded in for fixed . Therefore, by Lemma 6.4 (2) and (3) in [6], the mild solution satisfies
[TABLE]
for all and , where is the unique nonnegative solution of the deterministic Volterra equation
[TABLE]
and satisfies . ∎
The next lemma gives an alternative integral representation of and is an extension of the factorization method in [9] and [22].
Lemma 3.2**.**
For define
[TABLE]
We then have
[TABLE]
and for all , the representation
[TABLE]
holds -almost surely.
Proof.
First, using Itō’s isometry, the Lipschitz continuity of and Lemma 3.1, we have
[TABLE]
The last integral on the right-hand side is finite if . Indeed, by (B.5) in [3], it can be bounded by
[TABLE]
The identity (3.5) follows in the same way as Lemma 5 in [22]. ∎
Lemma 3.3**.**
The family of mild solutions to (2.1) satisfies
[TABLE]
for all .
Proof.
We will use the integral representation (3.5) of Lemma 3.2. Fix and . Using Fubini’s theorem, we have
[TABLE]
By the semigroup property of the Green’s function, the integral on the right-hand side is equal to
[TABLE]
Using (B.5) in [3] and (3.3), we obtain
[TABLE]
Now let , take the -norm of and apply Minkowski’s integral inequality to obtain
[TABLE]
Take expectation, use the Cauchy–Schwarz inequality and (3.4) to further obtain
[TABLE]
Note that the right-hand side of (3.7) does not depend on anymore. We now consider the integrand
[TABLE]
For fixed , the maximum of the function is for some that is independent of . Let and consider the estimate
[TABLE]
Since , we obtain
[TABLE]
Moreover, the integral is finite because , and the expectation in (3.7) is now bounded by
[TABLE]
for any and . By assumption, and . Hence, we can choose and such that . As a consequence, by Lemma 2 of Chapter 1 in [12], the estimate
[TABLE]
holds for . We have assumed that since otherwise, the last integral is bounded by some constant, which immediately implies that the expectation in (3.7) is uniformly bounded in .
Therefore, we further estimate the integral in (3.8) by
[TABLE]
which is finite because our choice of and implies . This concludes the proof. ∎
We can now proceed to showing tightness.
Theorem 3.4**.**
The family of mild solutions to (2.1) is tight in the Hilbert space .
Proof.
It is easy to see that the functions
[TABLE]
where and for all , form an orthonormal basis of .
First, using the stochastic Fubini theorem (see, for example, Theorem 2.6 in [27]), we have for all ,
[TABLE]
Define for all ,
[TABLE]
Using Fubini’s theorem, the expression (2.6) of the Green’s function and the orthogonal properties of , we obtain for all ,
[TABLE]
Using the integral formula , we can further calculate
[TABLE]
For the -norm of , we then have
[TABLE]
for all . Since
[TABLE]
we obtain from Lemma 3.1
[TABLE]
which implies that
[TABLE]
for all . Moreover, again by Lemma 3.1, we have
[TABLE]
as for all . Therefore, we can conclude from Theorem 1 in [24] that is tight in . ∎
The next two propositions will imply that is tight in .
Proposition 3.5**.**
The càdlàg processes satisfy the Aldous condition: Let and be sequences of positive numbers with and as . For each , further let be a stopping time with respect to the filtration generated by the stochastic process . Then we have for any ,
[TABLE]
Proof.
Recall the expression of the dual norm in (2.9). We have
[TABLE]
We will find a convenient semimartingale decomposition for the real-valued stochastic process for any and that will then allow us to estimate the expectation of the terms appearing in (3.11).
First, proceeding as in (3.9) and (3.10), we have for all ,
[TABLE]
If we define
[TABLE]
for all and , then
[TABLE]
for all , where the last term is the Itō integral of the deterministic function against the square-integrale martingale . Because the integrand is a -function, the integration by parts formula for semimartingales yields
[TABLE]
Altogether we obtain the semimartingale decomposition
[TABLE]
for all , and .
Now the process is the càdlàg version of , so we can infer that and the right-hand side of (3.12) are indistinguishable since the latter is also càdlàg. Coming back to (3.11), we can now decompose
[TABLE]
where
[TABLE]
for all . We now gather some moment estimates for these three terms. First, for the martingale term , we have by Itō’s isometry,
[TABLE]
Using the Lipschitz continuity of , we can bound the last term in (3.13) by
[TABLE]
for any . The second term equals , which converges to 0 as .
For the first term, choose . Using Hölder’s inequality and Lemma 3.3, we obtain
[TABLE]
Altogether, this implies as for all .
Next, we have
[TABLE]
and by Itō’s isometry as well as Lemma 3.1,
[TABLE]
for all . Therefore, by Doob’s inequality, is bounded by
[TABLE]
for all .
Finally, with and similar calculations, is bounded by
[TABLE]
for all . As a consequence, recalling that and thus , we obtain by (3.11) and dominated convergence,
[TABLE]
as , which is the assertion of the proposition. ∎
Proposition 3.6**.**
For any fixed and , the random elements are tight in .
Proof.
Proceeding as in the proof of Proposition 3.5 and using Lemma 3.1, we have
[TABLE]
for any and . Hence, we have for all , and ,
[TABLE]
and thus -almost surely.
Because the penultimate term in the inequality above does not depend on , by Markov’s inequality, we can further deduce
[TABLE]
for all and . Since the embeddings
[TABLE]
are compact for by Theorem 4.58 in [10], it follows that is tight in for any fixed and . ∎
Theorem 3.7**.**
For any , the càdlàg modifications are tight in the Skorokhod space .
Proof.
By Theorem 6.8 in [27], this is a direct consequence of Propositions 3.5 and 3.6. ∎
3.2 Characterization of the limit
After proving tightness in Section 3.1, our next goal is to characterize the limit distribution of weakly converging subsequences. Following the outline of the proof of Theorem 2.1, the first step is to show that under condition (1.4) on the Lévy measure , the process in (2.18) is a martingale with respect to the filtration defined in (2.16). In order to achieve this result, which is Theorem 3.12 below, we prove that the pairs satisfy related martingale problems (Theorem 3.8) and that these “converge” as (Theorem 3.9).
Recall that for all test functions and fixed ,
[TABLE]
-almost surely. This follows, in a similar way to Theorem 3.2 in [27], from the fact that in our situation, we may apply the stochastic Fubini theorem; see, for example, Theorem 2.6 in [27].
Theorem 3.8**.**
For each , the pair where is the mild solution to the stochastic heat equation (2.1) and is its càdlàg modification in , with , satisfies the following martingale problem. For all and , the complex-valued stochastic process
[TABLE]
with , is a square-integrable -martingale with the uniform bound
[TABLE]
Proof.
First, since is the càdlàg version of , the stochastic process is indistinguishable from the right-hand side of (3.15) for any . This directly implies that is an -semimartingale without continuous martingale part. Furthermore, one can easily verify that the -compensator of the jump measure of (on ) is given by
[TABLE]
As a consequence, using Itō’s formula (see, for example, Theorem I.4.57 in [14]), (3.15) and the fact that , we have
[TABLE]
and therefore, by (3.16),
[TABLE]
for all . The two integral processes on the right-hand side of (3.19) are square-integrable -martingales (for the second, this is implied by the elementary inequalities and for all , together with Lemma 3.1) and hence, this is also the case for . By Itō’s isometry, we further obtain
[TABLE]
as well as
[TABLE]
for all . We estimate the last expectation, using the elementary inequalities given above as well as the definition of , by
[TABLE]
Altogether, we obtain (3.17) from (3.19), the Lipschitz continuity of and Lemma 3.1. ∎
We now switch to the probability space from the Skorokhod construction in (2.14) and define the process in the same way as in (3.16), but with and replaced by in (2.15) and , respectively.
Theorem 3.9**.**
Under (1.4), we have pointwise on for any and ,
[TABLE]
in the Skorokhod space , where is the process in (2.18).
In order to prove this result, we first rewrite in a more convenient form. For any fixed , and , let
[TABLE]
for all and . Note that are the predictable characteristics of the -semimartingale and that they are functions of the random field and not of itself (which is another reason why we have adopted a dual view on the solutions to (2.1) and (2.8) as elements of ). The process introduced above can thus be written as
[TABLE]
Define the truncation functions
[TABLE]
The key idea of the proof of Theorem 3.9 is to see that for fixed , and , the function in (3.20) (resp., in (2.17)) is the Lévy exponent of the infinitely divisible distribution (resp., ) with characteristics (resp., ) with respect to . Then we can make use of the following result, which is the only place in this work where (1.4) will actually be needed.
Theorem 3.10**.**
If (1.4) holds, then for any , and , we have
[TABLE]
Proof.
As in the proof of Theorem 2.2 in [8] (see also Theorem 2.1 in [2]), it suffices to show that
[TABLE]
as for all . Starting with , we have
[TABLE]
We can ignore all points in the domain of integration where . So if we let
[TABLE]
for any , and , then, using the triangle inequality and the fact that , we obtain
[TABLE]
By the Lipschitz continuity of and Hölder’s inequality, we have for the first integral on the right-hand side of (3.25),
[TABLE]
By (2.15), in pointwise on . Hence, the sequence is bounded in and we have
[TABLE]
which implies as .
The second integral in (3.25) is more difficult to handle. We decompose it into , where
[TABLE]
for all .
Again we will study each of these three integrals separately. On the set , we have and therefore
[TABLE]
Thus,
[TABLE]
Because the term does not depend on , we can use condition (1.4), whence
[TABLE]
for all and . Since , we obtain by dominated convergence that as for all .
Next, we have by Chebyshev’s inequality,
[TABLE]
which tends to 0 as , uniformly in .
Finally, we have by dominated convergence,
[TABLE]
uniformly in and . Altogether, we have just shown that the left-hand side of (3.25) converges to 0 as , which is condition in (3.23).
The two other conditions will follow from our last calculations. Indeed, we have
[TABLE]
where the first term vanishes because
[TABLE]
as . Furthermore,
[TABLE]
These integrals are exactly the same as in the last line of (3.25), so we obtain \big{|}\overline{B}_{t}^{k}-\overline{B}_{t}\big{|}\longrightarrow 0, which is condition . From this, condition immediately follows since
[TABLE]
∎
The following technical lemma is a direct consequence of Theorem 3.10 and will be crucial for proving Theorem 3.9 afterwards.
If is a function of locally finite variation, we denote by the total variation of the function on the interval . If is complex-valued, we have .
Lemma 3.11**.**
If (1.4) holds, then we have pointwise on ,
[TABLE]
for any , and , where the processes and are defined in (3.20) and (2.17), respectively.
Proof.
For fixed , and , the infinitely divisible distributions and , defined before Theorem 3.10, have Lévy exponents and , respectively. By that theorem, as . This immediately implies the first claim of the proposition (see, for example, Equation VII.2.6 in [14]).
For the second claim, we will need the truncation function
[TABLE]
The main difference between the function , used so far, and is that the latter is continuous. Since this property will be needed for technical reasons, we replace by in the expression of in (3.20) and thus obtain
[TABLE]
With (3.31) and (2.17), we then calculate
[TABLE]
Consequently,
[TABLE]
We will show that the two integrals above converge to 0 as .
The function is bounded, continuous (because is) and as . Hence, by condition of Theorem VII.2.9 in [14], we can infer
[TABLE]
from Theorem 3.10.
Consider now the first integral in (3.33), and notice that
[TABLE]
Using the triangle inequality, we can therefore estimate this integral by
[TABLE]
The second integral above converges to 0 as shown in (3.30), while the same holds for the first integral by (3.25) (set ). Together with (3.33), we conclude that as .
It remains to show that as , which will be done in a similar manner as before. From (3.32), we have
[TABLE]
The first integral on the right-hand side above converges to 0 by (3.28). Furthermore, since
[TABLE]
also the second integral vanishes by (3.29). Finally, the function \big{|}\sin(\xi x)-\xi\vartheta(x)\big{|} is bounded, continuous and as . Hence, we can again apply Theorem VII.2.9 in [14] in order to obtain
[TABLE]
We conclude that , and altogether as . ∎
Proof of Theorem 3.9.
According to Proposition VI.1.23 in [14], because the function is continuous, converges to in the Skorokhod topology for fixed if
[TABLE]
in as .
Using the definition of the Skorokhod topology, we can easily infer from the convergence of to in given in (2.15) that
[TABLE]
as for all .
Next, we have
[TABLE]
Lemma 3.11 then immediately gives us as . In addition, the Skorokhod convergence of towards implies for all continuity points of ; see, for example, VI.2.3 of [14]. Since a càdlàg function has at most countably many discontinuities, we have for almost all . So dominated convergence implies that also the last term of the previous display converges to 0 as . ∎
We have now gathered all the intermediate results needed for the following theorem.
Theorem 3.12**.**
If (1.4) holds, then in (2.15) satisfies the following martingale problem. For all and , the process defined in (2.18) is a martingale with respect to the filtration in (2.16).
Furthermore, has an -predictable modification and
[TABLE]
Finally, for almost all , as well as holds with probability one.
Proof.
By Theorem 3.8, for any , and , the process defined in (3.16) is a square-integrable -martingale. Moreover, as and in (2.15) are adapted to the filtration , the same holds for from (3.21) as well as , and by a limit argument. Since has the same distribution as by (2.15), standard arguments now show that is an -martingale for all , and . This is the martingale problem satisfied by the pair .
Using Theorem 3.9, we have
[TABLE]
as for all . This implies almost everywhere on for all . Furthermore,
[TABLE]
uniformly in and by Theorem 3.8. Hence, again by standard arguments, we can deduce that is an -martingale as well for any and .
Now we show the second part of the theorem. The convergence in (2.15) implies convergence in measure (with respect to the Lebesgue measure on ) of towards for all . Hence, we have by dominated convergence,
[TABLE]
as , and thus, converges to in -measure. Therefore, there exists a subsequence such that
[TABLE]
and we will assume without loss of generality that (3.36) holds for the whole sequence. This in turn implies -almost surely as for almost all . Using Fatou’s lemma, we obtain
[TABLE]
Furthermore,
[TABLE]
for all , so (3.34) follows from Lemma 3.1. (In order to show (3.38), consider for the mollified random fields and on , defined exactly as in (1.8) of Chapter 10 in [12]. Then (2.15) implies
[TABLE]
for all , and . In addition, using Lemma 3 of Chapter 10 in [12], we have
[TABLE]
for all , and . As a consequence, we can find a sequence converging to 0 such that
[TABLE]
as for all , and . So (3.38) follows from (3.39) and (3.40).)
Next, is stochastically continuous by Theorem 4.7 in [6] and Lemma B.1 in [3]. This and (3.38) imply that is also stochastically continuous. By a straightforward extension of Proposition 3.21 in [21] to two-parameter processes, each has a predictable modification . By (3.36), we have -almost everywhere, so has a predictable modification as well.
Finally, the last statement is easy and we leave the details to the reader. ∎
We can now finish the proof of the weak convergence (2.13). Indeed, the martingale problem stated in Theorem 3.12 and satisfied by in (2.15) will allow us to identify uniquely the distribution of (from now on we may and will assume that is predictable).
Note that the next theorem holds independently of all our previous results.
Theorem 3.13**.**
On a filtered probability space , let be an -predictable random field and an -adapted càdlàg process in , with . Assume that for almost all , as well as holds -almost surely and that
[TABLE]
In addition, assume that the pair satisfies the following martingale problem. For all and , the process defined via (2.17) and (2.18) is a local -martingale.
Then there exists a Gaussian space–time white noise on , possibly defined on a filtered extension of such that, with probability one, is equal in to the mild solution to the stochastic heat equation (2.8) with noise . Furthermore, is indistinguishable from the modification of the latter that is continuous in .
Proof.
The proof is inspired by Lemma 2.4 in [18]. First, Theorem II.2.42 in [14] shows that for any , the stochastic process is an -semimartingale with first and second characteristic given by
[TABLE]
respectively. Furthermore, the third characteristic of equals 0, which implies that is continuous. As -almost surely, its canonical decomposition is
[TABLE]
where denotes the continuous martingale part of . Since
[TABLE]
which is finite by assumption, the quadratic variation of is integrable, so
[TABLE]
is a continuous square-integrable -martingale with quadratic variation process
[TABLE]
for all . The specifications (3.42) and (3.43) define an orthogonal martingale measure relative to , in the sense of Chapter 2 in [27], with covariation measure
[TABLE]
for all .
Now let be another filtered probability space on which a Gaussian space–time white noise on is defined. Set
[TABLE]
and extend the random measures and as well as the random elements and to in the standard way so that on , is independent of and thus of . In addition, on this extension, is still an orthogonal martingale measure satisfying (3.42) and (3.44) by Lemma II.7.3 in [14]. Define
[TABLE]
for all and . As before, this defines a martingale measure relative to .
Since and are independent, we have from (3.44),
[TABLE]
for all . Therefore, it follows from Proposition 2.1 in [27] that is orthogonal and from Proposition 2.10 in [27] that the martingale measure is a Gaussian space–time white noise on with respect to . Moreover, we have
[TABLE]
Since, by (3.44),
[TABLE]
the -martingales and are indistinguishable. This implies, together with (3.42) and (3.45), that we have for any ,
[TABLE]
-almost surely. By assumption, the equality in (3.46) holds also -almost surely for almost all if we replace with . This and the assumption (3.41) imply, by the proof of Theorem 3.2 in [27], that we have
[TABLE]
for almost all , i.e., satisfies the mild formulation of (2.19) almost everywhere. Now let be a mild solution to (2.19). Again by Theorem 3.2 in [27] and its proof, we can infer that -almost surely, and are equal almost everywhere and hence, in .
Finally, let be the continuous modification in of , which we obtain from Corollary 3.4 in [27]. By (3.47), -almost surely for almost all , and therefore, because is continuous and càdlàg, these two processes are indistinguishable. ∎
3.3 Necessity of the condition (1.4)
Remark 3.14.
Suppose that the Lipschitz function satisfies . Then there must be such that , where is the mild solution to (2.8). Indeed, if we had -almost surely for all , it would imply everywhere on by equation (2.8). This in turn would imply , which contradicts the assumption.
Theorem 3.15**.**
Assume that . In the setting of Theorem 2.1, if (2.13) holds, then we have (1.4) for all .
Proof.
If (2.13) holds, we can use Skorokhod’s representation theorem as in the first part of the proof of Theorem 2.1 and obtain for any sequence converging to 0, random elements
[TABLE]
on a probability space possibly different from that satisfy (2.15). Of course, we now have
[TABLE]
Consider the same filtration on as in (2.16). For fixed , define the -adapted processes
[TABLE]
for all and . It is straightforward to infer from (2.15) that pointwise on ,
[TABLE]
Furthermore, by (2.15), (3.48) and (3.49), and have the same distribution as the square-integrable -martingales
[TABLE]
respectively, and therefore, by standard arguments, we can deduce that and are -martingales and that is continuous.
Recall the truncation function introduced in (3.22). Using Theorem II.2.21 in [14], we can further infer that the -semimartingale characteristics of and , relative to for a fixed but arbitrary , are given by and , respectively, where is defined as in (3.20), is defined as in (2.17) and
[TABLE]
Define for all . Then we have, by definition of the first characteristic,
[TABLE]
where is a local -martingale.
Now since is continuous, Proposition VI.2.7 in [14] and (3.50) imply that -wise,
[TABLE]
We also have
[TABLE]
for any and by Proposition VI.3.26 and Lemma VI.4.22 in [14]. Therefore, there exists a subsequence of converging -almost surely to 0. For the sake of clarity, assume without loss of generality that this holds for the whole sequence. Applying the Cauchy–Schwarz inequality to in (3.51), we further deduce that
[TABLE]
and the last term converges -almost surely to 0 (note that the supremum is finite because in ). This implies
[TABLE]
-almost surely. Using Proposition VI.1.23 in [14], (3.52), (3.53) and (3.55), we obtain
[TABLE]
as well as
[TABLE]
-almost surely. Since the jumps of are uniformly bounded by , we can apply Proposition VI.6.13 in [14] on the sequence and then Theorem VI.6.22 (c) in [14] on the processes in (3.56) in order to obtain
[TABLE]
in as . By definition of the quadratic variation, we can therefore deduce that
[TABLE]
Denoting by the jump measure of , we have, since is continuous,
[TABLE]
Now denote for any ,
[TABLE]
Then is a square-integrable -martingale with , and for any bounded stopping time , we have, by the optional stopping theorem, . Therefore, by Lenglart’s inequality (see Lemma I.3.30 in [14]), we obtain for all and ,
[TABLE]
By (3.58), we have
[TABLE]
Moreover, because by (3.57) and by Proposition VI.3.26 (iii) in [14], we deduce from the inequality above that and, by (3.60), that
[TABLE]
for all . Finally, combine (3.57), (3.59) and (3.61) to see that
[TABLE]
for all and . Taking a subsequence if necessary, we will from now on assume that the convergence in (3.62) holds even -almost surely.
Recall now the definition of in (3.24) and that, because in , we have as ; see the calculations in (3.26) and (3.27). Together with (3.62) this implies -almost surely,
[TABLE]
for all and by a similar calculation as in (3.25) (note that the first inequality there becomes an equality if is replaced by throughout).
Now on the set , where , we have
[TABLE]
and thus
[TABLE]
Therefore, as a consequence of (3.63), we obtain -almost surely,
[TABLE]
as for all , and .
We have seen in (3.36) that we can assume (perhaps for a subsequence) that
[TABLE]
which implies, by dominated convergence and continuity of ,
[TABLE]
So from (2.15), (3.64) and (3.65), we deduce that
[TABLE]
for all , and . Moreover,
[TABLE]
So if we choose with arbitrary, then by (3.66), the first term on the right-hand side of (3.67) converges to 0 as for all and . The second term does not depend on nor and converges to 0 as . This implies
[TABLE]
as for all . Since , there exists such that by Remark 3.14. Moreover, the mild solution is continuous in , which follows from the proof of Corollary 3.4 in [27]. We can thus infer that the expectation in (3.68) is not 0 and we obtain
[TABLE]
for all , which is exactly (1.4). ∎
Acknowledgements
TD cordially thanks Claudia Klüppelberg for inspiring discussions and valuable advice as well as the Chair of Probabilities of the École Polytechnique Fédérale de Lausanne for its hospitality during his visit. TD’s research is partially supported by the Deutsche Forschungsgemeinschaft, project number KL 1041/7-1.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Aldous [1978] D. Aldous. Stopping times and tightness. Ann. Probab. , 6(2):335–340, 1978.
- 2Asmussen and Rosiński [2001] S. Asmussen and J. Rosiński. Approximations of small jumps of Lévy processes with a view towards simulation. J. Appl. Probab. , 38(2):482–493, 2001.
- 3Bally et al. [1995] V. Bally, A. Millet, and M. Sanz-Solé. Approximation and support theorem in Hölder norm for parabolic stochastic partial differential equations. Ann. Probab. , 23(1):178–222, 1995.
- 4Bardina et al. [2010] X. Bardina, M. Jolis, and L. Quer-Sardanyons. Weak convergence for the stochastic heat equation driven by Gaussian white noise. Electron. J. Probab. , 15(1):1267–1295, 2010.
- 5Chen et al. [2016] B. Chen, C. Chong, and C. Klüppelberg. Simulation of stochastic Volterra equations driven by space–time Lévy noise. In M. Podolskij, R. Stelzer, S. Thorbjørnsen, and A.E.D. Veraart, editors, The Fascination of Probability, Statistics and their Applications , pages 209–229. Springer, Cham, 2016.
- 6Chong [2017] C. Chong. Lévy-driven Volterra equations in space and time. J. Theoret. Probab. , 30(1):1014–1058, 2017.
- 7Chong et al. [2019] C. Chong, R. C. Dalang, and T. Humeau. Path properties of the solution to the stochastic heat equation with Lévy noise. Stoch. Partial Differ. Equ. Anal. Comput. , 7(1):123–168, 2019.
- 8Cohen and Rosiński [2007] S. Cohen and J. Rosiński. Gaussian approximation of multivariate Lévy processes with applications to simulation of tempered stable processes. Bernoulli , 13(1):195–210, 2007.
