An invariance principle for the stochastic heat equation
Mathew Joseph

TL;DR
This paper presents an approximation approach for the stochastic heat equation driven by white noise, replacing the fractional Laplacian with a discrete generator and white noise with i.i.d. variables, providing a new proof of convergence.
Contribution
It offers an alternative proof of the weak convergence of the scaled partition function of directed polymers to the stochastic heat equation, with convergence of all moments.
Findings
Proves convergence of the scaled partition function to the stochastic heat equation.
Provides an approximation scheme replacing fractional Laplacian with a discrete generator.
Establishes convergence of all moments of the partition function.
Abstract
We approximate the white-noise driven stochastic heat equation by replacing the fractional Laplacian by the generator of a discrete time random walk on the one dimensional lattice, and approximating white noise by a collection of i.i.d. mean zero random variables. As a consequence, we give an alternative proof of the weak convergence of the scaled partition function of directed polymers in the intermediate disorder regime, to the stochastic heat equation; an advantage of the proof is that it gives the convergence of all moments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Stochastic processes and statistical mechanics · Theoretical and Computational Physics
An invariance principle for the stochastic heat equation
Mathew Joseph
Abstract
We approximate the white-noise driven stochastic heat equation by replacing the fractional Laplacian by the generator of a discrete time random walk on the one dimensional lattice, and approximating white noise by a collection of i.i.d. mean zero random variables. As a consequence, we give an alternative proof of the weak convergence of the scaled partition function of directed polymers in the intermediate disorder regime, to the stochastic heat equation; an advantage of the proof is that it gives the convergence of all moments.
1 Introduction
Consider the following discrete space-time stochastic heat equation
[TABLE]
where is the transition kernel of a random walk on and is a Lipschitz continuous function with Lipschitz coefficient . The collection consists of i.i.d. random variables with
[TABLE]
Since we can remove from both sides of (1.1). A solution to (1.1) satisfies
[TABLE]
Above gives the -step transition probability of jumping from to . Note by homogeneity which we define to be by an abuse of notation.
We shall assume that the random walk has an asymptotic speed and when centered it is in the domain of attraction of a symmetric Stable() process with generator . Our assumptions will be stated in terms of the characteristic function of in the next section. A consequence of our main result Theorem 1.1, is that after an appropriate scaling of space, time and noise in (1.1), we get the continuous space-time stochastic heat equation. To be precise consider
[TABLE]
We show that for
[TABLE]
where denotes convergence in distribution, and is the mild solution to the stochastic heat equation
[TABLE]
Note that the existence and uniqueness of mild solutions to (1.5) (see [14]), as well as the finiteness of the speed of the random walk in the domain of attraction of a Stable() process requires
[TABLE]
and we shall assume this without further mention for the rest of the paper.
In the case that , the limiting random field (1.5) is Gaussian. One can then use the Lindeberg-Feller theorem to prove the weak convergence of to , and this was the approach used by [27] in their analysis of the Harness process. Another special case is when . In this case (1.5) is called the parabolic Anderson model or the multiplicative stochastic heat equation. Here one can explicitly write the solution to (1.5) as an infinite series involving multiple stochastic integrals (the Wiener chaos expansion). To illustrate, suppose that , then one can check that
[TABLE]
where and . Similarly also has an infinite series expansion involving sums of products of the random variables . The approach in [1] and [4] shows the weak convergence of to by showing the convergence of the individual terms in the infinite series expansions. Both these methods are specific and do not work in general, even for a slight perturbation of . The approach that we take to address the general case has been inspired by [22], where discrete approximations to stochastic differential equations were considered. We point out that as opposed to weak approximations, there are quite a few papers concerned with strong approximations of the stochastic heat equation (see [13], [16], [18], [19], [21]).
Our results allow us to give an alternative proof of the weak convergence of the partition function of one dimensional directed random polymers in the intermediate disorder regime, obtained in [1], [4] by different arguments. For each consider the following measure on paths of length , where each :
[TABLE]
The measure is a probability measure on paths of length , of a random walk following the transition probability and starting at [math]. The partition function for the inverse temperature
[TABLE]
is the normalizing constant so that is a probability measure on paths of length ; here is the expectation over paths of the random walk starting at [math]. One is then interested in the behaviour of paths sampled from this new measure . As the temperature becomes large, alternatively becomes small, one expects that the paths behave like that of a random walk with transition matrix . On the other hand as the temperature becomes small the effect of the environment becomes important, since substantially more weight is on paths with larger energy . A big motivation for work in this area is to understand this competition between entropy (from the measure ) and energy . The competition between entropy and energy is characterized by the limit of the martingale
[TABLE]
If the limit is positive a.e. we say that the polymer measure is in weak disorder while it is in strong disorder if the limit is [math] a.e.. The polymer paths are diffusive in the weak disorder regime (see [7]) while we see vastly different behavior in the strong disorder regime. One (as well as two) dimensional polymers are in the strong disorder regime for any , while in higher spatial dimensions, the polymer is in the strong disorder regime only if is large enough. It is believed that for one dimensional walks which are in the domain of attraction of Brownian motion the typical fluctuation of the polymer paths for any would be of order (as opposed to when ), although this question is still open in general. One also expects that has fluctuations of order (as opposed to [math] when ), but this question is also open in general. We refer to [1], [6], [10] for a more detailed exposition on this subject.
In this paper we shall consider (as in [1] and [4]) one dimensional polymers in the intermediate disorder regime, that is when the inverse temperature goes to [math] at a specified rate with . With the choice of the inverse temperature , we observe interesting behavior which we shall detail in the following section.
1.1 Main results
Let us denote by the mean of . Let denote the characteristic function associated to the random walk
[TABLE]
The centered characteristic function is
[TABLE]
We shall make the following assumption on the random walk.
Assumption 1.1**.**
Assume that and that there exists a constant such that
[TABLE]
where as .
Remark 1.2**.**
The first part of the assumption means that the random walk is strongly aperiodic and the assumption on necessarily implies that the centered random walk is in the domain of attraction of the strictly Stable() law.
Remark 1.3**.**
The strong aperiodicity assumption is used in the local limit theorem A.1 below and it should be possible to modify the theorem to remove this assumption. In particular the results of this paper are valid for the simple random walk although it is not strongly aperiodic.
Let us now state the main result of this paper. Let
[TABLE]
We choose our initial profile
[TABLE]
for a continuous . We allow to be random but it should be independent of the white noise and the random field . Our main result is the following
Theorem 1.1**.**
Let the conditions in Assumption 1.1 hold, and fix so that . Let be a continuous (random) function so that , that is independent of and . Let be the solution to (1.4) with initial profile . Then for each we have , where is the solution to (1.5) with initial profile . Furthermore we have
[TABLE]
Remark 1.4**.**
Above and for the rest of the paper we use to denote the expectation over all random quantities involved. For ease of notation, we will often drop the superscript when referring from and .
Remark 1.5**.**
The proof also shows the convergence of all moments of order less than . Our proof can also be used to give an upper bound on the rate of convergence in (1.10). This will of course depend on the initial profile .
Remark 1.6**.**
In the case that one has a factor of with the last term in (1.5).
We next focus on the case that is uniformly bounded, that is . In this situation we can consider more general initial profiles, not necessarily those that are uniformly bounded in . We state a general result for the special case . Let be i.i.d. random variables such that
[TABLE]
We further assume that the random variables are independent of the noise . Let
[TABLE]
We have the following
Theorem 1.2** ().**
Assume that , and the random variables satisfy (1.11) and are independent of and . Let the conditions in Assumption 1.1 hold, and fix such that . Let be the solution to (1.4) with initial profile . Then for each we have , where is the solution to (1.5) with initial profile , a two-sided Brownian motion with variance . Furthermore we have
[TABLE]
Remark 1.7**.**
The above theorem explains the height fluctuations of the harness process obtained in [27]. This is the system Let us consider the simplest case where the initial height profile satisfies , and the increments are i.i.d. random variables with finite moments and mean (say). Then the transformation satisfies (1.4) and the initial profile is of the form above. We then have that n^{-1/4}\cdot\big{[}h_{[nt]}([x\sqrt{n}]-[\mu nt])-\rho_{0}[x\sqrt{n}]\big{]}\Rightarrow v_{t}(x) where , which implies that the harness process is in the Edwards-Wilkinson class; see [9] for an explanation of this class and the related KPZ class.
We next consider the scaled partition function of the directed random polymer in the intermediate disorder regime:
[TABLE]
where is the scaled disorder
[TABLE]
We shall also be interested in the scaled point to point partition function given by
[TABLE]
Note that is the probability of the intermediate disorder polymer to be at at time . We rederive the following theorem from [1] and [4]. An advantage of our method is that we are able to show the convergence of all moments; see the first two statements of the theorem.
Theorem 1.3**.**
Assume that the environment variables are i.i.d. and have exponential moments: for all small . Suppose that . Then the following statements hold.
* converges in distribution to , where with initial profile . Furthermore the moments of converge to those of .* 2. 2.
* converges in distribution to , where with initial profile . Furthermore the moments of converge to those of .* 3. 3.
* converges in distribution to , where and are as above and driven by the same white noise .*
Remark 1.8**.**
One can show that .
The first statement in the theorem implies that has fluctuations of order in the intermediate disorder regime. The last statement says that the polymer paths have fluctuations of order and satisfy a random local limit theorem. Recall that when and the disorder is not scaled one expects fluctuations for and fluctuations for the polymer paths.
We end this section with a brief description of the notation and outline of the paper. We will use indices for discrete time, for discrete space. The letters will usually be used for continuous time ,while the variables will usually be used for continuous space . The notation is used to denote probability and expectation for the random variables , while will be used for the probability and expectation over paths of random walks with transition kernel . As already indicated will be used when we integrate out all the random variables involved. We will denote by the norm in . Constants could vary from line to line. We will use the notation to represent constants which remain constant within a proof but might change across different proofs.
In Section 2 we prove existence and uniqueness for solutions to (1.4), and obtain moment bounds required in the proof of Theorem 1.1 Section 3 deals with the proof of our main Theorem 1.1. Section 4 deals with the proof of Theorem 1.2. In Section 5 we provide conditions for tightness to hold in Theorems 1.1 and 1.2. The proof of the first statement in Theorem 1.3 is in Section 6. Section 7 is devoted to several extensions including addition of a drift in (1.4), Dirac initial condition in (1.4) and the proofs of the last two statements in Theorem 1.3. The appendices contain a local limit theorem crucial for our arguments, as well as bounds needed for the proof of Proposition 3.5.
2 Existence and Uniqueness
In this section we show the existence and uniqueness of solutions to (1.4). The ideas are quite standard, see for example [15], but we present them here to show that we can obtain moment bounds which are uniform in . We shall need these in the proof of Theorem 1.1. Recall that a solution to (1.4) satisfies
[TABLE]
We call the second term the noise term because it depends on the noise . We call the first term the non-noise term although the initial profile is allowed to be random.
Theorem 2.1**.**
For initial profiles to (1.4) with , the following statements hold.
There exists a unique solution to equation (1.4) such that for any
[TABLE]
Furthermore the above bound holds uniformly in . 2. 2.
Let and suppose further that has moments. Assume also that \sup_{k,n}\mathbf{E}\big{|}u_{0}(k)\big{|}^{2m}<\infty. Then for any the following holds uniformly in
[TABLE]
Proof.
We use Picard’s iteration scheme to show the existence of a solution which satisfies the required bound. Therefore let and let
[TABLE]
In the case the convergence of the Picard iterates is immediate. So let us assume that . From the above one has
[TABLE]
For a parameter define
[TABLE]
The above inequality gives
[TABLE]
where and are independent walks with the same transition probabilities . We now choose to be large enough
[TABLE]
The reason we can choose such a is that
[TABLE]
uniformly in by Corollary A.2 (see Appendix A) applied to ; a similar statement to Theorem A.1 holds for . From here it follows that for this choice of
[TABLE]
uniformly over . This implies the convergence of to a random field uniformly in the interval . One can then argue that satisfies (1.4) in this region by considering the limit of both sides of (2.4). To prove uniqueness assume that there are two solutions and to (1.4) satisfying (2.2). Then
[TABLE]
As before multiply both sides by , and take supremum over and to get a relation similar to (6.2). If we choose to be large enough we get for , thus proving uniqueness.
Let us now prove the second part of the theorem. An application of Burkholder’s inequality for discrete time martingales gives
[TABLE]
the last step follows from Minkowski’s inequality. Now
[TABLE]
Clearly are independent of . Consider now the expectation of each product term in the above expression. Our assumption that implies that the only product terms which have a nonzero contribution contain only powers of greater than , for any . Thus the above can be bound by a constant multiple of
[TABLE]
Note also by Holder’s inequality, for any
[TABLE]
We also need the following
[TABLE]
which follows from the relation valid for any and nonnegative sequences . These observations imply that (LABEL:eq:mom:4m) can be bound by a constant multiple of
[TABLE]
If we plug this in (LABEL:eq:u:m) one gets
[TABLE]
Now, for a fixed let
[TABLE]
Our calculations above give us
[TABLE]
By choosing a large enough so that
[TABLE]
one can obtain (2.3) by a recursive application of (2.10). This completes the proof of the theorem. ∎
3 Proof of Theorem 1.1
Fix so that . For each we shall construct a probability space containing copies of the random variables , the white noise , and the initial profile independent of and so that
[TABLE]
This would imply the weak convergence stated in the main theorem.
We first prove the result under the assumption that the initial profile . This shall be relaxed later. The proof will involve several steps which we divide into subsections. To find an upper bound on the rate of convergence in (3.1) we shall need to optimize several quantities over such that
[TABLE]
3.1 Adding over blocks
In the first step we apply a coarse graining procedure and compare the solution to (1.4) to a random field on a coarser lattice. Each site in the coarser lattice would correspond to a block of size in the original lattice. More precisely site (we shall use the first coordinate for time and the second for space) in the coarser lattice will correspond to
[TABLE]
in the original lattice. For define
[TABLE]
where the random variables are obtained by summing the contribution of the scaled variables in these blocks:
[TABLE]
Here again we are suppressing the dependence of on in the notation.
The main result of this section is the following theorem where we show that is close to . Choose and such that
[TABLE]
Recall the definition of in (1.8).
Theorem 3.1**.**
Fix . We have for
[TABLE]
Proof.
The proof is obtained by combining Propositions 3.4 and 3.5 below, and our restrictions on the parameters and in (3.2). ∎
We first recall a few facts about general one-dimensional recurrent random walks which we shall apply to , where and are independent random walks with transition probabilities . Note first that the random walk is symmetric. The potential kernel of the random walk is the function where
[TABLE]
One can show that the function exists (Propositions 28.5 and 28.8 in [28]) and further that is increasing in for each . We will need the following lemmas.
Lemma 3.2**.**
The potential kernel of the walk satisfies
[TABLE]
Proof.
The characteristic function of is . By Fourier inversion
[TABLE]
By the assumptions on the characteristic function , we have the bound
[TABLE]
for . Therefore uniformly in
[TABLE]
which is of order . ∎
Lemma 3.3**.**
The following holds
[TABLE]
The above also holds when is replaced by and by [math].
Proof.
Let us denote
[TABLE]
One can check that for any
[TABLE]
for some constant . Thus
[TABLE]
By our assumption (1.7) on the characteristic function one can argue that for we have for all and further
[TABLE]
This of course implies \sup_{n}E\big{[}|R_{n}|^{\delta}\big{]}<\infty. We can choose to complete the proof of (3.7). We have a similar bound for with replaced by [math] because the characteristic function of satisfies (1.7) with replaced by . ∎
The following should be thought of a Hölder continuity estimate for . It is the first step in our proof of Theorem 3.1.
Proposition 3.4**.**
We have the following bound valid for
[TABLE]
Proof.
We write
[TABLE]
An application of Burkholder’s inequality along with the bound in Theorem 2.1 then gives
[TABLE]
For fixed time and spatial point the sum where and are independent random walks. Therefore the above is
[TABLE]
As already observed in (2.7) one can use Corollary A.2 with to get
[TABLE]
We now proceed with (LABEL:eq:u:holder). An application of the Markov property gives
[TABLE]
where and , and both are independent of the walks and . Thus
[TABLE]
In the second last inequality we use Lemma 3.2 and Lemma 3.3. Note also that here we have used that from (3.2). This completes the proof of the proposition. ∎
To complete the proof of Theorem 3.1 we estimate the difference between and in the following
Proposition 3.5**.**
Fix . We have for .
[TABLE]
Proof.
Recall the definition of the random field in (3.4). By writing the variables as the sum of the variables in the blocks we can write the above difference as
[TABLE]
At this stage it is convenient to split in the first term as
[TABLE]
An application of Burkholder’s inequality then gives
[TABLE]
where the limits of the sums are just as earlier; we did this to simplify notation. The first term in the last line is a bound on the first term of the previous step, and this comes from the Hölder continuity estimate in Proposition 3.4 and (2.7). From this point on we focus on the last term in (3.13) which can be written as
[TABLE]
In order to keep the exposition simpler for the reader we bound the above two terms in Lemmas B.4 and B.5 in Appendix B. The proof of this Proposition is complete once we take into account our restrictions on and in (3.2). ∎
3.2 Coupling with white noise
Here we approximate in the block by replacing it by white noise in the rescaled box
[TABLE]
It is this step which allows us to construct copies of and on the same space as in (3.1). For this we shall need the following coupling theorem.
Theorem 3.6**.**
We can construct random variables and and a white noise on the same probability space such that the following hold
* is distributed as and is distributed as .* 2. 2.
For fixed , we have that \big{(}W^{\prime}_{j}(l),\,\zeta^{\prime}_{j}(l)\big{)} is independent of the collection where
[TABLE] 3. 3.
For any real number such that and we have
[TABLE]
Proof.
Parts 1 and 2 can be proved similar to Lemma 1 in [22]. Start with a white noise on constructed on some space . Here form a basis of . Denote by the distribution of and let denote the standard normal distribution. Define the inverse function of by
[TABLE]
For a standard normal random variable we have and therefore
[TABLE]
This gives a way of coupling and . We now construct the random variables in the box as follows. Define probability measures on and on by
[TABLE]
Since and are absolutely continuous with respect to the probability measure we have
[TABLE]
for measurable functions and . We now construct the probability measure on by
[TABLE]
We have constructed random variables corresponding to the box . To construct for the whole space we consider i.i.d copies of . For any the ’s, ’s constructed for the ’th copy would correspond to . Similarly the white noise for the ’th copy would correspond to . The white noise on is created by “gluing” together the white noise in the different boxes. To be precise for any define
[TABLE]
One can check that this is a white noise on .
Let us next prove part 3 of the theorem. Denote by the distribution function of the random variable
[TABLE]
Note that this random variable has mean [math] and variance since we have normalized by the square root of the size of the box . We shall compare with
[TABLE]
which has the standard normal distribution. A modified version of the Berry Essen theorem (see [3], [26], [25]) gives
[TABLE]
As observed in [12] for
[TABLE]
The inequality (3.16) follows easily from this. ∎
Remark 3.1**.**
From this point on in the proof of Theorem 1.1 we shall assume that we are working on the probability space constructed above. We shall also ignore the superscript in the random fields and .
In the next approximating process we replace by the coupled white noise. Again the approximating process will be defined on the coarser lattice. Let
[TABLE]
We show now that is close to .
Proposition 3.7**.**
Fix any . We have uniformly for
[TABLE]
Proof.
By our assumption (3.2) on and we have . Therefore by Burkholder’s inequality and (3.16)
[TABLE]
the last line follows because of (2.7) and the fact that second term on the second line of (3.13) goes to [math]. ∎
3.3 Replacing transition probabilities by the Stable() density
Next we replace the random walk kernel by the transition density for the Stable() process. We can choose integers such that
[TABLE]
and on the coarser lattice for let
[TABLE]
the second line following from the first by a simple translation. Note that the sum over runs from [math] to ; this avoids the singularity of at .
Proposition 3.8**.**
Fix . We have uniformly for
[TABLE]
Proof.
Indeed using the first expression in (3.18)
[TABLE]
Let us focus on the second term. This is equal to
[TABLE]
Firstly taking in Theorem A.1 (see Appendix A)
[TABLE]
Also we control
[TABLE]
For the Riemann sum approximation of the density we need which we have assumed in (3.2). ∎
3.4 Comparison with the stochastic heat equation
In this last step we compare our approximation with the stochastic heat equation (1.5) with initial profile [math], with respect to a white noise constructed as
[TABLE]
Thus is constructed from by spatially shifting the noise in the time region \big{[}j[n^{\theta}]/n,(j+1)[n^{\theta}]/n\big{)} by . One can check that is a white noise as follows. For write
[TABLE]
It now follows that is Gaussian with mean [math] and for . The unique solution to (1.5) with this white noise when the initial profile is [math] is
[TABLE]
The following Hölder continuity estimate will be very useful for us.
Lemma 3.9** ([14]).**
For any
[TABLE]
Remark 3.2**.**
For general initial profile , the solution is obtained by adding to the right hand side of (3.19). Then the conclusion of Lemma 3.9 is valid only for the noise term (3.19).
We shall first consider a spatial discretisation of as in [13] and [21], and divide space into intervals of length . Let
[TABLE]
where
[TABLE]
One has the following
Lemma 3.10**.**
We have
[TABLE]
Proof.
In [13] and [21] we consider the integral in (3.20) from [math] to . Here we are looking at a smaller interval since . If one looks at the arguments in those papers we see we have to control which gives the bound in (3.21). ∎
Next we discretise in time. Let
[TABLE]
for and [math] otherwise, where
[TABLE]
We will need the following lemma
Lemma 3.11** ([23]).**
We have
[TABLE]
Proof.
For one just has to look at estimate (3.7) in [23]. In this case the inequality is true even without the factor of in the right. For this is just a simple computation involving the Gaussian density. ∎
The following gives an error bound on the temporal discretisation of .
Lemma 3.12**.**
We have for any and satisfying (LABEL:eq:r:z)
[TABLE]
Proof.
Using the temporal Hölder continuity supplied in Lemma 3.9
[TABLE]
The first term in (LABEL:eq:vbar-vn) can be bound by
[TABLE]
Using Lemma 3.11 we can bound the second term in (LABEL:eq:vbar-vn) by
[TABLE]
We have again used for the Riemann sum approximation in removing the sum in the third last line. This completes the proof of the lemma. ∎
We now collect all our estimates in this section to give a bound on the difference between and .
Theorem 3.13**.**
Suppose that , and fix . We then have the following uniform bound valid for
[TABLE]
Proof.
Recall from (LABEL:eq:r:z). Firstly we apply Minkowski’s inequality
[TABLE]
By Lemmas 3.10 and 3.12 of this subsection as well as Propositions 3.5, 3.7, 3.8 we can bound the above by
[TABLE]
An application of Gronwall’s inequality then gives
[TABLE]
We can now use the Hölder continuity esitmates in Lemma 3.9 and Proposition 3.4 to conclude the proof. ∎
We now complete the proof of Theorem 1.1.
Proof of Theorem 1.1.
The case of has already been covered. For general , the solution to (1.5) with white noise is
[TABLE]
We have already shown in (3.24) that the difference of the noise terms above and in (2.1) goes to [math] in norm. It is thus enough to show that the difference of the of the non-noise terms goes to [math], or
[TABLE]
To see this first note that the expression inside goes to [math] almost surely by the weak convergence of the centered to a Stable() random variable, and by the smoothing properties of the Stable() density. The dominated convergence theorem can then be applied to show the above. This then implies
[TABLE]
and the conclusion of the theorem follows from Lemma 3.9 and Proposition 3.4. ∎
4 Proof of Theorem 1.2
In order to couple the initial profile with a Brownian motion we need the following proposition whose proof is similar to that of Theorem 3.6. For let
[TABLE]
We have
Proposition 4.1** ([22]).**
For each , one can construct a copy of and a two sided Brownian motion with variance on a probability space, so that for with
[TABLE]
We can now provide the
Proof of Theorem 1.2.
A look at the proof of Theorem 2.1 tells us that existence and uniqueness of holds. However the bounds in (2.2) and (2.3) do not hold uniformly in , although it is easily checked that it continues to hold for the noise term. Let us show instead that the norm of the non-noise term is finite. We split
[TABLE]
Therefore
[TABLE]
which is finite thanks to our assumptions (1.11) and Lemma 3.3.
A careful reader of the proof of Theorem 1.1 would have observed that our assumption of bounded initial profile was only needed so that we had
[TABLE]
While this does not hold for general Lipschitz if is unbounded in , it clearly holds in the case is bounded uniformly in . Therefore to complete the proof of Theorem 1.2 we just need to approximate the non-noise term. As before we divide the proof into several steps.
Step 1: First we collect the initial random variables into groups of size . Thus
[TABLE]
Consider the second term in the right hand side. The expression in square brackets is independent of . Thus using Burkholder’s inequality one gets the following bound for of the second term
[TABLE]
where one uses the Cauchy-Schwarz inequality to obtain the last step. This goes to [math] as since .
Step 2: Next we replace the sum over in blocks by the Brownian motion constructed in Proposition 4.1.
[TABLE]
where the limits of the sum are as in the first line. Using the Cauchy-Schwarz inequality we bound of the second term as follows
[TABLE]
We now use (4.1) to bound this
[TABLE]
where we used Lemma 3.3 in the last line. The above also goes to [math] with .
Step 3: As in the proof of Theorem 1.1 we now substitute the transition probability of the random walk by the heat kernel.
[TABLE]
We now bound of the second term on the right hand side. This gives an upper bound of
[TABLE]
where we used the Cauchy-Schwarz inequality along with the bound \big{\|}B_{\frac{k[n^{\theta^{\prime}}]}{n}}\big{\|}_{2m}^{2}\lesssim\frac{kn^{\theta^{\prime}}}{n}. Fix such that . Continuing with our bound (LABEL:step3) we use Theorem A.1 to get
[TABLE]
A bound for the third term comes from applying Hölder’s inequality along with Chebyshev’s inequality:
[TABLE]
while the second term decays much more quickly because of the exponential decay of the heat kernel.
Step 4: To complete the proof of the theorem it is enough for us to show
[TABLE]
where
[TABLE]
is another Brownian motion. As the reader might have guessed this is just a Riemann sum argument. We just need to bound
[TABLE]
For the first term we use the independence of Brownian increments as well as Burkholder’s inequality to get a bound of
[TABLE]
Next we have the following bound on for the second term in (LABEL:eq:4)
[TABLE]
We have shown that (LABEL:eq:4) goes to [math] with and this completes the proof of Theorem 1.2. ∎
5 Tightness
In this section we discuss the issue of tightness in Theorem 1.1.
Theorem 5.1**.**
Suppose the initial profile satisfies
[TABLE]
for some , all and some
[TABLE]
Then the process is tight in every compact subset of .
Remark 5.1**.**
One can check that the following tightness arguments will hold for Theorem 1.2 with .
We show tightness for the process in the time-space box , although the argument is valid for any compact subset of . Define the modulus of continuity
[TABLE]
Tightness of the process will follow from the following
Lemma 5.2** ([24]).**
Suppose there is a sequence such that the following hold.
There is , and a constant such that for all large enough
[TABLE]
for all and . 2. 2.
For all ,
[TABLE]
for all large .
Then for all there is a such that
[TABLE]
for all large .
Proof of Theorem 5.1.
Let us check the first condition in Lemma 5.2. We use the triangle inequality
[TABLE]
and bound each of the two terms on the right, starting with the first. Recall from (2.1) that the solution is the sum of two terms, and it is enough to look at each of them separately. The contribution of the noise term, the one involving , gives a bound
[TABLE]
The contribution of the non-noise term in (2.1) corresponding to the initial profile is
[TABLE]
We have obtained
[TABLE]
Let us next consider the second term in (5.2). By arguments similar to those in Proposition 3.4 one can see that the contribution from the noise term
[TABLE]
The contribution of the non-noise term
[TABLE]
The last line is because (3.7) also holds with replaced by ; this can be seen from following the proof of Lemma 3.3. We have obtained
[TABLE]
Combining (5.3) and (5.4) we obtain
[TABLE]
when . Because of our condition on the exponent is greater than and this verifies condition 1 in Lemma 5.2.
We next check the second condition in Lemma 5.2. Since we have chosen we just need to consider the maximum over where . There are of the order such points. Thus
[TABLE]
where we used (5.3) and (5.4) for the last line. The above goes to [math] with due to our restriction on . Condition 2 of Lemma 5.2 is verified. ∎
6 Proof of part 1 of Theorem 1.3
Consider the martingale in (1.12) (we ignore the superscript ). By the independence of the environment this has the same distribution as the
[TABLE]
where is the expectation of paths of the random walk starting at . It is therefore enough to prove Theorem 1.3 for . Consider the random field
[TABLE]
Using the Markov property we obtain
[TABLE]
with initial profile
[TABLE]
Although (6.3) is not of the form (1.4) one can check that the statements of Theorem 2.1 hold for ; to see this one could follow the arguments of the proof of Theorem 2.1. Indeed a solution to (6.3) is
[TABLE]
Therefore by Burkholder’s inequality
[TABLE]
thanks to the independence of and , and the first bound in (6.9). This argument shows that we obtain the same moment bounds for as in Theorem 2.1.
To obtain the first part of Theorem 1.3 we shall show that our random field is close to the solution of
[TABLE]
We start with an estimate of the spatial Hölder continuity of .
Lemma 6.1**.**
The following holds
[TABLE]
Proof.
From (2.3) the ’th moment of is bounded for . Therefore one can argue
[TABLE]
The result follows from Lemma 3.2. ∎
Consider now the random field defined as
[TABLE]
with initial profile . The statements of Theorem 2.1 hold for also. We first compare with .
Lemma 6.2**.**
The following holds
[TABLE]
Proof.
We can check that
[TABLE]
Split to get
[TABLE]
The lemma follows by using Gronwall’s inequality and Lemma 6.1. ∎
Next consider the random field given by
[TABLE]
with initial profile . Once again Theorem 2.1 holds for also. We compare with below.
Lemma 6.3**.**
The following holds
[TABLE]
Proof.
We write
[TABLE]
One can check that for any
[TABLE]
Using this we obtain for
[TABLE]
The lemma follows by an application of Gronwall’s inequality. ∎
Finally we compare with in (6.3).
Lemma 6.4**.**
The following holds
[TABLE]
Proof.
We now have
[TABLE]
Burkholder’s inequality along with the first inequality in (6.9) shows
[TABLE]
We can now use Gronwall’s inequality to complete the proof. ∎
Our main result Theorem 1.1 shows that of (6.5) converges in distribution to of (1.5) with and initial profile . Furthermore the moments of converges to that of . As a consequence of the previous lemmas and the discussion at the beginning of this section, the same holds for . This completes the proof of Theorem 1.3. ∎
7 Extensions
7.1 Addition of a drift term
We describe in this section an approximation to
[TABLE]
where is also Lipschitz continuous with Lipschitz coefficient . Recall that we scale time by , which suggests an approximation of the form
[TABLE]
We have the following
Theorem 7.1**.**
Let the conditions in Assumption 1.1 hold, and fix an integer so that . Let be a continuous (random) function so that that is independent of and . Let be the solution to (7.2) with initial profile . Then for each we have , where is the solution to (7.1) with initial profile . Furthermore we have
[TABLE]
The proof of this theorem follows closely the proof of Theorem 1.1. We provide the outline of the proof and leave the details to the reader. We start by checking that the conclusions of Theorem 2.1 continue to hold. Using the Cauchy-Schwarz inequality for the drift term we can check that the Picard iterates in (2.4) satisfy
[TABLE]
With defined as in (2.5) we obtain now
[TABLE]
We now choose large enough so that we obtain
[TABLE]
As before we can show the existence of a unique solution to (7.2) satisfying (2.2). The second statement in Theorem 2.1 follows along similar lines.
As in the proof of Theorem 1.1 we start with initial profile , and impose conditions on and as in (3.2). Our definition of in (3.4) now includes an additional term
[TABLE]
We first consider Proposition 3.4. With the addition of the drift in (7.2) the difference between and gives extra terms
[TABLE]
The norm of the first term is of order , while that of the second term is of order
[TABLE]
Fix any . We can remove the terms from to with an error of order . Also due to Lemma B.3 we might restrict to up to a vanishing error in . Now consider the above expression over the range and . By Theorem A.1 for each and in the above ranges the difference goes to [math] with . Moreover we can bound the difference by the sum and get an expression which is uniformly bounded in . Therefore by the dominated convergence theorem and the arbitrariness of we can conclude that the above expression goes to [math] with .
Before we move on to Proposition 3.5 we state a lemma we shall need.
Lemma 7.2**.**
Suppose additionally that . Then for we have
[TABLE]
Proof.
We shall use Theorem A.1 for and a large deviation estimate for . One checks that
[TABLE]
For one can use the result in [20] along with Lemma B.2 to conclude
[TABLE]
One can use Chebyshev’s inequality to get a similar bound when . ∎
We now move on to Proposition 3.5. The norm of the difference of and arising from the drift terms in (7.2) and (7.3) is bound above by
[TABLE]
The first term goes to [math] by the Hölder continuity of argued above and earlier in Proposition 3.4. The second term also goes to [math] by an argument similar to above. Due to Lemma 7.2, the second expression in (7.4) is uniformly bounded in . Therefore we can apply the dominated convergence theorem to conclude that it goes to [math].
To the random field in (3.17) we add the drift term (7.3). Therefore the difference in Proposition 3.7 arising from the drift terms is [math].
The random field has an additional term
[TABLE]
The bound on the norm arising from the difference of this from the drift term in is
[TABLE]
We claim that the expression is uniformly bounded. Indeed the sum of the terms involving the density is uniformly bounded by a Riemann sum argument. As for the sum of the terms involving the transition kernel , it is also uniformly bounded by Lemma 7.2 if we assume . The convergence of (7.5) to [math] thus follows from the dominated convergence theorem, similar to what was argued earlier in this outline.
Finally we consider
[TABLE]
One can again show that is Hölder continuous in both space and time. We can similarly define the random fields and by including a drift term to each of them. The drift terms we add to the definition of and in equations (3.20) and (3.22) are
[TABLE]
respectively. One can show again that both
[TABLE]
go to [math] with . Collecting all our bounds we can then use (LABEL:eq:final) and apply Gronwall’s inequality to prove Theorem 7.1 in the case . The case of general follows as earlier.
7.2 Dirac initial condition
Consider (1.4) with initial profile
[TABLE]
Our main result of this section is the following
Theorem 7.3**.**
Let the conditions in Assumption 1.1 hold, and suppose for all . Fix an integer so that . Let be the solution to (1.4) with initial profile (7.6). Then for each we have , where is the solution to (1.5) with initial profile . Furthermore we have
[TABLE]
To avoid unnecessarily complicated notation we assume from now on that . The reader can convince theirselves that the proof of the above theorem continues to hold for nonzero .
Due to the singularity of the density at time [math] we can obtain uniform moment bounds on only if we consider a compact time interval away from [math]. More precisely we have the following
Proposition 7.4**.**
Let so that . Fix . We have
[TABLE]
uniformly in .
Proof.
Burkholder’s inequality gives for
[TABLE]
If we apply the above recursively we obtain
[TABLE]
Suppose for some and . One can follow the argument in Section 7 of [4] to deduce that each of the terms converge to the corresponding iterated integrals involving the density . For example the second term on the right would converge to
[TABLE]
Furthermore the tail sums are uniformly bounded in .
Now when we consider the moments of the parabolic Anderson model we get an infinite series involving the integrals of . Therefore we can bound the moments of from above by the moments . Equation (4.15) in [5] states that the moments of are uniformly bounded in a compact subset of . We next show that the iterated integrals of above are maximized when . Indeed let us consider (7.8). By Plancheral’s theorem this is equal to a constant mulitple of
[TABLE]
where denotes the Fourier transform in the spatial variable. Using for functions and one obtains
[TABLE]
A similar argument holds for the higher iterated integrals. Therefore the moments of are in fact uniformly bounded in a compact subset of time. This proves the proposition. ∎
Proof of Theorem 7.3.
We aim to use the arguments in the proof of our main theorem 1.1 here also. For this we remove some terms from the noise term since the moments of are “large” near time [math]. Let us make this more precise. Fix a small . We claim that the random field
[TABLE]
is close to
[TABLE]
in the sense that \big{\|}v_{t}(x)-\tilde{v}_{t}(x)\big{\|}_{2m}^{2}\to 0 as . Note that
[TABLE]
which when iterated gives an infinite series which can be shown to be finite, see [5]. In particular
[TABLE]
from which our claim follows. Similarly
[TABLE]
is close to
[TABLE]
in the sense \sup_{n}\big{\|}u_{[nt]}([xn^{-1/\alpha}])-\tilde{u}_{[nt]}([xn^{-1/\alpha}])\big{\|}_{2m}^{2}\to 0 as . This can be seen by following the argument in Proposition 7.4.
Note first that as . Away from time the moments of and are uniformly bounded, that is for each and uniformly in
[TABLE]
Thus the arguments in the proof of Theorem 1.1 can be used and we can show that
[TABLE]
The theorem follows from this. ∎
7.3 Proof of parts 2 and 3 in Theorem 1.3
Consider now the random field
[TABLE]
This satisfies (6.3) but now with initial profile
[TABLE]
Following the argument in Section 6 we show that is close to the random field in (6.5) with initial profile
[TABLE]
We start with a bound on the Hölder continuity of . One checks
[TABLE]
If we iterate this we obtain
[TABLE]
From this one can show
Lemma 7.5**.**
For each
[TABLE]
Proof.
It is enough to show that \big{\|}u_{i}(k)-u_{i}(l)\big{\|}_{2m}^{2}=o(1) for fixed and when . The arguments in Section 7 of [4] show that the non vanishing contribution to (LABEL:eq:u:k-l:2) come from terms where the successive differences , are of order . Due to Theorem A.1 n^{2/\alpha}\big{[}\mathrm{P}_{i}(k,y)-\mathrm{P}_{i}(l,y)\big{]}^{2} is of order if is of order . This along with the uniform moment bound of obtained from (7.7) is enough to prove the lemma. ∎
As before we compare first with where now satisfies (6.6) but with initial profile . We have
Lemma 7.6**.**
We have
[TABLE]
Proof.
We iterate (6.7), so as to obtain a bound which does not involve the difference between and . Thus we obtain
[TABLE]
Consider the ’th sum in above. We have the trivial bound
[TABLE]
If we replace this bound in each of the terms in (7.11), and then use the bound (7.7) we get an expression which is similar to the expression on the right hand side of (7.7) minus the first term. Using arguments similar to those in [4] one can conclude that this expression is uniformly bounded. Moreover each of the terms in (7.11) go to [math], thanks to Lemma 7.5 and the arbitrariness of . Thus, by the dominated convergence theorem the lemma is proved. ∎
Next we compare with in (6.8) with initial profile . We have
Lemma 7.7**.**
We have
[TABLE]
Proof.
We now have
[TABLE]
Again one can check that the moment bounds of satisfy a relation similar to (7.7). Iterating the above proves the lemma. ∎
Finally we compare with
Lemma 7.8**.**
We have
[TABLE]
Proof.
We now have
[TABLE]
Iterating the above proves the result. ∎
Thanks to Theorem 7.3 we know that converges in distribution to , where and . The above lemmas then imply that the same holds for and consequently for . The second part of Theorem 1.3 follows by a spatial translation of the noise.
Denote be the solution to (6.5) with initial profile and let be the solution to (6.5) with initial profile . Using the same probability space constructed in Theorem 3.6 for both and we see that \big{(}u^{(1)}_{n}([xn^{1/\alpha}]),u^{(2)}_{n}([xn^{1/\alpha}])\big{)} converges jointly in distribution to . Because of the previous lemmas the same holds for \big{(}w^{(1)}_{n}([xn^{1/\alpha}]),w^{(2)}_{n}([xn^{1/\alpha}])\big{)}, where solves (6.2) and solves (7.9). By time reversal and independence of the random variables we can conclude that \big{(}M_{n}^{(\tilde{\xi})},\,n^{1/\alpha}M_{n}^{(\tilde{\xi},\,[xn^{1/\alpha}]}\big{)} converges jointly to . The third statement of Theorem 1.3 follows from this and a spatial translation of the noise since is simply the ratio of and . ∎
7.4 Some more extensions
There are a few other possible extensions of the above results.
The conditions on can be relaxed. For example it is not necessary for them to be i.i.d.. It is enough that they are independent and satisfy conditions in (LABEL:cond:xi) for the weak convergence in Theorem 1.1 to hold, see for example [3]. However we need to assume stronger conditions to obtain convergence of high moments. 2. 2.
We could relax the conditions on , for example we could take them to be independent with slowly varying variances (as in equation 2.10 in [2]). The arguments in Section 4 show that the contributions of the noise and initial configuration can be treated separately, so one could even consider correlated as in [27]. 3. 3.
With some work, it should be possible to extend Theorem 7.3 to more general compactly supported measures. 4. 4.
We can consider weak approximations of the stochastic heat equation with spatially colored noise; depending on the noise these can exist in higher dimensions (see [13]). The key ingredient in the proof of our main theorem 1.1 was the coupling theorem 3.6. A starting point would be Section 3.1 of [8] where a large family of colored noises are constructed by convoluting appropriate functions with white noise. These include Riesz noises (see [13]) which interpolate between spatially smooth noise and white noise. The construction also suggests that one should then consider random variables of the form for appropriate as our discrete noise; here are i.i.d. random variables satisfying (LABEL:cond:xi) as before. We could also consider temporally correlated noise. One could then apply the above to study intermediate disorder regimes for directed polymers in a correlated random environment.
Appendix A Appendix: A local limit theorem
The following local limit theorem is a modification of Proposition 3.3 in [13]. Below is the density for the Stable() process with generator .
Theorem A.1**.**
Suppose Assumption 1.1 holds. Then for any and any
[TABLE]
uniformly for .
Proof.
To simplify notation we denote . We first bound the expression on the left for . Fourier inversion gives
[TABLE]
and
[TABLE]
Therefore
[TABLE]
The first term can be bound just as term in Proposition 3.3 of [13], and we get
[TABLE]
For the second term we split the region of integration depending on whether or not is in
[TABLE]
Our assumptions on the characteristic function imply
[TABLE]
on , and so it follows as in [13] that
[TABLE]
We next need to consider the above integrand over the region . First observe
[TABLE]
It is easy to see that [nt]\mathcal{D}\big{(}z/n^{1/\alpha}\big{)} is bounded on , and so
[TABLE]
We thus have the required bound in the case . In the case of a general such that we have
[TABLE]
This completes the proof. ∎
The local limit theorem has the following consequence.
Corollary A.2**.**
Let Assumption 1.1 hold. Suppose is an integer valued sequence such that
[TABLE]
as for some constant . Then for fixed
[TABLE]
Proof.
We write
[TABLE]
Use the scaling property: for any
[TABLE]
to write the above as
[TABLE]
The rest is a Riemann sum approximation. ∎
Appendix B Appendix: Bounds required for Proposition 3.5
The main results of this section are Lemmas B.4 and B.5 which were needed in the proof of Proposition 3.5.
Lemma B.1**.**
The following holds
[TABLE]
Proof.
The first inequality is a simple consequence of the fact that
[TABLE]
for any , since the characteristic function of is . As for the finiteness of the sum observe that is decreasing with and therefore
[TABLE]
the last inequality following from (2.7). ∎
It follows from Assumption 1.1 on the characteristic function that the distributions function of belongs to the domain of normal attraction of the symmetric stable law with exponent (see section 35 of [17]). This means that one has to scale the centered by a constant multiple of . One can characterize such distribution functions.
Lemma B.2** ([17]).**
A necessary and sufficient condition for to be in the domain of normal attraction of a Stable() law, , is the existence of contants such that
[TABLE]
where the functions and satisfy
[TABLE]
We use the above lemma to deduce the following large deviation estimate.
Lemma B.3**.**
There exists a constant such that
[TABLE]
Proof.
For we use the result in [20]. This gives for
[TABLE]
from the previous lemma. For , one can use Exercise 3.3.19 in [11] to conclude that has second moments. One then uses Chebyschev’s inequality to prove the lemma. ∎
For the rest of this section we provide the proofs of the bounds required for the two terms in (LABEL:eq:2*). Below is the bound on the first term.
Lemma B.4**.**
The first term in (LABEL:eq:2) has the bound*
[TABLE]
where the limits of the summation are as in (LABEL:limit).
Proof.
We write
[TABLE]
the equality holding by an application of the Markov property. We focus on the second term for now, we will return to the first term later. We split the sum according to whether or not. The term is of order . For the term corresponding to replace by by using Theorem A.1 with to obtain
[TABLE]
Returning to our expression (B.1) we now can write it as
[TABLE]
where we have used Lemma B.1 and Lemma B.3 with the constant from there. Let us now consider each of the first two terms above. Using Theorem A.1 as well as (A.2) one gets for the first term
[TABLE]
where is the transition kernel of . Using Theorem A.1 again with , we get for
[TABLE]
Collecting all our estimates and recalling our conditions (3.2) on and completes the proof. ∎
We next bound the second term in (LABEL:eq:2*).
Lemma B.5**.**
The second term in (LABEL:eq:2) has the bound*
[TABLE]
where the limits in the summation are as in (LABEL:eq:2).*
Proof.
We separate out the term and obtain
[TABLE]
The reason for the error bound for the first term is that restricting forces , and the number of terms in the summation over and is of order . As in Lemma B.4 we split the sum over according to whether or not. In the case we use Theorem A.1 with . Thus the above is
[TABLE]
To bound the last term we ignore the difference in the expression and instead bound the sum. By Lemmas B.1 and B.3
[TABLE]
and
[TABLE]
where we used Lemma B.1 in the last step. Collecting our estimates completes the proof. ∎
Acknowledgements: The author expresses his gratitude to Davar Khoshnevisan for providing the proof of Lemma 3.3. He also thanks David Applebaum for comments on an earlier version of the paper. Partial support from EPSRC through grant EP/N028457/1 is gratefully acknowledged.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Tom Alberts, Konstantin Khanin, and Jeremy Quastel. The intermediate disorder regime for directed polymers in dimension 1 + 1 1 1 1+1 . Ann. Probab. , 42(3):1212–1256, 2014.
- 2[2] Márton Balázs, Firas Rassoul-Agha, and Timo Seppäläinen. The random average process and random walk in a space-time random environment in one dimension. Comm. Math. Phys. , 266:499–545, 2006.
- 3[3] A. A. Borovkov. On the rate of convergence for the invariance principle. Theory of Probability and its applications , 18:207–225, 1973.
- 4[4] Francesco Caravenna, Rongfeng Sun, and Nikos Zygouras. Polynomial chaos and scaling limits of disordered systems. J. Eur. Math. Soc. (JEMS) , 19(1):1–65, 2017.
- 5[5] Le Chen and Robert C. Dalang. Moments, intermittency and growth indices for the nonlinear fractional stochastic heat equation. Stoch. Partial Differ. Equ. Anal. Comput. , 3(3):360–397, 2015.
- 6[6] Francis Comets. Directed polymers in random environments , volume 2175 of Lecture Notes in Mathematics . Springer, Cham, 2017. Lecture notes from the 46th Probability Summer School held in Saint-Flour, 2016.
- 7[7] Francis Comets and Nobuo Yoshida. Directed polymers in random environment are diffusive at weak disorder. Ann. Probab. , 34(5):1746–1770, 2006.
- 8[8] Daniel Conus, Mathew Joseph, Davar Khoshnevisan, and Shang-Yuan Shiu. On the chaotic character of the stochastic heat equation, II. Probab. Theory Related Fields , 156(3-4):483–533, 2013.
