Sparse Monte Carlo method for nonlocal diffusion problems
Dmitry Kaliuzhnyi-Verbovetskyi, Georgi S. Medvedev

TL;DR
This paper introduces a sparse Monte Carlo combined with discontinuous Galerkin method for efficiently solving nonlocal diffusion equations, proving convergence and analyzing error sources, with numerical validation.
Contribution
It presents a novel sparse sampling approach for nonlocal diffusion problems, reducing computational cost while maintaining accuracy, with rigorous convergence analysis.
Findings
Sparse Monte Carlo method reduces the number of discretization points needed.
Convergence of the proposed numerical scheme is proven and error estimates are provided.
Numerical experiments confirm the theoretical convergence rates.
Abstract
A class of evolution equations with nonlocal diffusion is considered in this work. These are integro-differential equations arising as models of propagation phenomena in continuum media with nonlocal interactions including neural tissue, porous media flow, peridynamics, models with fractional diffusion, as well as continuum limits of interacting dynamical systems. The principal challenge of numerical integration of nonlocal systems stems from the lack of spatial regularity of the data and solutions intrinsic to nonlocal models. To overcome this problem we propose a semidiscrete numerical scheme based on the combination of sparse Monte Carlo and discontinuous Galerkin methods. An important feature of our method is sparsity. Sparse sampling of points in the Monte Carlo approximation of the nonlocal term allows to use fewer discretization points without compromising the accuracy. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Mathematical Modeling in Engineering · Differential Equations and Numerical Methods · Differential Equations and Boundary Problems
Sparse Monte Carlo method for nonlocal diffusion problems
Dmitry Kaliuzhnyi-Verbovetskyi and Georgi S. Medvedev
Department of Mathematics, Drexel University, 3141 Chestnut Street, Philadelphia, PA 19104; [email protected]
Abstract
A class of evolution equations with nonlocal diffusion is considered in this work. These are integro-differential equations arising as models of propagation phenomena in continuum media with nonlocal interactions including neural tissue, porous media flow, peridynamics, models with fractional diffusion, as well as continuum limits of interacting dynamical systems. The principal challenge of numerical integration of nonlocal systems stems from the lack of spatial regularity of the data and solutions intrinsic to nonlocal models. To overcome this problem we propose a semidiscrete numerical scheme based on the combination of sparse Monte Carlo and discontinuous Galerkin methods. Our method requires minimal assumptions on the regularity of the data. In particular, the kernel of the nonlocal diffusivity is assumed to be a square integrable function and may be singular or discontinuous. An important feature of our method is sparsity. Sparse sampling of points in the Monte Carlo approximation of the nonlocal term allows to use fewer discretization points without compromising the accuracy. For kernels with singularities, more points are selected automatically in the regions near the singularities.
We prove convergence of the numerical method and estimate the rate of convergence. There are two principal ingredients in the error of the numerical method related to the use of Monte Calro and Galerkin approximations respectively. We analyze both errors. Two representative examples of discontinuous kernels are presented. The first example features a kernel with a singularity, while the kernel in the second example experiences jump discontinuity. We show how the information about the singularity in the former case and the geometry of the discontinuity set in the latter translate into the rate of convergence of the numerical procedure. In addition, we illustrate the rate of convergence estimate with a numerical example of an initial value problem, for which an explicit analytic solution is available. Numerical results are consistent with analytical estimates.
1 Introduction
We propose a numerical method for the initial value problem (IVP) for a nonlinear heat equation with nonlocal diffusion
[TABLE]
For analytical convenience, we take as a spatial domain. Throughout this paper, when the domain of integration is not specified, it is assumed to be . Further, , , is a bounded measurable function on , which is Lipschitz continuous in continuous in and integrable in and is a Lipschitz continuous function on
[TABLE]
for all Throughout this paper we assume
[TABLE]
This assumption may be dropped if an apriori estimate on for is available. Furthermore, the analysis below applies to models with the interaction function of a more general form provided
[TABLE]
However, we keep to emphasize the connection to diffusion problems.
Equation (1.1) is a nonlocal diffusion problem. It arises as a continuum limit of interacting particle systems [24, 17]. Equations of this form are used for modeling population dynamics [31, 7, 30, 2, 6], neural tissue [8], porous media flows [10, 11], and various other biological and physicochemical processes involving anomalous diffusion [32, 4]. The key distinction of the evolution equations with nonlocal diffusion from their classical counterparts is the lack of smoothening property. A priori the solution of (1.1), (1.2) is a square integrable function in for all [24] and it may not possess much more regularity beyond that, unless the initial data and kernel are smooth [21, Theorem 3.3]. The lack of smoothness is a serious challenge for constructing numerical schemes for (1.1), (1.2) and for analyzing their convergence. All deterministic quadrature formulas require at least piecewise differentiability for a guaranteed convergence rate. The problem is even more challenging in high dimensional spatial domains. The main idea underlying our approach is to use the Monte Carlo approximation of the nonlocal term in (1.1). We take advantage of the essential feature of the Monte Carlo method: the independence of the convergence rate on the regularity of the integrand. The second key idea is sparsity, whose use is twofold: First, sparse sampling of points in the Monte Carlo method is used to minimize computation without compromising the accuracy. For with jump discontinuity across Lipschitz hypersurfaces, the use of sparsity is computationally beneficial starting from . If the discontinuity set has nontrivial fractal dimension, the sparse Monte Carlo method performs better than its dense counterpart already for (cf. Lemma 4.4). Furthermore, sparsity is the key for extending the Monte Carlo method for models with singular kernels (see § 4.1). Not only does it allow to apply the Monte Carlo method for unbounded functions, it also makes it automatically adaptive: more sample points are selected near the singularities. The combination of these ideas together with the discontinuous Galerkin method yields a numerical scheme for the IVP (1.1), (1.2) that performs well under minimal assumptions on the regularity of and initial data.
This paper is based on our previous work on convergence of interacting particle systems on convergent graph sequences [21, 22, 18, 24]. Continuum limit is a powerful tool for studying various aspects of network dynamics including existence, stability, and bifurcations of spatiotemporal patterns [33, 23, 25, 26]. Very often the derivation of the continuum limit is based on heuristic considerations and its rigorous mathematical justification is a nontrivial problem. Recently, motivated by the theory of graph limits [20, 19, 5], we proved convergence to the continuum limit for a broad class of dynamical systems on graphs [21]. Importantly, our proof applies to models on random graphs [22] including sparse random graphs [18, 24]. These results prepared the ground for the numerical method proposed in this paper. There is an intimate relation between the problem of the continuum limit for interacting particle systems and numerical integration of nonlocal diffusion models. Given a continuum model (1.1), one can construct the corresponding particle system, approximating (1.1). This idea had been already mentioned in [21], but has never been detailed. Further, recent results for the continuum limit of coupled systems on sparse graphs indicate a strong potential of sparse discretization for numerical integration of nonlocal problems. It is the goal of this paper to present these ideas in detail.
In the next section, we present a discretization of (1.1), which can be viewed as an interacting dynamical system on a sparse random graph. The structure of the graph is determined by the kernel , which defines the asymptotic connectivity of the graph sequence parametrized by the size of the graph. In the theory of graph limits, such functions are called graphons [19], the term we adopt for the reminder of this paper. In Section 3, we prove convergence of the semidiscrete (discrete in space and continuous in time) approximation of (1.1) and turn to estimating the rate of convergence in Sections 4 and 5. There are two main factors contributing to the error of approximation. The first is due to approximating the nonlocal term in (1.1) by a random sum (Monte Carlo method), while the second is due to approximating the kernel and the initial data by piecewise constant functions (discontinuous Galerkin method). The rate of convergence of the sparse Monte Carlo approximation follows from our previous results [24, Theorem 4.1]. Convergence of piecewise constant approximation in the –norm follows from classical theorems of analysis (cf. the Lebesgue-Besicovitch Theorem [15] or –Martingale Convergence Theorem [34]). However, neither of these theorems elucidates the rate of convergence. In fact, the example in § 4.2 shows that without additional hypotheses the algebraic convergence may be arbitrarily slow. To this end, we study what determines the rate of convergence of piecewise constant approximations for a square integrable function. For Hölder continuous functions the answer is simple (cf. Lemma 4.1). For discontinuous functions, on the other hand, the answer naturally depends on the type of discontinuity. In Section 4, we consider two examples elucidating this issue. The first example is based on a singular (unbounded) graphon. It shows how the information about the singularity translates into the rate of convergence estimate. Here, we also see how to use sparsity to optimize computation. The second example adapted from [21], on the other hand, presents a bounded graphon with jump discontinuity (§ 4.2). In this case, the convergence rate depends on the geometry of the set of the discontinuity (cf. Lemma 4.4). In the light of these examples, in Section 5, we perform convergence analysis under general assumptions on . In Section 6, we illustrate rate of convergence estimates with a numerical example. Here, we choose a nonlinear problem, which has an explicit solution. This allows us to verify the rate of convergence of the -error as the discretization step tends to zero. Special attention is paid to the dependence of the convergence rate on sparsity. Finally, in Section 7 we present a proof of a technical Lemma 3.5, which extends the corresponding result in [24] to models in multidimensional domains and affords a wider range of sparsification.
Numerical methods for nonlocal diffusion problems have been subject of intense research recently due to their increased use in modeling [28, 14, 13, 4, 3, 29]. Compared to the existing literature, the contribution of the present work is that our method applies to problems with nonlinear diffusivity as well as to problems with more general form of the interaction function (cf. (1.5)). The main focus of this paper is how to deal with models with low regularity of the data. We believe that the combination of the Monte Carlo and discontinuous Galerkin methods provides an effective tool for numerical integration of nonlocal problems under minimal regularity assumptions.
2 The model and its discretization
In this section, we formulate the technical assumptions on the kernel and describe the numerical scheme for solving the IVP (1.1)-(1.2).
We assume that is subject to the following assumptions:
[TABLE]
Theorem 2.1**.**
Let satisfy (W-1). Then for any and there is a unique solution of the IVP (1.1), (1.2) .
Proof.
The proof is as in [18, Theorem 3.1] with minor adjustments. ∎
Next, we note that the kernel in the nonlocal term may be assumed nonnegative. Indeed, by writing as the difference of its positive and negative parts, one can rewrite (1.1) as
[TABLE]
where the nonlocal terms splits into the difference of two terms with nonnegative kernels. Thus, without loss of generality, in the remainder of this paper we will assume
[TABLE]
We approximate the IVP (1.1), (1.2) by the following system of ordinary differential equations
[TABLE]
where
[TABLE]
[TABLE]
The semidiscrete system (2.3) can be viewed as a system of interacting particles on a random graph with the node set and adjacency matrix . The positive sequence
[TABLE]
is used to control the sparsity of . The adjacency matrix is defined as follows. The case is slightly different and so we treat it separately. Thus, there are two cases to consider.
(I)
Suppose . Without loss of generality, we further assume that . Then let
[TABLE]
and
[TABLE]
If (cf. (2.7)), is a dense W-random graph [20], otherwise is sparse with the mean degree
(II)
Alternatively, if is in but not in then let
[TABLE]
where defined in (2.7) with . Then
[TABLE]
3 Convergence of the numerical method
In this section, we study convergence of the discrete scheme (2.3), (2.4). We first deal with the more general case of unbounded graphon and then specialize the result for .
The following additional mild assumption on is used to get a wider range of sparsity. Let nonnegative satisfy
[TABLE]
Theorem 3.1**.**
Suppose nonnegative is subject to (W-1s), and are as in (1.1), (1.2). Further, for some . Then for arbitrary we have
[TABLE]
where is a positive constant independent of , and stands for the -projection of onto the finite–dimensional subspace :
[TABLE]
and
[TABLE]
Estimate (3.1) holds almost surely (a.s.) with respect to the random graph model.
Remark 3.2*.*
The theorem still holds without (W-1s), i.e., for square integrable subject to (W-1). In this case, the last term on the right-hand side of (3.1) is replaced by and
The first two terms on the right–hand side of (3.1) correspond to the error of approximation of the initial data and by the step functions in . Further, and bound the error of approximation of by a bounded step function . Here, the first term is the error of truncating and the second term is the error of approximation of the truncated function by projecting it onto a finite–dimensional subspace. Finally, the last term on the right–hand side of (3.1) is the error of the approximation of the nonlocal term by the random sum in (2.3).
For bounded graphons Theorem 3.1 implies the following result.
Corollary 3.3**.**
Let . Then under the assumptions of Theorem 3.1 we have
[TABLE]
where is a positive constant independent of
Remark 3.4*.*
From (3.2) one can see how to use sparsity to optimize computation. Already for if the largest of the two errors of approximation of and by step functions is with (cf. §4.2) and the nonlinearity does not depend on then taking one can use sparse discretization without compromising the accuracy. This has obvious computational advantages over dense random and, moreover, deterministic spatial discretization schemes, e.g., Galerkin method. Sparse random discretization is even more efficient when .
The proof of Theorem 3.1 modulo a few minor details proceeds as the proof of convergence to the continuum limit in [22, 24]. First, the solution of the IVP (2.3), (2.4) is compared to that of the IVP for the averaged equation:
[TABLE]
Then the solution of the averaged problem is compared to the solution of the IVP (1.1), (1.2). It is convenient to view the solution of the averaged problem as a function on :
[TABLE]
Likewise, we interpret the solution of the discrete problem (2.3), (2.4) as a function on
[TABLE]
We recast the IVP (3.3), (3.4) as follows
[TABLE]
The first step of the proof of convergence of the numerical scheme (2.3), (2.4) is accomplished in the following lemma.
Lemma 3.5**.**
Let nonnegative subject to (W-1s), and (cf. (4.24)). Then for any for solutions of (2.3) and (3.3) subject to the same initial conditions, we have
[TABLE]
where and positive constant independent of .
The proof of the lemma is technical and is relegated to Section 7. The result still holds without for square integrable functions the additional assumption (W-1s) albeit for a narrower range of (cf. [24, Theorem 4.1]).
Proof of Theorem 3.1.
Denote the difference between the solutions of the original IVP (1.1), (1.2) and the averaged IVP (3.7), (3.8)
[TABLE]
By subtracting (3.3) from (1.1), multiplying the resultant equation by , and integrating over , we obtain
[TABLE]
Using Lipschitz continuity of in and an elementary case of the Young’s inequality, we obtain
[TABLE]
[TABLE]
where stands for the -norm. Recall that is bounded by (cf. (1.4)). Using this bound and the Young’s inequality, we obtain
[TABLE]
Finally, using Lipschitz continuity of and Young’s inequality, we estimate
[TABLE]
where we used Fubini theorem and (W-1) in the last line.
By combining (3.11)-(3.15), we arrive at
[TABLE]
where
By Gronwall’s inequality, we have
[TABLE]
∎
4 Two examples
The error of approximation of the nonlocal term by a random sum, the last term on the right–hand side of (3.1), is known explicitly. Next in importance is the error of approximation of the square integrable graphon by the step function This error depends on the regularity of the graphon. In this section, we consider two representative examples of : a singular graphon (§ 4.1) and a bounded graphon with jump discontinuities (§ 4.2). Motivated by these examples in the next section, we will analyze the rate of convergence estimates under general assumptions on graphon .
We will begin with the following estimate for Hölder continuous functions. To this end, and
[TABLE]
Lemma 4.1**.**
Suppose is a Hölder continuous function
[TABLE]
Then
[TABLE]
Here and below, .
Proof.
Using Jensen’s inequality and (4.18), we have
[TABLE]
∎
Remark 4.2*.*
Below, we will freely apply Lemma 4.1 to functions on and on . The latter are clearly covered by the lemma by taking .
4.1 A singular graphon
Consider the problem of approximation by step functions of the singular kernel graphon
[TABLE]
where .
Lemma 4.3**.**
For and we have
[TABLE]
Proof.
Below, we will use the following change of variables for and from defined by
[TABLE] 2. 2.
Let and recall that . Denote . Further,
[TABLE]
where we used (4.22) followed by the change to polar coordinates.
Thus,
[TABLE] 3. 3.
Next we turn to estimating . Since the truncated function is Lipschitz continuous on , by Lemma 4.1,
[TABLE]
It remains to estimate the Lipschitz constant On ,
[TABLE]
The gradient approaches its greatest value as . Thus,
[TABLE]
and
[TABLE] 4. 4.
The statement of the lemma follows (4.24) and (4.26) and the triangle inequality.
∎
Next we choose to optimize the rate of convergence in (4.21). By setting the two exponents of on the right–hand side of (4.21) equal, we see that the rate is optimal for
[TABLE]
With this choice of
[TABLE]
To optimize the rate of convergence of the numerical scheme (2.3), (2.4), one has to choose to maximize the smallest of the following three exponents
[TABLE]
where the last exponent comes from the error of the Monte Carlo approximation (cf. (3.1)).
4.2 -valued functions
The following example is adapted from [21]. It shows how jump discontinuities affect the rate of convergence of approximation by piecewise constant functions. The accuracy of approximation depends on the geometry of the hypersurface of discontinuity, more precisely, on its fractal dimension.
Let be a closed subset of and consider
[TABLE]
Denote by the boundary of and recall the upper box-counting dimension of
[TABLE]
where stands for the number of having nonempty intersection with (cf. [16]).
Lemma 4.4**.**
[TABLE]
for some positive independent on .
Proof.
As in (4.19), we have
[TABLE]
Note that the only nonzero terms in the sum on the right–hand side of (4.30) are the integrals over ’s having nonempty intersection with Thus,
[TABLE]
where we used (4.28). ∎
Remark 4.5*.*
Note that as the rate of convergence in (4.29) can be made arbitrarily low.
5 The rate of convergence of the numerical method
5.1 Approximation by step functions
In this section, we address the rate of convergence of the Galerkin component of the numerical scheme (2.3), (2.4). Specifically, we study the error of the approximation of the graphon and the initial data by step functions.
We will need an -modulus of continuity of function on a unit -cube . In fact, we only need the -modulus of continuity, but present the analysis in the more general -setting, since this does not require any extra effort. For functions on the real line, the definition of the -modulus of continuity can be found in [1, 12]. Here, we present a suitable adaptation of this definition for the problem at hand.
Definition 5.1**.**
For we define the -modulus of continuity
[TABLE]
where and .
For we define a generalized Lipschitz space111 Below, we will freely apply the definitions and various estimates established for functions on to functions on for which they are trivially valid by setting . In particular, the definitions of the modulus of continuity and the corresponding Lipschitz spaces obviously translate to functions on .
[TABLE]
Clearly contains -Hölder continuous functions. However, Lipschitz spaces are much larger than Hölder spaces. For instance, contains discontinuous functions.
Below, we express the error of approximation of by a step function through . The analysis works out a little cleaner for dyadic discretization of , which will be assumed for the remainder of this section. Thus, we approximate by a piecewise constant function
[TABLE]
where stands for the mean value of on
[TABLE]
Lemma 5.2**.**
For , we have
[TABLE]
where is independent of .
Proof.
Fix and denote . To simplify notation, throughout the proof we drop in the subscript of and .
We write
[TABLE]
where
[TABLE]
[TABLE]
and
[TABLE]
Rewrite (5.3)
[TABLE]
By subtracting (5.5) from (5.6) we have
[TABLE]
Further,
[TABLE]
Integrating both sides of (5.7) over and using Jensen’s inequality, we continue
[TABLE]
Thus,
[TABLE]
Since , we have
[TABLE]
where depends on and but not .
Let be arbitrary but fixed. For any integer we have
[TABLE]
By passing to infinity in (5.10), we get (5.4). ∎
5.2 The rate of convergence
We now can combine Theorem 3.1 and Lemma 5.4 to estimate the convergence rate for (2.3), (2.4). For the model with a bounded graphon (cf. (I), Section 2) we have the following theorem.
Theorem 5.3**.**
Suppose that in addition to the assumptions of Theorem 3.1, for some and uniformly for i.e.,
[TABLE]
where is independent of
Then
[TABLE]
where is independent of .
If has singularities then the convergence rate may also depend on the accuracy of approximation of by the truncated function . We do not estimate the truncation error for a general . For an example of how this error can be estimated for a given graphon in practice, we refer to the example in § 4.1.
6 Numerical example
In this section, we illustrate convergence analysis in the previous sections with a numerical example. To this end, we consider an IVP for the continuum Kuramoto model with nonlocal nearest–neighbor coupling [27]:
[TABLE]
where stands for the phase of the oscillator at , is its intrinsic frequency. Function describing the connectivity of the network, is first defined on by
[TABLE]
and then extended as a –periodic function on . The initial condition
[TABLE]
is called a –twisted state (Figure 1a). For , is a stationary solution of (6.12). Thus,
[TABLE]
solves the IVP (6.12), (6.13). We use the explicit solution (6.16) to compute the error of the numerical integration of (6.12), (6.13).
To estimate the rate of convergence of the numerical scheme (2.3), (2.4) we use the following values of parameters: and For these parameter values, the travelling wave solution (6.16) is unstable. We integrated (6.12) numerically for using the fourth order Runge–Kutta method with the time step Note that the error of the Runge–Kutta method, i.e., of the discretization in time is significantly smaller than of that of the discretizing in space (c.f. (2.3), (2.4)). We integrated (6.12) numerically for different values of and for . For each pair we repeated the numerical experiment times and computed the mean value of the error of numerical integration (compared to the exact solution (6.16)). The mean errors and computed for and respectively are used to determine the convergence rate:
[TABLE]
The results of this numerical experiment are shown in Figure 1b. Our main goal was to verity the dependence of the convergence rate on sparsity controlled by . The pixel pictures for the adjacency matrices of random graphs corresponding to the nonlocal nearest–neighbor coupling for and different values of are shown in Figure 2. The plot in Figure 1b shows a clear linear relation between the exponent and . The numerical rates plotted by blue stars are slightly lower the theoretical rates plotted in red. Overall numerical rates show a good fit with the analytical estimate.
7 Proof of Lemma 3.5
In this section, we prove Lemma 3.5. The proof follows the lines of the proof of Theorem 4.1 in [24], which covers for . Extension to the multidimensional case is straightforward. Lemmas 7.3 and 7.4 adapted from [9] allow to extend the range of to . The reader not interested in the extended range of may find a simpler proof in [24] easier to follow. For those interested in the full range of below we present the following proof of Lemma 3.5.
Theorem 7.1**.**
Let nonnegative satisfy
[TABLE]
and
[TABLE]
Then for solutions of (2.3) and (3.3) subject to the same initial conditions and arbitrary , we have
[TABLE]
for arbitrary and positive constant independent of . In particular, for , we have
[TABLE]
where can be taken arbitrarily small.
We precede the proof of Theorem 7.1 with several auxiliary estimates.
Lemma 7.2**.**
From (W-1s) it follows
[TABLE]
Proof.
We prove (7.4) assuming that nonnegative is in but not in . In this case, are defined by (4.24). For arbitrary and we have
[TABLE]
where we used Jensen’s inequality in the second line and (W-1s) in the last line. Thus,
[TABLE]
The bound for is proved similarly. ∎
Lemma 7.3**.**
For , we have
[TABLE]
In particular, with probability there exists such that
[TABLE]
for all .
For the next lemma, we will need the following notation
[TABLE]
and .
Lemma 7.4**.**
For arbitrary , we have
[TABLE]
where is a positive constant independent of and
[TABLE]
Proof of Theorem 7.1.
Recall that and are Lipschitz continuous function in with Lipschitz constants and respectively.
Further, are Bernoulli random variables
[TABLE]
Denote By subtracting (2.3) from (3.3), multiplying the result by and summing over , we obtain
[TABLE]
where is the discrete -norm (cf. (7.12)).
Using Lipschitz continuity of in , we have
[TABLE]
Using Lipschitz continuity of and the triangle inequality, we have
[TABLE]
Using Lemma 7.3 and (7.4), we obtain
[TABLE]
Similarly,
[TABLE]
By plugging (7.17) and (7.18) into (7.16), we have
[TABLE]
It remains to bound :
[TABLE]
The combination of (7.14), (7.15), (7.19) and (7.20) yields
[TABLE]
where
Using the Gronwall’s inequality and Lemma 7.4, we have
[TABLE]
∎
Proof of Lemma 7.3.
Let
[TABLE]
Note that for fixed are mean zero independent RVs. Further, using the definition of it is straightforward to bound
[TABLE]
From (7.25), we have
[TABLE]
Using Bernstein’s inequality and the union bound, we have
[TABLE]
Finally, the combination of (7.23) and (7.27) yields
[TABLE]
This proves (7.13). By Borel-Cantelli Lemma, (7.7) follows. ∎
Proof of Lemma 7.4.
Recall (7.8)-(7.10) and rewrite
[TABLE]
where
[TABLE]
By (7.1), one can choose a sequence such that
[TABLE]
Specifically, let
[TABLE]
and define events
[TABLE]
[TABLE]
Clearly,
[TABLE]
We want to show that . By Borel-Cantelli Lemma, it is sufficient to show that
[TABLE]
From Lemma 7.3, we know that for In the remainder of the proof, we show that is convergent.
Applying the exponential Markov inequality to , from and (7.32), we have
[TABLE]
Using the independence of in , we have
[TABLE]
Using
[TABLE]
and the Cauchy-Schwartz inequality, we bound the right–hand side of (7.36) as follows
[TABLE]
From (7.10), (7.29), and under (cf. (7.33)), we have
[TABLE]
Further,
[TABLE]
Using (7.10), we estimate sum of the fourth moments of
[TABLE]
where we also use (7.4). Similarly,
[TABLE]
By combining (7.39)-(7.41), we obtain
[TABLE]
By plugging (7.38) and (7.42) into (7.37), we obtain
[TABLE]
Using this bound on the right–hand side of (7.36), we further obtain
[TABLE]
Using (7.44), from (7.35) we obtain
[TABLE]
Furthermore, using (7.31) it is straightforward to check that
[TABLE]
The statement of the lemma then follows from (7.32)-(7.34) via Borel-Cantelli Lemma. ∎
Acknowledgements. This work was supported in part by the NSF grant DMS 1715161.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] N. I. Akhieser, Theory of Approximation , Dover Publications, Inc., New York, 1992, Translated from the Russian and with a preface by Charles J. Hyman, Reprint of the 1956 English translation.
- 2[2] Fuensanta Andreu-Vaillo, José M. Mazón, Julio D. Rossi, and J. Julián Toledo-Melero, Nonlocal Diffusion Problems , Mathematical Surveys and Monographs, vol. 165, American Mathematical Society, Providence, RI; Real Sociedad Matemática Española, Madrid, 2010.
- 3[3] Lehel Banjai, Jens M. Melenk, Ricardo H. Nochetto, Enrique Otárola, Abner J. Salgado, and Christoph Schwab, Tensor FEM for spectral fractional diffusion , Found. Comput. Math. 19 (2019), no. 4, 901–962.
- 4[4] Andrea Bonito, Juan Pablo Borthagaray, Ricardo H. Nochetto, Enrique Otárola, and Abner J. Salgado, Numerical methods for fractional diffusion , Comput. Vis. Sci. 19 (2018), no. 5-6, 19–46.
- 5[5] Christian Borgs, Jennifer T. Chayes, Henry Cohn, and Yufei Zhao, An L p superscript 𝐿 𝑝 L^{p} theory of sparse graph convergence I: Limits, sparse random graph models, and power law distributions , Trans. Amer. Math. Soc. 372 (2019), no. 5, 3019–3062.
- 6[6] Nikolaos Bournaveas and Vincent Calvez, The one-dimensional Keller-Segel model with fractional diffusion of cells , Nonlinearity 23 (2010), no. 4, 923–935.
- 7[7] C. Carrillo and P. Fife, Spatial effects in discrete generation population models , J. Math. Biol. 50 (2005), no. 2, 161–188.
- 8[8] Stephen Coombes, Peter beim Graben, and Roland Potthast, Tutorial on neural field theory , Neural fields, Springer, Heidelberg, 2014, pp. 1–43.
