Convergence of stationary radial basis function-schemes for evolution equations
Brad Baxter, Raymond Brummelhuis

TL;DR
This paper proves convergence rates for semi-discrete radial basis function schemes applied to evolution equations, including parabolic and hyperbolic types, using stationary interpolation on regular grids.
Contribution
It introduces precise convergence analysis for RBF-based schemes on regular grids, extending previous work with a broader class of basis functions.
Findings
Established convergence rates for RBF schemes on evolution equations.
Extended applicability to parabolic and hyperbolic equations.
Provided approximate approximation results for these schemes.
Abstract
We establish precise convergence rates for semi-discrete schemes based on Radial Basis Function interpolation, as well as approximate approximation results for such schemes. Our schemes use stationary interpolation on regular grids, with basis functions from a general class of functions generalizing one introduced earlier by M. Buhmann. Our results apply to parabolic equations such as the heat equation or Kolmogorov-Fokker-Planck equations associated to L\'evy processes, but also to certain hyperbolic equations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Differential Equations and Numerical Methods · Fluid Dynamics and Turbulent Flows
Convergence of Stationary Radial Basis Function-schemes for evolution equations
Brad Baxter and Raymond Brummelhuis
Brad Baxter, Department of Economics, Mathematics and Statistics, Birkbeck, University of London, Malet Street, London WC1E 7HX, UK, e-mail: [email protected]
Raymond Brummelhuis, Laboratoire de Mathématiques de Reims, Université de Reims-Champagne-Ardenne, FR 3399 CNRS, BP 1039, 51687 cedex 2, Reims, France, e-mail: [email protected]
Abstract.
We establish precise convergence rates for semi-discrete schemes based on Radial Basis Function interpolation, as well as approximate approximation results for such schemes. Our schemes use stationary interpolation on regular grids, with basis functions from a general class of functions generalizing one introduced earlier by M. Buhmann. Our results apply to parabolic equations such as the heat equation or Kolmogorov-Fokker-Planck equations associated to Lévy processes, but also to certain hyperbolic equations.
2010 Mathematics Subject Classification:
Primary: 65M12, 65M15, 65M20, 65M70; Secondary: 35S05, 35S10, 41A05, 41A25, 60G51
1. Introduction
We establish convergence rates for semi-discrete numerical schemes based on stationary Radial Basis Function (RBF) interpolation. We do this for the classical heat equation but also for more general constant coefficient (translation invariant) pseudo-differential evolution equations. Our analysis applies to (suitably regularized versions of) the fractional heat equation and other Kolmogorov-Fokker-Planck equations associated to Lévy processes, but also to certain non-parabolic equations such as the free Schrödinger equation or hyperbolic equations such as the half-wave and transport equations.
The scheme we consider is an RBF-version of the method of lines, implemented on regular square grids, and we examine the convergence of the scheme when the grid size tends to 0, using basis functions which scale with the grid. We refer to Buhmann [4], Wendland [17] and Fasshauer [7] for general introductions to Radial Basis Function interpolation, and to the beginning of sections 5 and 6 below for a precise description of the scheme; roughly speaking, RBF-interpolation seeks to interpolate a function by a linear combination of translates of a given basis function, centered in the interpolation points. The coefficients can be computed by solving a linear system whose coeffient matrix, under appropriate conditions on the basis function, is always non-singular. To turn this into a scheme for solving a linear evolution equation on , one takes the coefficients to be time-dependent and asks that the equation be satisfied exactly in the interpolation points, leading to a system of ordinary differential equations. An advantage of using a regular grid for the interpolation points is that the interpolation problem can be reduced to the construction of a single Lagrange function (also called cardinal function), which greatly simplifies the theoretical analysis. The advantage of using basis functions which scale with the grid, known as stationary interpolation, is that the condition number of the coefficient matrix of the linear system which, in a practical implementation of the scheme, has to be solved numerically, becomes independent of the grid-size.
The numerical performance of such a scheme was examined in [2], [9], [14] [5], [6], motivated by applications in mathematical finance. Our main results will relate the order of convergence of the scheme to the degree of the operator and to the approximation order of the underlying RBF interpolation, as function of the grid-size. The latter will only be algebraic, since we use stationary interpolation, but can be arbitrarily large, depending on the basis function and the degree of smoothness of the initial condition. We will furthermore show that under certain circumstances we can have approximate approximation, in the sense that, in case of non-convergence of the scheme to the true solution, one can nevertheless get arbitrarily close to the real solution by an appropriate choice of basis function. More generally, if the scheme does converge, one can, for initial values which are sufficiently smooth and for grid-sizes which are small but not too small, observe an apparent order of convergence which is bigger than the actual one. This was observed numerically in [2]. The notion of approximate approximation was introduced by Maz’ya [11], and analyzed in detail for approximation using Gaussian kernels in Maz’ya and Schmidt [12], [13]. Amongst other things, we generalize their work to more general basis functions.
We will in fact not restrict ourselves to particular examples of Radial Basis Functions, such as the generalized multiquadrics or the polyharmonic basis functions, but carry out our analysis for a general class of basis functions which we introduce in section 2. Since this class is a generalization of one introduced by Martin Buhmann in [3] and by which it was inspired, we have called it the Buhmann class. We will analyze the properties of our scheme in Fourier space and, for that reason, first re-examine in section 3 the convergence of RBF-interpolation on regular grids from the Fourier point of view by deriving precise estimates for the Wiener norm of the difference between a function and its RBF-interpolant. Our convergence theorems have a non-zero intersection with classical results of Buhman and Powell (see [4] and its references), generalizing these in some respects. Despite the use of the Wiener norm we can allow certain classes of polynomially increasing functions. The Fourier transform of such a function will have a non-integrable algebraic singularity at 0, and the allowed order of the singularity (and therefore the allowed growth of the function) will depend on the basis function which is used for the interpolation. In section 4 we show that the convergence rates which we found in section 3 are best possible, and discuss approximate approximation.
The next two sections examine the convergence of the RBF-variant of the method of lines, first, in section 5, for the in many respects typical case of classical heat equation before indicating, in section 6, how these results extend to more general pseudo-differential evolution equations. We show that the scheme converges at a rate of , where is the order of the operator ( for the heat equation) and the order of convergence of the underlying RBF-interpolation scheme, which is also the order of the singularity in 0 of the Fourier transform of the basis function which is used. We show that this rate is in general optimal, but that we can also have approximate approximation, in the sense that for appropriate basis functions which are sufficiently ”flat” and with sufficiently smooth initial data there can be an apparent higher order of convergence when is ”small but not too small”, which is determined by the degree of smoothness of the initial data. This provides an explanation for the empirical convergence rates which were observed in [2].
One obvious limitation of the present paper is that we have restricted ourselves to interpolation on regular grids of scaled integer points, whereas one of the strengths of the RBF method is that one can use arbitrarily scattered interpolation points, opening up the way to adaptive methods. This flexibility may become important when dealing with variable coefficient linear differential operators or with non-linear ones. We note however, in our defense, that the often-used Finite Difference methods are usually restricted to regular grids also, and that even on regular grids RBF methods can have definite advantages over FD methods when treating non-local operators, as they do not discretize the operator, and can therefore be better suited when the latter has a singular kernel, such as for the Kolmogorov backward equation of certain Lévy processes: see [2]. Another limitation is that we only have treated translation invariant evolution equations. These do however already include large classes of operators which are of interest for applications, such as the fractional heat equation or other Kolmogorov-Fokker-Planck equations associated to Lévy processes. It would obviously be of interest to generalize our results to variable coefficient PDEs, but this will require other methods. We also have (primarily) examined convergence in Wiener norm: convergence of the scheme in the norm or more general Sobolev norms will be treated elsewhere. We finally want to note that although we have used stationary RBF interpolation, our analysis can be extended to the non-stationary case, when one uses the same basis function for all grid-sizes. In this case one can have exponential rates of convergence, as first discoverd by Madych and Nelson [10] for interpolation of functions in the so-called native space of the basis function. This also will be examined elsewhere.
Notations: denotes the usual ”variable constant”, whose exact numerical value is allowed to change from one occurrence to the other. We use the following convention for the Fourier transform of an integrable function on :
[TABLE]
being the Euclidean inner product on We will routinely use the extension of the Fourier transform to the space of tempered distributions , where is the usual Schwarz-space of rapidly decreasing functions.
For , let be the space of measurable functions on for which
[TABLE]
and , its non-homogeneous version, with norm We will also need the ”mixed” spaces
[TABLE]
for , where Note that this scale of spaces is increasing in for , and that , while
Finally, we define weighted sup-norm spaces of measurable functions such that
[TABLE]
where the sup is the essential supremum, as usual. If , an element of is of polynomial growth of order at most : on , with
Derivatives of functions on will be denoted by or by , a multi-index. If and , then will denote the Hölder space of -times differentiable functions on with bounded derivatives of all orders, such that the derivatives of order satisfy a uniform Hölder condition on with exponent , provided with the norm
[TABLE]
where
Finally, and denote the usual floor and ceiling functions, defined as the greatest, respectively smallest integer which is less than, respectively greater than a real number ; note that if , while otherwise.
2. A class of basis functions for interpolation on a regular grid
2.1. The Buhmann class
We introduce a flexible class of basis functions which is well-suited for stationary interpolation on regular grids. Since this class of functions is a generalisation of the one introduced earlier by Buhmann [3] (called admissible in there) we will call it the Buhmann class. From the onset, we will allow non-radial basis functions, radiality not being essential for most of the theory (as is of course well known).
Definition 2.1**.**
For and we define the Buhmann class as the set of functions such that
(i) is of polynomial growth of order strictly less than , in the sense that for some
(ii) (Regularity and strict positivity.) The restriction to of the Fourier transform (in the sense of tempered distributions) can be identified with a function in , which we will continue to denote by , which is pointwise strictly positive: for all
(iii) (Elliptic singularity at 0.) There exist positive constants such that for all ,
[TABLE]
while also
[TABLE]
(iv) (Decay at infinity.) There exist positive constants , , such that
[TABLE]
We use the term ”elliptic” for condition (iii) because of the resemblance of (4) and (5) with the ellipticity condition on symbols in pseudo-differential theory (where the singularity would be at infinity). The significance of is that this is the smallest integer which is strictly greater than (Note that if , then ) Conditions (ii) and (iii) for derivatives up till this order will imply polynomial decay of order of the associated Lagrange interpolation function which we will define below. Requiring higher order differentiability would not improve this rate of decay: is best possible, under condition (iii).
In most of the results of this paper, strict positivity of on could have been replaced by the weaker condition that the ”periodisation” of , , be pointwise strictly positive on all of , as in [3]; note that by (6) with , this series converges absolutely on , given that , while it can be set equal to on , in view of (5). Since for most of the radial basis functions used in practice, itself is already strictly positive, we have opted to impose the stronger condition, which also simplifies the proofs.
Remarks 2.2**.**
(i) Buhmann [3] studied stationary RBF interpolation on regular grids for a slightly more restricted class of radial basis functions. The main difference between his original class and the one of our definition 2.1 (besides, as already mentioned, Buhmann requiring strict positivity of the periodisation of instead of of itself) lies in condition (iii), where Buhmann asks that for small , be asymptotically equivalent to a positive multiple of modulo an relative error which has to be sufficiently small: with and as for , with an Under these conditions Buhmann proved the existence of a unique Lagrange function for interpolation on , constructed as an infinite linear combination of translates of , which moreover decays as at infinity. This fundamental result remains true for ’s in : see theorem 2.3 below and and its proof in Appendix A. The condition that is in our treatment made unnecessary by lemma A.2.
(ii) All conditions in definition 2.1 except the first are on the Fourier transform of One can show (cf. Appendix A) that if the Fourier transform of a polynomially increasing function satsifies (ii), (iii) and (iv), then there exists a function which grows at most as at infinity (and, slightly better, as if ) and a polynomial such that
[TABLE]
The function is unique modulo polynomials of degree If we moreover require to have polynomial growth of order strictly less than , as in definition 2.1, then will be a polynomial of degree of at most (which is if , and if ). Note that the Fourier transform of a polynomial is a linear combination of derivatives of the delta-distriubution in 0, and therefore equals 0 on
(iii) The condition that will suffice for convergence of the RBF interpolants on regular grids as , but will have to be strengthened to for convergence of the RBF schemes for solving parabolic PDEs and PIDEs which are of order (in the space variables).
The usual examples of radial basis functions, such as the generalised multi-quadrics, cubic and higher order splines, thin plate splines, inverse multi-quadrics and Gaussians, are Buhmann class.
One can show that if , , satisfies conditions (ii) - (iv) of definition 2.1, then conditionally positive definite of order , where is the smallest integer such that : for this it would in fact be sufficient that is locally integrable, satisfies (4) with and is integrable on One can therefore, by standard RBF theory, interpolate an arbitrary function on a finite set of points by a linear combination of translates of plus a polynomial of degree , provided the set is unisolvent for this class of polynomials: see for example [4]. This involves solving a linear system of equations. The next theorem establishes the existence and main properties of a Lagrange function in terms of which the solution of the interpolation problem on can be simply expressed.
Theorem 2.3**.**
Suppose that with and Then there exist coefficients , , such that the series
[TABLE]
converges absolutely and uniformly on compacta and defines a Lagrange function for interpolation on
[TABLE]
The function satisfies the bound
[TABLE]
and its Fourier transform is given by
[TABLE]
Moreover, at the points of , satisfies the Fix-Strang conditions:
[TABLE]
See Appendix A for the proof. We make the trivial but for the sequel important observation that
[TABLE]
Remarks 2.4**.**
(i) We will write if we want to stress the dependence on the basis function , otherwise we will simply write The subindex 1 in is a notational reminder that is a Lagrange function for interpolation on the standard grid with width 1. For stationary RBF interpolation on the scaled grids one uses the scaled basis functions , whose associated Lagrange functions then simply are If for some , then is an infinite linear combination of translates of which will interpolate on , where the series converges absolutely and uniformly on compacta, in view of the growth restriction on
(ii) One important point of the theorem is that the basis function need not decay at infinity, but is allowed to grow polynomially. A high order of growth will in fact lead to a high order convergence of the stationary RBF interpolants to as , since this will translate into a strong singularity in 0 of the Fourier transform of in the form of a large , which implies that will satisfy the Fix - Strang conditions to a high order. The latter then implies a convergence rate of in sup-norm, as shown by Buhmann [3] (under suitable conditions on ); see also [4], Chapter 4. We will prove such convergence theorems for the Wiener norm instead of the sup-norm, using an entirely different approach: see theorems 3.2, 3.4 and 3.9. Note that, contrary to , the Lagrange function will decay at infinity, as shown by (8), and this the more rapidly the higher is. In particular, is integrable if and its Fourier transform then exists in the classical sense, as an absolutely convergent integral. It is possible for to have faster decay: Buhmann [3] shows that if as with , then
[TABLE]
while there are examples of for which decays exponentially: see [4] for details and references.
(iii) The proof of theorem 2.3 shows that the coefficients are precisely the Fourier coefficients of They satisfy bounds analogous to the ones satisfied by : This guarantees that the defining series for converges absolutely and uniformly on compacta including when , in view of condition (i) of definition 2.1.
Since the denominator of (9) is -periodic and positively bounded away from 0, will have the same decay as as We state this as a lemma, for later reference:
Lemma 2.5**.**
There exists a constant such that for all ,
[TABLE]
Proof. The function is periodic and, by the positivity and ellipticity of at 0, bounded from below by for some Hence
[TABLE]
which implies (12).
Another useful lemma clarifies the smoothness properties of :
Lemma 2.6**.**
There exist constants such that for each multi-index with and all ,
[TABLE]
for in a neighborhood of 0. In particular, if then belongs to the Hölder space , with
Note that if is non-integer, but that it is equal to if is a positive integer, so that then.
Proof of lemma 2.6. This is elementary: if we let , then applying Leibnitz’s rule to the product yields that
[TABLE]
The estimate (13) for now follows by induction on , using that , together with (6) of definition 2.1 and lemma 2.5 (to start the induction). If , we use the same argument, starting from
[TABLE]
on observing that the right hand side is near 0, since equal to Finally, the fact that is implies that all derivatives of order up to , if , or , if , exist and are 0. Their continuity in 0 follows from (13).
Remark 2.7**.**
We briefly pause to examine the differentiability of if Letting and , we have that
[TABLE]
This shows that cannot be in 0 if is not even, even if is (note that then given that is Buhmann class). If , then will be as smooth as is in 0, and therefore as is away from 0.
We finally note that to construct numerical PDE schemes using RBF interpolation one will obviously need sufficient differentiability of The proof of theorem 2.3 given in appendix A also yields existence and decay of derivatives of , provided is chosen sufficiently large:
Theorem 2.8**.**
Suppose that and let with Then Moreover, as , for all
See also theorem 6.1 below.
3. Convergence of RBF-interpolants
As a preparation for our analysis of numerical RBF-schemes for the heat equation and other evolution equations, we first revisit the convergene of stationary RBF interpolants, providing an alternative perspective on classical convergence theorems of Buhmann and Powell. As stated in the introduction, we will limit ourselves to stationary interpolation on regular grids , meaning that we let the basis function scale with the grid-size: The associated Lagrange function scales similarly, and the RBF interpolant of a given function can be conveniently written as
[TABLE]
where is the Lagrange function of theorem 2.3. Here, and below, sums over , , , etc. are understood to be over Note that the use of the Lagrange function eliminates the need for inverting the coefficient matrix in the standard formulation of RBF interpolation111In the present, idealized, set-up of interpolation on that coefficient matrix is infinite; in practice, one would have to truncate the matrix: (where, ) with , taking larger and larger sections of the matrix as One would also have to truncate the series for , leading to quasi-interpolation.. The decay at infinity of easily implies that the series (14) converges absolutely if is of polynomial growth of order strictly less than , in the sense that for some
Throughout this section, we fix a basis function with and : see subsection 3.2 below for the case of : although there can be no convergence then, this case is nevertheless of interest since we can have approximate convergence in the sense of Maz’ja and Smith: see secion 4 below. We will systematically work in Fourier-space, and examine convergence of to in Wiener norm,
[TABLE]
except for the end of this section where we will briefly examine convergence in weigthed sup-norms. Convergence in Wiener norm of course trivially implies convergence in Chebyshev or uniform norm, since
3.1. Convergence in Wiener norm
We begin by computing the Fourier transform of for Schwarz-class functions For sufficiently rapidly decaying functions , let us define the function
[TABLE]
The map will play an important rôle in what follows. We note that is a contraction with respect to the -norm: indeed, by the positivity of and monotone convergence,
[TABLE]
in view of (11); therefore extends to a contraction on We also note that if , then the defining series for converges absolutely a.e., since
[TABLE]
Lemma 3.1**.**
If then and then
[TABLE]
Proof. Since , is integrable by theorem 2.3 and therefore Applying Fubini’s theorem to the function on one finds
[TABLE]
where for the second line we used the Poisson summation formula: , with
We can then already state a first convergence theorem:
Theorem 3.2**.**
Let Then there exists a constant such that for all tempered functions for which and for all positive ,
[TABLE]
The condition that implies a certain smoothness: must have continuous and bounded derivatives of order
Proof. We first note that if , then : the hypothesis on implies that is a bounded continuous function. It follows that is well-defined, by (8), and that there exists a constant such that for all ,
[TABLE]
Indeed, ; the right had side is -periodic, and its sup on can be estimated by a constant times , which converges since
The Fourier transform of therefore exists as a temperered distribution. We show using a density argument that : since , then there exists a sequence such that Consequently in and therefore also as tempered distributions. On the other hand, , so as tempered distributions also. Hence , and consequently As a consequence, has finite Wiener norm if has.
We now observe that since , and using the montone convergence,
[TABLE]
where we used (11) once more. The Fix-Strang condition (10) in 0 then implies (18) with (a number which, in principle at least, is explicitly computabe for a given ).
Remark 3.3**.**
The theorem generalizes to the case when is a finite Borel measure such that : in that case,
[TABLE]
To show this, one first defines by duality: if , then
[TABLE]
where , and one checks that as tempered distributions. Since , on account of the decay of , is a finite Borel measue. Using (11) again, one estimates
[TABLE]
where we can take It follows that the variation norm of is bounded by , which implies (21).
We next observe that the right hand side of (18) still makes sense for certain having a non-integrable singularity at 0. Allowing such singularities means allowing ’s which grow at a certain polynomial rate, and we can prove the following extension of theorem 3.2:
Theorem 3.4**.**
Let be a tempered function on such that for some , and such that
[TABLE]
Then is in , and
[TABLE]
Equation (22) means that the restriction to of the tempered distribution can be identified with a locally integrable function which is integrable with respect to the weight and therefore is in , if we interpret it as an almost everywhere defined function on , whose norm we then simply denote by instead of the more correct
Proof. We first check that is a tempered distribution: this is a consequence of the estimate
[TABLE]
which can be shown as follows: first note that commutes with translations by elements of : if , then
[TABLE]
Let be the -norm on If and with , then
[TABLE]
since if Next, if with , then by translation invariance,
[TABLE]
which implies (24). The next lemma identifies the Fourier transform if
Lemma 3.5**.**
Suppose that for some and that Then the tempered distribution can be identified with the function
[TABLE]
which is in
The proof of the lemma involves extending to a continuous linear functional on the Hölder spaces with (so that if , and otherwise), and using this to define as a tempered distribution. In order not to interrupt the flow of the argument with distribution-theoretical technicalities, we postpone the proof to Appendix B. Note that the individual terms of (25) are integrable on account of the Fix-Strang conditions satisfied by , and that the -norm of (25) can be bounded by the -norm of , using once more that the sum of translates of by elements of is identically equal to one.
The lemma implies the estimate (20), and the theorem follows as before.
Example 3.6**.**
If and if satisfies
[TABLE]
with then one can show that , and that near 0 while at infinity: if (26) holds for all , this follows for example from Stein [15], proposition 1 of Chapter VI. Examination of the proof shows that we only need the number of derivatives indicated. It follows that is integrable if and theorem 3.4 therefore applies to such functions.
We briefly compare our theorem 3.4 with convergence results of Buhmann and Powell, cf. [4] and its references. Theorem 4.6 of [4] states that if and if such that
[TABLE]
then If is an odd integer, Buhmann loc cit., theorem 4.7, needs an additional degree of differentiability, with the derivatives of order again bounded. Note that these condititons imply that is of polynomial growth of order at most On the other hand, theorem 3.4 covers cases when can have stronger growth, and derivatives of order do not not need to be bounded, for example with if and if . We also note that condition (22) implies that is (a consequence of the integrability of on and the smoothness of the inverse Fourier transform of any compactly supported distribution) without the derivatives of order necessarily being bounded (because of the singularity at 0) and we believe this order of differentiability to be the maximal one necessary.
Remark 3.7**.**
If then theorem 3.4 remains true if can be identified with a Borel measure on for which The estimate (23) then generalises to an estimate for the variation norm of (as measure on ) which then implies a uniform estimate
[TABLE]
We next observe that if one is satisfied with a slower rate of convergence, the growth condition on at infinity can be weakened. Recall the spaces defined in (2).
Corollary 3.8**.**
(of the proof). Suppose that and that for some such that
[TABLE]
Then for all ,
[TABLE]
Here, the constant can be taken the same as in theorems 3.4 and 3.2 (cf. the proof of the latter).
Proof. The hypotheses on certainly imply that , so we can apply lemma 3.5. In particular, the estimate (20) still holds. Now split this integral into three parts over the ranges , qnd , and use the Fix - Strang condition in 0, together with the trivial bound if to estimate:
[TABLE]
The corollary shows that there is an interplay between the order of convergence of the RBF interpolants and the smoothness of the function which is interpolated, as quantified by the decay of at infinity. The singularity of at 0 can be of order , as before, which is compatible with having polynomial growth of order less than
3.2. The case of
The results of this section, and in particular the basic estimate (20), remain true if , which includes for example Gaussian basis functions. The reader may of course wonder why one would want to consider this case, since this estimate then does not imply that will converge to and, as we will see in section 4, it won’t. The reason is that we can still have approximate approximation, in the sense that can be made arbitrarily small for functions whose Fourier transform decay sufficiently rapidly, by an appropriate choice of basis function: see theorem 4.1 and the discussion following it. There is a minor technical problem, in that theorem 2.3 no longer guarantees that decays sufficiently rapidly to be integrable, though it may for particular examples such as the Gaussian, or under stronger conditions on , as per Buhmann’s result for integer pair mentioned before. The interpolating function is therefore no longer be guaranteed to be in , even if is rapidly decreasing. Its Fourier transform will still exist as a tempered distribution, and lemma 3.1 will still be true, as an easy approximation argument shows: in so as tempered distributions. But converges to
Next, the series for will not necessarily converge if and we need to impose an additional condition that for some , as in theorem 3.4. Alternatively, we can observe that if is integrable, then the defining series for is summable in the sense that if with and if we let
[TABLE]
then, as , converges uniformly on to a continuous function whose Fourier transform is , independently of the choice of To show this, observe that the Fourier transform of the left hand side is equal to , where Since (by a classical result on convolution with approximate identities),
[TABLE]
in (observing that ), and since is a contraction, it follows that converges in Wiener norm, and therefore in sup-norm, to the inverse Fourier transform of the, integrable, function If we now define as the limit of the , the estimate (20) follows as before. We in fact only need the Fourier transform of to be integrable, but this excludes taking for the characteristic function of a cube centered at 0, which would entail ordinary convergence of the series for
3.3. Convergence in weighted sup-norms
Although our focus in this paper is on convergence in the Wiener norm, one can allow ’s for which does not necessarily coincide with a function or a measure on , but is a more general distribution, if we replace the Wiener norm by one of the weighted sup-norms We give an example which can be deduced from theorem 3.2 by an approximation argument.
Theorem 3.9**.**
Let , and let be a tempered function on whose Fourier transform can be written as
[TABLE]
with complex Borel measures on satisfying
[TABLE]
Then
[TABLE]
where we can take
[TABLE]
for some positive constant independent of
Proof. The hypothesis on imply that is continuous and of polynomial growth of order at most : Let , where , , on We will apply theorem 3.2 to and for that purpose first bound
Lemma 3.10**.**
For as in theorem 3.9 and ,
[TABLE]
with as in (32).
Proof. Since , we find that (writing for )
[TABLE]
The lemma follows by observing that and using the rapid decay of
Proof of theorem 3.9 (continued). By theorem 3.2,
[TABLE]
We next compare with : since for ,
[TABLE]
Now if , then implies that so that Hence, by the decay at infinity of ,
[TABLE]
since we can for example bound the sum by a constant times (recall that ).
Writing , these estimates imply that
[TABLE]
for , which implies the theorem.
Examples of functions which satisify the hypothesis of theorem 3.9 are the inverse Fourier transforms of compactly supported distributions of order since, by a structure theorem going back to Laurent Schwartz, such a compactly supported distribution can be written in the form (29)).
4. Approximate approximation
It is easy to show that the approximation error in the Wiener norm cannot in general go to 0 faster than : if has compact support, then the supports of will be disjoint if is sufficiently small. It follows that
[TABLE]
since If we define
[TABLE]
then Fatou’s lemma implies that
[TABLE]
We will see below that The inequality (35) remains valid if is not compactly supported but decays sufficiently fast at infinity: see theorem 4.3 below. Here we first examine the corresponding upper bound.
As we just noted, one cannot in general do better that for the approximation error. However, for suitable basis functions and for which decay sufficiently fast at infinity we may observe a higher apparent rate of convergence for ’s which are small but not too small. If , we let222The index is a reminder of the degree of the singularity of at 0, and therefore of the natural convergence rate of the RBF interpolants.
[TABLE]
A slight modification of the proof of theorems 3.2 and 3.4 then gives the following more precise estimate for the approximation error.
Theorem 4.1**.**
Let with and suppose that for some such that for some ,
[TABLE]
Then there exists for each , a constant such that
[TABLE]
Concretely, the condition on means that
[TABLE]
The theorem implies that if is very small, and with then the rate of convergence for small, but not too small ’s will at first appear to be , up to the point that the first term dominates and the error saturates at a level comparable to This is the phenomenon of approximate approximation which was discovered by Maz’ya [11] in the context of quasi-interpolation: see also Maz’ya and Schmidt [13]. The quasi-interpolants these authors consider are, in our notation, with of the form , where is rapidly decreasing (for example, a Gaussian) and is called a shape parameter. Since we are in the case of , their quasi-interpolants will not converge to , but it is shown in [13] that if is smooth, satisfies certain moment conditions and decays sufficiently rapidly at infinity, and if has bounded derivatives up till order , then by choosing sufficiently large one can achieve an apparent order of convergence of up to a small saturation error which goes to 0 as tends to infinity. This should be compared with theorem 4.1 when , in which case there will also be no actual convergence and where the required smoothness of is formulated in terms of its Fourier transform. Of course, this theorem concerns the exact interpolants instead of the quasi-interpolants. We will use shape parameters below to construct basis functions with small
We will encounter similar approximate approximation phenomena when studying convergence rates of RBF schemes in sections 5 and 6 below.
Proof of theorem 4.1. It suffices, by (20), to bound Let Then if , there exists a such that if , then , and
[TABLE]
which implies the theorem.
Corollary 4.2**.**
If satisfies the conditions of theorem 4.1, then
[TABLE]
The next theorem complements this upper bound by the lower bound (35) when is not necessarily compactly supported.
Theorem 4.3**.**
Let satisfy the hypothesis of theorem 4.1: for some and for some Then
[TABLE]
Proof. We only need to establish the lower bound. If is the -norm on , let , the cube centered at 0 with sides , and let Then
[TABLE]
so that
[TABLE]
The double sum in the second line can be bounded by
[TABLE]
where we used that
[TABLE]
The first line of (39), on account of taking values in , equals
[TABLE]
where is the indicator function of Since as , the lower bound now follows once more by Fatou’s lemma and the definition of
The next proposition gives a simple explicit formula for and
Proposition 4.4**.**
For with and , let
[TABLE]
Then if ,
[TABLE]
while if ,
[TABLE]
Note that by definition 2.1 (iii) and that the series in these formulas converges absolutely since
Proof. If is the Lagrange function associated to , then
[TABLE]
If we let , then is continuous (even ) in a neighborhood of 0. Since
[TABLE]
(41) and (42) follow upon letting
Corollary 4.5**.**
If exists, then , and
[TABLE]
for as in theorem 4.1 with
We can often construct basis functions with small by introducing a so-called shape-parameter and taking of the form with large, for suitable :
Proposition 4.6**.**
Suppose that with and Then
Proof. Since , it follows that , and therefore, by (41), if ,
[TABLE]
which tends to 0 as under the stated conditions on The case of follows by observing that
[TABLE]
and proceeding as before.
Examples of basis functions which satisfy the conditions of the corollary are the Gaussians (for which ) and the generalized multiquadrics, whose Fourier transforms decay exponentially at infinity, but none of the homogeneous basis functions, since for these : see examples 5.12 below for more discussion. In fact, for a Gaussian or a multiquadric, decays expontially at infinity, and will decay exponentially in
4.1. Dependence of the constants on the shape parameter
If we want to apply (37) with as above (and some fixed ), it becomes interesting to ask how the constant depends on the shape-parameter On expects it to go to infinity with , and the question then is at what rate exactly. We will see that under reasonable assumptions on , this constant behaves like
We first analyse its dependence on for a general From the proof, is proportional to , where is any positive number such that for If we put
[TABLE]
then
[TABLE]
If we introduce the family of semi-norms
[TABLE]
then for , where the euclidean norm of the gradient, so that if , where
[TABLE]
On the other hand, there exists a such that if , where was defined in proposition 4.4 above. Combining these two estimates and observing that we see that
[TABLE]
if where
[TABLE]
and we get an approximate approximation estimate in the form
[TABLE]
We now derive lower bounds for when for a fixed , which we will sometimes drop from the notations. First of all, there exists a such that if This implies that if , so we can take
The behavior of for large depends on the asymptotic behavior at infinity of and of We consider the case of a polynomially decaying and that of an exponentially decaying one.
Example 4.7**.**
*(Polynomially decaying ) *Suppose that
[TABLE]
Then
[TABLE]
On the one hand, we have for any that
[TABLE]
which implies that
[TABLE]
which can certainly be bounded from above by a constant times if Taking in (45) we therefore find that for some constant It follows that for some constant and (47) implies an estimate
[TABLE]
with constants and which depend on (and on ), but not on the shape parameter Neglecting the numerical values of these constants, the second term will dominate the first as long as , and the estimate will be relevant for ’s in the range (since we want ).
Example 4.8**.**
*(Exponentially decaying ) *Suppose that is such that there exists constants and for which
[TABLE]
and
[TABLE]
An example of such a is given by the multi-quadric on : see Example 5.12 below.
Since , will then be bounded by a constant times
[TABLE]
Using that if , and, assuming wlog that , that for , we find that this expression is bounded by
[TABLE]
and using the first inequality of (50), it follows that is bounded by a constant times Remembering (45), it follows that is bounded from below by a constant times This still has as a free parameter, and taking and remembering the estimate for from the previous example, we find that we can take Since (50) implies that , we now have the approximate approximation estimate
[TABLE]
which is relevant for the range , where
Example 4.9**.**
*(The Gaussian) *A final interesting example is that of the Gaussian, with In this case, , and for is bounded by a constant times
[TABLE]
for any , where we used the inequality with and Asymptotically, for large , this behaves like , while , and we can choose (with a fixed ) to be bounded from below by a constant times
[TABLE]
If we now take the free parameters and equal to and remember that , we conclude that we can choose , and we have the following approximate approximation estimate for Gaussian RBF interpolation (recalling that in this case):
[TABLE]
where the constant can be taken arbitrarily close to 1.
5. Convergence of stationary RBF schemes for the heat equation
5.1. An RBF scheme for the heat equation
We introduce an RBF scheme for the Cauchy problem for the classical heat equation,
[TABLE]
being the Laplace operator, and examine its convergence. The scheme is a variant of the classical method of lines, and looks for approximate solutions of the form
[TABLE]
where the are differentiable functions. Here, is the Lagrange interpolation function of theorem 2.3, associated to a given basis function which we fix. We assume throughout this section that and We will see below that for the scheme to converge we will need that while for we can have approximate convergence. The coefficients of are determined by requiring that solve (51) exactly in the points of :
[TABLE]
while is taken to be equal to , the RBF-interpolant of This leads to the following initial value problem for the coefficients :
[TABLE]
Since this is an infinite system of ODEs we first discuss existence and uniqueness of solutions in suitable Banach spaces.
For , let
[TABLE]
One easily verifies that the convolution operator
[TABLE]
is a bounded operator on if Indeed,
[TABLE]
using that The sum of the series on the right is independent of and finite if , by theorem 2.8. It follows that if we let , and if the initial value for some , the system (54) has a unique -valued solution which is given by Next, if for we let (with some abuse of notation)
[TABLE]
then is a bounded linear operator from if Indeed, using the decay of ,
[TABLE]
The sum on the right converges and defines a 1-periodic continuous function on which is therefore uniformly bounded, while the factor in front can be estimated by
If , we can in particular take , and
[TABLE]
is the unique function (52) whose coefficients satisfy (54). We summarize this discussion in the following lemma:
Lemma 5.1**.**
If and if then there is a unique function of the form (52) which satisfies (53) and is a bounded linear map on , for each and
In particular, for each fixed , has tempered growth in , and thus possesses a well-defined Fourier transform, which we will study next.
5.2. Convergence of the scheme in Wiener norm
We start by computing the Fourier transform of Let us introduce the auxiliary function on by
[TABLE]
where the series converges absolutely, since
Lemma 5.2**.**
Let with and If , then the Fourier transform of with respect to is given by
[TABLE]
Proof. Since is in , it follows that is a bounded operator on , and hence the system (54) has an -valued solution if the initial value In particular, if with , then the function is absolutely integrable on for each fixed , and an application of Fubini’s theorem shows that the Fourier transform of is given by , where
[TABLE]
the series being absolutely convergent. By (54),
[TABLE]
where the second line follows from the Poisson summation formula, whose application is justified by the decay at infinity of and its Fourier transform. Hence which, together with the initial condition implies (57).
If , (57) follows by an approximation argument: if in with rapidly deceasing, then in , so by lemma 5.1, in also, since Hence their Fourier transforms converge in On the other hand, in and since is non-negative, in , and therefore in
The following proposition lists some useful properties of
Proposition 5.3**.**
Suppose that with and let be defined by (5.2). Then
(i) is a positive -periodic function, and iff
(ii) There exists a constant such that for
(iii) belongs to the Hölder space with
Proof. (i) The periodicity is obvious and the positivity of is an immediate consequence of the positivity of Next, iff for all Since is non-zero outside of , this implies that Conversely, any such is a zero, given that
Assertion (ii) follows from
[TABLE]
and the behaviour of for small
Finally, (iii) follows from lemma 2.6.
Property (iii) allows us to extend lemma 5.2 to functions of polynomial growth: compare lemma 3.5.
Lemma 5.4**.**
Suppose that for some such that Then the identity (57) holds in the sense of distributions for each Moreover, can be identified with the function
[TABLE]
which moreover is integrable on
Here, and below, without argument will indicate the distribution, and the function with which it can be identified on ; we recall that by lemma 3.5, can be identified with an -function. That the second term of (58) is in follows from lemma 5.3(ii).
Proof. We just clarify the statement of the lemma, and refer to Appendix B for the proof, which uses elements of the proof of lemma 3.5. The proof of that lemma shows that extends to a continuous linear functional on Since (i) and (iii) of proposition 5.3 imply that the function is in , its product with is well-defined as an element of the dual of
We can now show convergence in Wiener norm of the to the solution of the Cauchy problem (51): recall again (2).
Theorem 5.5**.**
Let with and and suppose that for some such that
[TABLE]
Let and let be the solution to the Cauchy problem (51) with initial value Then there exists a constant independent of and such that for ,
[TABLE]
In particular, converges to in sup-norm at a rate of
Proof. Note that the conditions on are weaker than those of theorem 3.4. Indeed, we will be applying corollary 3.8 with By lemma 5.4, we can estimate by
[TABLE]
Since is positive, the first term can be bounded by , by corollary 3.8. To estimate the second term, we first use proposition 5.3(ii) together with the inequality for to estimate the integral over by
[TABLE]
assuming wlog that Since the integral over can be bounded by
[TABLE]
the theorem follows, where in fact we have established the more precise bound
[TABLE]
with and , the constant of corollary 3.8.
Remarks 5.6**.**
(i) If we strengthen the hypothesis on to , we obtain an error bound of with a constant which is independent of
(ii) The estimate for the integral over is quite rough, but note that since is -periodic and equal to 0 in points of , can get arbitrarily close to on this set. A similar remark applies to the first term of (60). There may be room for improvement, by further analyzing the contribution of a small neighborhood of this set of points.
It is not difficult to verify that is the exact order of approximation if , and that the scheme does not converge if Let
[TABLE]
We will see in proposition 5.10 below that
Theorem 5.7**.**
Let for some such that for some Then if ,
[TABLE]
while if ,
[TABLE]
Proof. Since and , lemma 5.4, corollary 3.8 and Fatou’s lemma imply that
[TABLE]
where we have put By the mean value theorem applied to the exponential, there exist such that
[TABLE]
If , then as by proposition 5.3(ii), and (63) follows from
[TABLE]
If , then
[TABLE]
which proves (64).
5.3. Approximate approximation properties of the scheme
As we have just seen, our RBF scheme for the heat equation does not converge if , which is for example the case if and the basis function is the Hardy multiquadric. It turns out that we can then still achieve an arbitrarily small absolute error by an appropriate choice of the basis function, e.g. by introducing a shape parameter. This is again an approximate approximation phenomenon of the type encountered in section 4, and which if will again take the form of an apparent rate of convergence better than than up to some threshold , for initial conditions whose Fourier transform decay sufficiently rapidly at infinity. Suppose that with and , and let the initial condition be as in lemma 5.4. Let
[TABLE]
Theorem 5.8**.**
Let and If then there exist for any a constant which does not depend on such that if ,
[TABLE]
while if and , then
[TABLE]
for sufficiently small If , we can replace the first term on the right in these inequalities by
[TABLE]
Proof. We adapt the proof of theorem 5.5. If , there exists a such that if Hence, assuming wlog that ,
[TABLE]
We estimate the integral over by , which we bound by if , and by if , assuming in the latter case that If we finally use theorem 4.1 to estimate first term of (60) when , and corollary 3.8 when , the theorem follows.
Corollary 5.9**.**
Let and suppose that exists, so that Suppose that for some Then
[TABLE]
Compare with corollary 4.5. It follows from proposition 5.10 below that exists iff exists. One can also give a direct proof of this corollary using Lebesgue’s dominated convergence theorem: see the proof of theorem 6.8 below.
If and are small, then theorem 5.8 with for example equal to shows one may observe a higher apparent rate of convergence than the actual rate for small but not too small ’s if decays sufficiently rapidly at infinity. As in section 4, we can construct basis functions with small by taking these of the form and letting We start by deriving explicit formulas for and Recall the definition of and in proposition 4.4.
Proposition 5.10**.**
We have
[TABLE]
Proof.
[TABLE]
where and are continuous in a neighborhood of 0 . It follows that
[TABLE]
with The formula for follows similarly.
Note that Also note that since and since , by the ellipticity condition on at 0.
If we take , with , then , and It follows that
[TABLE]
where the series converges since
Corollary 5.11**.**
If with , then as
Examples 5.12**.**
(i) Hardy’s multiquadric with shape parameter is defined by
[TABLE]
where the minus sign serves to make positive. Note that with , so that we are in the situation of corollary 5.11, except for an irrelevant multiplicative factor of The Fourier transform of on is given by
[TABLE]
where is the MacDonald function, or modified Bessel function of the second kind: see for example cf. Baxter [1]. The limiting form of for small values of the argument implies that as , (with ), so that , and our RBF-scheme for the heat equation will converge if , at a rate of The MacDonald function is known to decay exponentially at infinity, so that we can apply corollary 5.11 to conclude that and as In fact, these will converge to 0 at an exponential rate since does.
If , then , and the scheme will not converge. However, corollary 5.11 together with theorem 5.8 shows that we can make the error arbitrarily small by taking the shape parameter sufficiently large, with moreover an arbitrarily large apparent order of convergence for small but not-too-small ’s if the Fourier transform of the initial value decays sufficiently rapidly at infinity. At first sight, this may seem strange, because we are after all simply performing an additional scaling by , and we are already using scaled basis functions for our interpolation. Note, though, that we are interpolating with on , and not on
(ii) If we take a homogeneous basis function, with , then is proportional to on , so that and corollary 5.11 does not apply, as indeed it shouldn’t: if is homogeneous, then and are independent of , and therefore and also.
Remark 5.13**.**
One can perform a similar analysis of the -dependence of the constant of theorem 5.8 as the one we did in subsection 4.1, with similar conclusions; we skip the details.
6. Convergence of stationary RBF schemes for pseudo-differential evolution equations
The results of the previous section remain valid for a large class of constant coefficient pseudo-differential evolution equations
[TABLE]
under suitable conditions on the symbol , notably The operatore is defined by initially with domain , for example. We are in fact restricting ourselves to a rather special class of pseudo-differential operators, the Fourier multiplier operators or convolution operators: if is a tempered distribution and if is a Schwarz class function, then is the convolution of with the inverse Fourier transform of These can also be considered as constant coefficient pseudo-differential operators, general pseudo-differential operators having symbols which also depend on The latter are outside of the scope of this paper, but the multiplier operators we consider here already contain many interesting examples, such as the fractional Laplacians or the generators of Lévy processes. Regarding the latter, the equation (68) occurs in mathematical finance, for derivative pricing in exponential Lévy models, and has been treated numerically in [2] using the RBF scheme we investigate here, with good results. We note that, from a theoretical point of view, convergence of this type of stationary scheme for convolution operators is not obvious, since these, as integral operators, are in general non-local (except if is a polynomial), and we already know from section 3 that to obtain good convergence we will need to use basis functions with polynomial growth. To understand why such schemes nevertheless perform well numerically, as for example observed in [2], was a main motivation for this paper.
As regards the conditions on the symbols, we will work with the class of -functions for which there exists for each multi-index a constant such that
[TABLE]
We will not need the faster -decay for which is a standard requirement in much of pseudo-differential theory and which for example is satisfied by the symbols of partial differential operators. The requirement of having -symbols is slightly restrictive, and a priori excludes symbols such as for non-integer but positive , but the theory presented here will apply to regularized versions of such symbols, for example, replacing by with equal to 1 on a neigborhood of 0. Modifying a symbol on a compact set will not affect the singularities of the (distributional) kernel of but will change its decay properties at infinity. Care then has to be taken with the growth properties of the initial values which we allow for (68) (equivalently, the singularities at 0 of ), in the various convergence theorems, which makes the statements of these theorems more complicated. On the other hand, if has non-negative real part, then the solution of (68) with initial value makes sense for any tempered distibution , being the inverse Fourier transform of We therefore limit ourselves here to smooth symbols.
We first examine the action of on :
Theorem 6.1**.**
Let If and if with , then is a bounded continuous function and there exists a constant such that
The proof is similar to that of the bound (8) of theorem 2.3: see appendix A. In fact, here, and below, it would have sufficed to require (69) only for We suppose from now on that
The second condition we will need to put on the symbol is that it has a non-negative real part:
[TABLE]
Perhaps curiously, we do not need or to be elliptic. In particular, our results below will for example also apply to the free Schrödinger operator, for which , or the regularized ”half-wave equation”, with outside of a neighborhood of 0. The heat equation obviously also falls within the class of allowed evolution equations, as do the Kolmogorov-Fokker-Planck equations associated to certain Lévy processes: see example 6.11(ii) below.
The proofs in this section will be similar to the ones for the classical heat equation in section 5, and we will only signal the differences.
We will extend the results of section 5 to the Cauchy problem for (68) for symbols satisfying (69), (70). The proofs will be similar to the ones for the classical heat equation in section 5, and we will mainly signal major differences. We are again interested in solving (68) with initial value using a semi-discrete scheme which is the RBF-variant of the classical method of lines, looking for approximate solutions of the form (52), where is the Lagrange function on associated to a basis function with and : cf. theorem 6.1. The coefficients of are again determined by requiring that solve (68) exactly in the interpolation points: for This now leads to the (infinite) system of ODEs
[TABLE]
where has symbol We again have to solve this system with initial condition One shows as in lemma 5.1 that if then there exists a unique solution in and that, as a consequence, is in if with norm bounded by a constant times that of
Remark 6.2**.**
A noteworthy feature of the RBF-scheme is that we do not need to discretize the operator , contrary to for example Finite Difference schemes, but only need to know its action on (or on when working with irregularly spaced interpolation points). This is an advantage when the operator is a singular integral operator: see [2] for concrete examples and further discussion.
To analyze the RBF scheme we introduce the auxiliary function on defined by
[TABLE]
where the series converges absolutely since To make the connection with the previous section note that if then , with given by (5.2).
One shows, similarly to lemma 5.2, that if the initial condition is a Schwarz-class function, or more generally if , and if satisfies (70), then the Fourier transform with respect to of is given by
[TABLE]
Since is in , we can extend this formula to initial values of polynomial growth strictly less than whose Fourier transform coincides on with an element of We also note that is -periodic in and has non-negative real part. Its zero-set contains but may be bigger. We have the following basic estimate which generalizes proposition 5.3(ii).
Proposition 6.3**.**
Suppose that for some , and let with Then there exists a constant such that for all ,
[TABLE]
Proof. We have
[TABLE]
The first term is bounded by a constant times if As for the other terms, is comparable to if , so , and by (12),
[TABLE]
so that
[TABLE]
We will suppose from now on that for some such that The unique solution of the initial value problem in the space of tempered distributions is given by which is well-defined as a product of a tempered distribution and a -function all of whose derivatives have at most polynomial growth. One shows, using the arguments of Appendix B, that is a continuous function of polynomial growth of order at most for each , and that the Fourier transform of is given by
[TABLE]
We then can state the following convergence theorem.
Theorem 6.4**.**
Let , , and suppose that Then there exists a constant such that if is a function of polynomial growth of order strictly less than such that , then
[TABLE]
Proof. The proof is similar to the proof of theorem 5.5, with a small twist. By proposition 6.3 and the elementary inequality for , and since333In the case of the heat equation (), was negative for small enough , but this is not clear for for general , in particular when is not elliptic.
[TABLE]
for sufficiently small , there exists an such that if , then
[TABLE]
which in absence of further hypotheses on the symbol we simply bound by The rest of the proof proceeds as before: since ,
[TABLE]
The last term can be estimated using theorem 3.4 while the integral can be bounded by
[TABLE]
Now use (76) for the first integral, and bound the second one by
We note that we do not get an improved convergence rate beyond if is of negative order , even if were rapidly decreasing and were smoothing. We therefore limit ourselves to operators of non-negative order:
Also note that, on comparing theorem 6.4 for with theorem 5.5, we require a stronger decay of at infinity, which translates into two additional degrees of smoothnes (two extra derivatives) of If we assume that is elliptic, then
[TABLE]
is independent of and theorem 6.4 remains valid if , on replacing by : compare the proof of theorem 5.5.
The argument can be refined to give a rough approximate approximation estimate: the proof above shows that if and if ,
[TABLE]
where
[TABLE]
On closer examination, the proof of proposition 6.3 shows that
[TABLE]
where we recall that (the constant in front is not optimal). From this it easily follows that
[TABLE]
where we’ve put
[TABLE]
with defined by (40); note that the series converges since , by assumption.
Note that Equations (78) , (80) and theorems 4.1 (when ) and 3.4 (when ) now imply the following rough approximate approximation estimate:
Theorem 6.5**.**
Suppose that , , with Then there exists, for each a constant such that for and , the error can be bounded by
[TABLE]
with , where the second term can obviously be absorbed in the final one if
We call this a rough approximate approximation theorem since the bound does not correctly reflect the -dependence. In particular,there is no corresponding lower bound for the approximation error. It does however imply that if is small, then the error will appear to be smaller that for not-too-small It can in particular be applied with basis functions depending on a large shape parameter , since one easily shows that as if with ; cf. corollary 5.11 and its proof. One also shows that the constant in (81) (for any fixed ) again behaves like for the ’s we considered in subsection 4.1.
To obtain a more precise result, and in particular an asymptotic lower bound for the approximation error, we put an additional hypothesis on
Definition 6.6**.**
We will say that is asymptotically homogeneous at infinity if for some and if
[TABLE]
exists for all
We will also assume for simplicity that exists: if not, one has to replace by the corresponding liminf or limsup, and replace equalities by inequalities at the appropriate places in theorem 6.8 below. Recall that by the definition of the Buhmann class.
Lemma 6.7**.**
If , , is asymptotically homogeneous at infinity, then
[TABLE]
exists and is equal to
Proof. The hypotheses on easily imply that
[TABLE]
for all and all It follows that for each ,
[TABLE]
where the interchange of summation and limit can be justified by Lebesgue’s dominated convergence theorem, using that Since
[TABLE]
the lemma follows.
Theorem 6.8**.**
Suppose that , , is asymptotically homogeneous and let the initial value be as in theorem 6.4, in particular, Then if ,
[TABLE]
while if this limit equals
[TABLE]
where we note that
Proof. First of all, since and , the limit we have to compute is equal to
[TABLE]
If then for each , as , and consequently, using the Taylor expansion of the exponential function,
[TABLE]
while if , this limit equals The dominated convergence theorem, using the estimate (76), then shows that
[TABLE]
exists and is equal to the right hand side of (84) if is sufficiently small. The integral over can as before be bounded by which goes to 0 when multiplied by if , which proves the theorem.
The theorem shows that the approximation order is exact if , but the latter can be 0, for example if is odd and is even, in which case would have to be purely imaginary to comply with (70)). A concrete example is given by the constant-coefficient transport equation ,
Remark 6.9**.**
We include some observations on the case of , which has some relevance for applications: for example, the transition probability densities of a compound Poisson process satisfy a pseudo-differential equation of the type considered here, with a symbol which is bounded together with its derivatives under suitable hypotheses on the probabilty distribution of the jumps (e.g. when the latter has moments of all, or sufficiently high, order).
If with , we have to substract from the right hand side of (83) (where we recall that ), making the limit depend on Also, the contributions of the -norms of et de are now of the same order , as is our estimate for the norm of , and cannot be separated anymore as before. We can nevertheless impose a stronger decay at infinity to control the latter, and use corollary 4.5 to show that if for some , then
[TABLE]
A further analysis shows that if, for example, has compact support, then we have the more precise result
[TABLE]
a result which remains true if the real part of is not necessarily positive, and which implies (86) if it is. Since the gain with respect to the latter seems relatively modest, we skip the details.
We cannot deduce from lemma 6.7 a precise approximate approximation estimate similar to theorem 5.8 since the limit isn’t ”uniform in ”. In fact, as we will show now, lower order parts of the symbol will also occur in such an estimate. We consider the case of a constant coefficient partial differential operator, a polynomial of degree with non-negative real part: We write
[TABLE]
with homogeneous of degree Then , where
[TABLE]
and, as is easily verified,
[TABLE]
for , while since is a constant. Since
[TABLE]
it follows that for each there exists a such that
[TABLE]
Since if is sufficiently small, the arguments above can be used to prove the following estimate:
Theorem 6.10**.**
For all there exists a such that if ,
[TABLE]
where can be replaced by if
Examples 6.11**.**
We finally mention some examples of pseudo-differential evolution equations which are of interest for applications, and to which our results apply.
(i) The Kolmogorov - Fokker - Planck equation associated to a Lévy process on Recall that according to the Lévy - Khintchine theorem such a process is completely characterized by its characteristic function, with
[TABLE]
where is a positive semi-definite linear operator and where is a positive Borel measure on such that
[TABLE]
called the Lévy measure; here is a compactly supported function which is equal to 1 on a neighborhood of 0, and which can be taken smooth, if necessary.
If, for a given , we let , then satisfies (68) with and initial value Note that satisfies (70) since
[TABLE]
Under appropriate hypothese on the Lévy-measure one can derive symbol-type estimates for For example, when with , and a rapidly decreasing continuous function, then if , and in if : cf. remark A.4 in Appendix A below. Examples of such processes are the jump-diffusion processes and the CGMY-processes of mathematical finance, which were treated numerically in [2], [5] and [6] with different choices of basis functions (respectively the multi-quadric, inverse multi-quadric and the cubic spline). If and if is a finite measure on of total mass , we have the special case of a (multi-dimensional) compound Poisson process with intensity and drift , then being the probability law of the jumps. The symbol will then be in if has moments of this order.
(ii) The special case of symmetric stable Lévy-processes leads to the fractional heat equation:
[TABLE]
with Here the symbol, is not smooth in 0, so our theory does not apply directly, but it applies to regularized versions with replaced by , where is equal to 1 outside of a small ball. The theory of this section can nevertheless be extended to certain families of symbols having singularities in The presence of such singularities cause a number of technical problems, notably with regards to the initial values which are allowed in theorem 6.4 and the other theorems of this section: in the case of the fractional heat equation, needs to be defined as a tempered distribution, so it becomes natural to require that On the other hand one can use lemma A.2 to show that the inverse Fourier transform of decays as at infinity, so will have to be taken integrable with respect to this weight. The details of this case will be treated elsewhere. Note that once such a theory is in place, we can consider generalizations of theorem 6.10 to general polyhomogeneous symbols.
(iii) Certain non-parabolic operators, such as the transport equation , the so-called ”half wave equation” , modified so as to make its symbol smooth in a neighborhood of 0, or the free Schrödinger equation, : for these, the symbol is purely imaginary.
Appendix A **Proof of theorem 2.3 **
We prove the existence and main properties of the cardinal function associated to a basis function as stated in theorem 2.3. As already mentioned, this was done by Buhmann [3], [4] for a more restricted class of radial basis functions. The main difference in our treatment and that of [3] is the use of lemma A.2 below, relating the decay at infinity of the Fourier transform of a function with its behavior in 0, which allows one to go beyond the class of basis functions considered by Buhmann.
Before embarking upon the proof, it may be interesting to observe that the estimates (4) are similar to the symbol conditions of pseudodifferential calculus, except that the latter concern the behavior at infinity444Indeed, if (5) were required for all orders (with constants which may then depend on ), then , where is a -function such that is compactly supported, and is the standard symbol class of order (cf. [15]). instead of at 0. From this point of view, (5) corresponds to having an elliptic symbol, whence our terminology.
We note that as a consequence of conditions (ii) and (iii) of definition 2.1,
[TABLE]
Turning to the proof of theorem 2.3, following Buhmann we start by defining as the inverse Fourier transform of the right hand side of (9), observing that since the latter is an integrable function, by definition 2.1(iv), is a well-defined continuous function. We first show that has the proper decay at infinity.
Theorem A.1**.**
Let , and let
[TABLE]
Then there exists a positive constant such that
[TABLE]
The proof will use the following lemma, which basically is a special case of a classical estimate for kernels of convolution operators: see Stein [15], proposition 2 of Chapter VI, section 4.4.
Lemma A.2**.**
*Let and let be supported in some ball such that555Note that since *
[TABLE]
Then the inverse Fourier transform satisfies
[TABLE]
with a constant , where only depends on
Stein in fact shows that if (92) is satisfied at all orders, without necessarily being compactly supported, then can be identified with a -function away from 0, satisfying for all and all This result is stated and proven there for , but the proof generalizes to any We only need this estimate for itself, in which case we only need (92) for the limited number of derivatives of indicated, and we furthermore only need it for large (note that if has compact support, is continuous, even , and Stein‘s estimate for at 0 becomes trivial). The proof in [15] uses the Paley-Littlewood decomposition. An elementary prove of lemma A.2 can be given by writing
[TABLE]
where with bounded derivatives such that for , for , and integrating the first integral by parts
Proof of theorem A.1. Let such that in a neighbourhood of [math] and For , define by and note that the supports of the are disjoint. Finally, let (”c” for ”complement”), so that together with the ’s form a partition of unit. Then
[TABLE]
where
[TABLE]
We examine the decay in of each term separately.
Decay of . The function is in , since the denominator is a strictly positive periodic function which is on the complement of and therefore on the support of Multiplying with , we find that is with integrable derivatives of all orders, which implies by the usual integration by parts argument that
Note that is at best in the points of , so integration by parts will not give the required decay for of , We use lemma A.2 instead.
Decay of . Since
[TABLE]
and since is on the suport of , the estimates (89) easily imply that satisfies condition (92) of lemma A.2 with (for does, and then also ). It follows that , since is rapidly decreasing.
Decay of , . This is similar, except that we have to pay attention to the size of the constant in front of the The Fourier transform will now be supported near Shifting by , we see that
[TABLE]
is supported in a small neighbourhood of 0, with derivatives of order bounded by , with independent of Lemma A.2 then implies that
[TABLE]
Since by assumption, summation over completes the proof.
The same arguments prove theorem 6.1: , and has derivatives which are integrable since , as has Finally, and , satisfy the hypothesis (92) of lemma A.2, the latter with constants bounded by which are summable since
We continue with the proof of theorem 2.3. The expressions for above also show that satisfies the Strang-Fix conditions (10). Once we have defined through its Fourier transform, it is immediate to check that for : indeed, by the -periodicity of the denominator, writing the integral over as a sum of integrals over translates of ,
[TABLE]
It remains to recognise as a sum of translates of To show this we first write the denominator of (9) as a Fourier series:
[TABLE]
One verifies by the similar arguments as the ones of the proof of theorem A.1 that
[TABLE]
write
[TABLE]
and estimate the first integral using lemma A.2 and the second by integrating by parts.
It follows that (96) converges absolutely. We then claim that
[TABLE]
where the series converges absolutely and uniformly on compacta, by (97), since grows at most as , by assumption. Formally, (98) follows by writing
[TABLE]
except that the final step does not make sense for an arbitrary since will not be integrable in 0 if and even if it is, when , it might differ from integration against the function by a distribution supported in 0.
We have to carefully distinguish between the tempered distribution and the locally integrable function with which it can be identified on The relation between the two is given by the following identity: there exist constants , such that for all ,
[TABLE]
where the sum is interpreted as empty if Indeed, the first integral converges since has an integrable singularity at 0. The sum of the two integrals on the right defines a tempered distribution. If we denote this distribution by then the restriction of to can be identified with the function The difference is then supported in 0, and therefore a linear combination of derivatives of the delta distribution in 0. To bound , we use the following lemma, whose proof we postpone till the end of his section:
Lemma A.3**.**
If , then the inverse Fourier transform is a continuous function which is bounded by for non-integer and by if is a positive integer.
Since the inverse Fourier transform of is a polynomial of order , and since, by assumption, , has polynomial growth of order strictly less than , it follows that , which is equivalent to
We now use (A) to prove that (98) holds as tempered distributions, that is, if , then
[TABLE]
If we let
[TABLE]
then is if , and if , with all its derivatives rapidly decreasing. To obtain a function in the Schwartz class we convolve with , where with integral 1. Let Then we first claim that
[TABLE]
To show this it suffices to consider the case that , since is integrable if and converges uniformly. If we assume for example that then by the Taylor expansion with remainder there exists a constant such that for all ,
[TABLE]
where we note that if or if , then derivatives of of order exist, while if and , these derivatives exist a.e. but are uniformly bounded, and the estimate remains true. Next, for since is Furthermore, since vanishes of order in 0, it follows that for and (101) follows by dominated convergence,
Since is Schwartz-class, we have By (96) , which can be interpreted as the product of a tempered distribution and a test function, whose Fourier transform equals
[TABLE]
One easily verifies using (97) that
Since , and since for some , Lebesgue’s dominated convergence theorem then shows that
[TABLE]
Finally, one checks that the functions and are integrable on with respect to the product of the Lebesgue measure and the counting measure. A double application of Fubini’s theorem then shows that the right hand side equals
[TABLE]
which proves (100). The pointwise identity (98) follows since both sides are continuous.
Proof of lemma A.3. The lemma presumably is classical, but since we could not locate a suitable reference (apart from the well-known case of homogeneous ), we sketch a proof for convenience of the reader. If , the inverse Fourier transform is a bounded function, so suppose that Since is integrable, its inverse Fourier transform is a bounded continuous function, and it therefore suffices to examine the inverse Fourier transform of the tempered distribution defined by the first integral on the right hand side of (A). This distribution being of compact support, its inverse Fourier transform is the function obtained by taking :
[TABLE]
where we put This can be bounded by
[TABLE]
Split the integral into an integral over and one over the complement, where is some fixed number and where we assume wlog that The first integral converges absolutely, since
[TABLE]
and we can bound its contribution to by As for the integral over , it can be bounded by a constant times
[TABLE]
assuming that Since by the definition of , this will be bounded by Finally, if , , then and
[TABLE]
Remark A.4**.**
The only hypotheses on we needed for this lemma is that it be integrable on and that near 0. If we strengthen the first assumption to
[TABLE]
where , then will be -times differentiable, and we will have that
[TABLE]
for : it suffices to observe that if is given by (102) then its derivative of order is given by the same formula with replaced by
These estimates can be used to obtain symbol estimates for the generator of pure-jump Lévy processes with Lévy measure
[TABLE]
with , and a rapidly decreasing continuous function satisfying (103) for all The inverse Fourier transform of then is, modulo a function in , equal to the symbol of the generator of the Lévy process, and the estimates show this symbol to be in if , and in for any otherwise (even a bit better, since the first few derivatives will decay relative to the symbol itself). Examples are given by the CGMY-processes which are used in financial modeling.
Appendix B Some technical proofs
B.1. Proof of lemma 3.5
Let , where and Then gives rise to a tempered distribution defined as follows: if be equal to 1 on a neighbourhood of 0, we put
[TABLE]
The integral converges since in a neighbourhood of 0 and defines a distribution of order Note that coincides on with the locally integrable function
We next observe that extends to a continuous linear functional on the Hölder space with Indeed, if , then the Taylor expansion formula with integral remainder term easily implies that
[TABLE]
which shows, with and , that is well-defined and continuous.
We can, in particular, let act on the imaginary exponentials The function
[TABLE]
is then found to be bounded by , since , and one easily verifies that the inverse Fourier transform of coincides with If one has the stronger estimate
[TABLE]
which can be seen as follows: write with the characteristic function of a small ball around 0. Since is integrable, its inverse Fourier transform tends to 0 at infinity, by the Riemann-Lebesgue lemma. We can therefore wlog assume that is supported in If we apply (B.1) with then666e.g. by using the Taylor formula with integral remainder in the form
where
[TABLE]
where is the inverse Fourier transform of the -function By the Riemann-Lebesgue lemma, as , for all , and the same is true for the integral over , by the dominated convergence theorem (the are bounded). Hence for , as claimed.
Now let be a measurable function on of polynomial growth of order strictly less than , such that its Fourier transform (in the sense of tempered distributions) satisfies
[TABLE]
We write for Then is a distribution which is supported in 0, and therefore of the form for certain and with Since the inverse Fourier transform of is a polynomial of degree , it follows that , the largest integer which is strictly smaller than , since otherwise would grow at a rate of at least in certain directions. If this would contradict the bound , and if this would contradict (106).
In follows that also extends to a continuous linear functional on We exploit this to define by duality.
If , we let
[TABLE]
Note that is the formal (real) adjoint of By lemma 2.6, is with and uniformly bounded together with all its derivatives, since -periodic. In fact, this is true even if with the same , on account of the decay at infinity of We can then define , as a tempered distribution and, more generally, as a bounded linear functional on by
[TABLE]
We next check that is the Fourier transform, in distribution sense, of This is done by a standard approximation argument, with some care with the spaces in which the approximating sequence converges. We first note that we can assume without loss of generality that is compactly supported: indeed, we can write with compactly supported and , and we know already that
So let be compactly supported, and let be a non-negative symmetric function with Let Then
Lemma B.1**.**
* in the dual of , with and *
Proof. On account of the symmetry of ,
[TABLE]
which is valid both for Schwarz-class functions and for Write If , then pointwise on for all , while a trivial estimate shows that , uniformly in , for This, together with the remainder estimate (105), the integrability of and Lebesgue’s dominated convergence theorem, implies that Since also for all , the lemma follows.
The lemma immediately implies that if , then , so in and even in with and as above.
On the other hand, if we let be the inverse Fourier transform of , then since is, and We have that , with the inverse Fourier transform of , so and By hypotheses, for some If such that , then by (24), writing ,
[TABLE]
as , using for example the first order Taylor expansion for for plus a trivial estimate for This certainly implies that in , so we conclude that and therefore as distributions.
We finally show that , where is the (distribution obtained by integrating against the) function
[TABLE]
We first check that is well-defined and in : first of all, each of the terms on the right hand side is in , on account of the Fix-Strang conditions satisfied by for and the integrability of Next, the function is absolutely integrable on with respect to the product of Lebesgue measure and the counting measure, since
[TABLE]
Fubini’s theorem then implies that is well-defined for almost all and that If , then a double application of Fubini will show that
[TABLE]
Since, by the Fix-Strang conditions (10), all derivatives of order of in 0 are 0, the last integral is equal to , and therefore , which finishes the proof of lemma 3.5.
Remark B.2**.**
The lemma and its proof generalizes to ’s such that is a finite Borel measure with respect to which the function is integrable, provided that (the reason being that we then no longer have (106)).
B.2. Proof of lemma 5.4
It again suffices to consider the case of compactly supported ’s. We use the notations of the proof of lemma 3.5 above: in particular, let be the inverse Fourier transform of where ia an approximation of the identity. We have seen that in , where and Since this implies that
[TABLE]
in and hence in
On the other hand, we have seen in the proof of lemma 3.5 that in if Hence by lemma 5.1, if then in and therefore as tempered distributions. This implies that
[TABLE]
where we used lemma 5.2. Hence as tempered distributions, as claimed. We finally prove (58): if we let
[TABLE]
then is a -function of and
[TABLE]
Since vanishes to order in , by proposition 5.3(ii), the representation from the proof of lemma 3.5 shows that the distribution can be identified with the locally integrable function
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] B. Baxter, On the asymptotic cardinal function of the multiquadric φ ( r ) = ( r 2 + c 2 ) 1 / 2 𝜑 𝑟 superscript superscript 𝑟 2 superscript 𝑐 2 1 2 \varphi(r)=(r^{2}+c^{2})^{1/2} as c → ∞ → 𝑐 c\to\infty , Computers & Math. with Appl., 24 , 1-6 (1992).
- 2[2] R. Brummelhuis and T. L. Chan, A radial basis function scheme for option pricing in exponential Lévy models, Appl. Math. Finance, 21 , 238-269 (2014).
- 3[3] M. Buhmann, Multivariate Cardinal Interpolation with Radial Basis Functions, Constr. Approx. 6 , 225-255 (1990).
- 4[4] M. Buhmann, Radial Basis Functions , Cambridge Monographs on Applied and Computational Mathematics, Cambridge University Press, 2003
- 5[5] R. T. L. Chan, Adaptive Radial Basis Function Methods for Pricing Options Under Jump-Diffusion Models, Comp. Econ. 47 (4), 623-643.
- 6[6] R. T. L. Chan and S. Hubbert, A Numerical Study of Radial Basis Function Based Methods for Options Pricing under the One Dimension Jump-diffusion Model, Rev. Deriv. Res., 17 (2), 161 - 189 (2014).
- 7[7] G. Fasshauer, Meshfree Approximation Methods with Matlab , Interdisciplinary Mathematical Sciences 6 , World Scient. Publ. Co., 2006.
- 8[8] Y. C. Hon and X. Z. Mao, A Radial basis function method for solving options pricing model, Financial Engineering, 8 , 31-49 (1999).
