A generic construction for high order approximation schemes of semigroups using random grids
Aur\'elien Alfonsi, Vlad Bally

TL;DR
This paper introduces a universal method for constructing high-order approximation schemes for semigroups of linear operators using random grids, applicable to various processes including diffusions and deterministic systems.
Contribution
It develops a general framework for high-order semigroup approximation schemes with random grids, achieving any order of accuracy with controlled complexity and broad applicability.
Findings
Achieves arbitrary order of approximation with finite variance estimators.
Constructs random grids and coefficients for universal applicability.
Demonstrates effectiveness on diffusions, ODEs, and Markov processes.
Abstract
Our aim is to construct high order approximation schemes for general semigroups of linear operators . In order to do it, we fix a time horizon and the discretization steps and we suppose that we have at hand some short time approximation operators such that for some . Then, we consider random time grids such that for all , for some , and we associate the approximation discrete semigroup Our main result is the following: for any approximation order , we can construct random grids and coefficients , with such that \[…
| Standard deviation of | Used for approx of order | |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematical Approximation and Integration · Stochastic Gradient Optimization Techniques · Stochastic processes and financial applications
A generic construction for high order approximation schemes of semigroups using random grids
Aurélien Alfonsi111Université Paris-Est, Cermics (ENPC), INRIA, F-77455 Marne-la-Vallée, France. email: [email protected] and Vlad Bally222LAMA (UMR CNRS, UPEMLV, UPEC), MathRisk INRIA, Université Paris-Est. email: [email protected]
Abstract
Our aim is to construct high order approximation schemes for general semigroups of linear operators . In order to do it, we fix a time horizon and the discretization steps and we suppose that we have at hand some short time approximation operators such that for some . Then, we consider random time grids such that for all , for some , and we associate the approximation discrete semigroup Our main result is the following: for any approximation order , we can construct random grids and coefficients , with such that
[TABLE]
with the expectation concerning the random grids Besides, and the complexity of the algorithm is of order , for any order of approximation . The standard example concerns diffusion processes, using the Euler approximation for . In this particular case and under suitable conditions, we are able to gather the terms in order to produce an estimator of with finite variance. However, an important feature of our approach is its universality in the sense that it works for every general semigroup and approximations. Besides, approximation schemes sharing the same lead to the same random grids and coefficients . Numerical illustrations are given for ordinary differential equations, piecewise deterministic Markov processes and diffusions.
Keywords: approximation schemes, random grids, parametrix, Monte-Carlo methods.
AMS: 60H35, 65C30, 65C05, 65C20
1 Introduction
We consider a semigroup of linear operators and we want to construct high order approximation schemes based on some random grids. Before presenting our general result, we would like to present the popular example of diffusion processes and of approximation schemes of Euler type. Consider the diffusion process
[TABLE]
where is a -dimensional Brownian motion and are smooth vector fields. Our aim is to construct an approximation scheme for the semigroup , where is the diffusion process starting from . Given the time horizon and the time step one constructs the Euler scheme of step by
[TABLE]
Then one constructs the approximation semigroup for . It is well known that
[TABLE]
where is the supremum norm of and its derivatives up to order four. The proof is based on Lindeberg method (or Duhamel’s principle):
[TABLE]
Since
[TABLE]
the above inequality gives (2). If we want to go further we develop as well and we obtain
[TABLE]
The last term is of order , which gives an error of order two if we only keep the two first terms. And one may continue and go further in the development: one develops and so on. This is similar to the development made in the classical parametrix method. But now a problem appears: how to compute in the second term? In the classical parametrix method, one uses an integration by parts formula based on the infinitesimal operator of the diffusion semigroup. Here we follow another way: we develop itself in the same way as for in order to improve the order of approximation. So, in our approach we have two simultaneous developments: an "horizontal" one as in (1) and a "vertical" one which is used in order to refine And in both cases we continue the development up to the moment that we have obtained the order of approximation that we desire. The control of this two folds "Taylor expansion" gives rise to a rather intricate combinatorial problem. The natural way to describe this is to use some trees which are constructed by backward recurrence, which is a little bit tricky. A second problem concerns the computation of the sum and more generally of sums of the form . Computing all these terms would make the use of the development (1) inefficient for computational purposes. The idea is then to randomize by using order statistics and then to use the Monte Carlo method in order to compute it. This is the reason for which random grids come on in our schemes.
These developments are made in Sections 2 and 3. Eventually, we show that for any order , there exists , coefficients and random grids with for some such that
[TABLE]
This is our first main result, stated precisely in Theorem 3.10, where we give an explicit construction of the coefficients and of the time-grids . Thus, the complexity of our algorithm remains of order , for any order of precision. However, seriously increases with , and one has to take care about this in the complexity analysis for the choice of .
To use the approximation in practice, one has to work with a probabilistic representation of the semigroup. On the discretization time grid , we define the corresponding Euler scheme by and
[TABLE]
Since the grids are independent from , we then have for smooth functions
[TABLE]
and the right hand side gives an estimator that can be computed in operations. Then, an important issue is the variance of this estimator. In Section 4, we present a specific organization of the algorithm which allows to get a finite variance. Theorem 3.10 proposes a particular way to gather the terms, i.e. a partition of , and Theorem 4.4 shows that the variance of is bounded for all , so that the variance of the estimator is bounded.
An important and nice feature of our approach is that it is generic and provides an algorithm that can be used in many contexts. Indeed, in the previous approach the only fact which is necessary in order to make the algorithm work is to have at hand a short time approximation for such that (4) holds. The construction of the grids and the coefficients only depends on this. This leads us to consider the following abstract framework. Let be a vector space endowed with a family of seminorms , , such that . We consider a family of linear operators on that have the semigroup property, i.e and for all , . Our goal is to approximate the semigroup and build, for any a linear operator such that
[TABLE]
To achieve this goal, we suppose that we have at our hands a family of linear operators , , such that we have for some and ,
[TABLE]
where . We note the identity operator on and, for , the operator obtained by applying times the operator . We also assume that all these operators satisfy
[TABLE]
The Euler scheme discussed before corresponds to with , and the approximation of order only involves . But, if we want to construct higher order schemes as in (1), we have to mix operators for . This leads us to consider grids with the property that for every we have for some . Then we define . Notice that is built by using the "short time" approximation operators only. Eventually, we show (see Theorem 3.10) that for any order , there exists coefficients and random grids with for some and constants and such that
[TABLE]
We stress that the coefficients and the grids does not depend on nor on the specific form of : only the order of approximation and in () matter. Then, we give several examples of applications besides the Euler scheme: the Ninomiya Victoir scheme for diffusion processes (then and ), or approximation schemes for ordinary differential equations and piecewise deterministic Markov processes.
The approximations introduced in this paper are of any order with a computation time in . To calculate then with a precision , we naturally use a Monte-Carlo method with and samples, which has a computational cost of . Since is arbitrary large, we will denote by this complexity. There is a large literature in numerical probability dedicated to construct either unbiased estimators of , leading then to a computational cost of (but this is only true in the case of finite variance), or approximated estimators leading to a computational cost of . Let us give an overview of the different methods to position our work.
When , the Richardson-Romberg extrapolation provides an approximation of order , and Pagès [23] shows in the case of the Euler scheme for SDEs how to get with this method an estimator with bounded variance. In a different way, extending Fujiwara’s method, Oshima et al. [22] propose approximations for SDEs of any order by considering linear combinations of Ninomiya and Victoir schemes with different time steps. These approximations have very similar properties to the ones presented in this paper, but they are obtained with a significantly different approach: they are constructed with linear combinations of schemes using uniform grids obtained with multiples of the same time-step, while our approximations uses non-uniform time grids that are refined at some random places. Also, the principle of our methodology is not to find a combination of schemes that cancels the terms of orders for , but instead to calculate the contribution of all these terms.
The Multi-Level Monte-Carlo (MLMC) method proposed by Giles [12] that generalizes the statistical Romberg method of Kebaier [16] gives another generic way to approximate in . McLeish [19] and Rhee and Glynn [24] have then proposed an unbiased estimator constructed with similar ideas, see also the recent work of Vihola [25]. Contrary to the previous approaches, the MLMC method does not rely on the development of high order approximations since it already works using the Euler scheme. It stems from a clever probabilistic representation and variance analysis. The MLMC method is in fact complementary to high order approximations. For instance, Lemaire and Pagès [18] have proposed estimators combining the MLMC method and the Richardson-Romberg extrapolation, improving the asymptotic complexity of the standard MLMC method with the Euler scheme.
Last, there is a stream of papers that develop unbiased estimators for in the case of SDEs. We have already mentioned the unbiased estimators [19, 24] that are obtained as telescopic series and that use a discretization scheme with more and more refined time grids. Another direction of research is to try to write , where is a simulatable process (e.g. a Euler scheme) and is some computable weight. By using a change of measure and a rejection algorithm, Beskos and Roberts [9] have proposed such a method for one-dimensional diffusions. Recently, Bally and Kohatsu-Higa [7] have given a probabilistic representation of the parametrix method that opens the road to construct unbiased estimators for a wide class of Markov processes, including stopped or reflected diffusions [11, 4]. By using a different approach, Henry-Labordère et al. [13] have lately proposed unbiased estimators for SDEs that present nonetheless a similar structure as the ones obtained with the parametrix method. A common important issue with all these unbiased estimators is to come up with a bounded variance estimator. This problem is tackled by Andersson and Kohatsu-Higa [5] who provide a finite variance estimator for the parametrix method, see also Agarwal and Gobet [1]. The approximation method that we develop in this paper can be seen somehow as a discrete version of the parametrix method. Instead of considering a continuous time approximating semigroup , and iterate indefinitely the formula ( and are the corresponding infinitesimal generators), we iterate the equality a finite number of times until to achieve an approximation of order . The main advantage of our approximation schemes is that their construction is generic and only depends on the parameter in (), while the weights involved in these unbiased estimators really depends on the underlying SDE or Markov process. This makes our approach much easier to implement for an whole class of processes. Besides, the discrete structure enables us to gather the correcting terms in a way to get a finite variance estimator as already mentioned.
The paper is organized as follows. In Section 2, we introduce some notation and present the recipe to construct iteratively high-order approximation schemes. Section 3 introduces trees, random trees and random grids that we use to construct our approximation schemes. It also prepares the variance analysis by gathering the terms of the approximations in an appropriate way. Theorem 3.10 states our first main result. Section 4 specify these approximations in some cases by using particular probabilistic representations of semigroups, for instance in the case of the Euler scheme for SDEs. In this case, we state in Theorem 4.4 our second main result that ensures that our estimators have a finite variance. Last, we provide in Section 5 numerical examples of our approximations that illustrates the broad application of our approach.
2 Basic development
We first introduce notation that will be used through the paper. We denote by the space of smooth functions from to which are bounded and have bounded derivatives of any order. And we work with the norms
[TABLE]
where for a multi-index we denote
[TABLE]
In many proofs of the paper, we will have to deal with derivatives of composed functions. Let and be smooth functions. We note with the coordinates of . Then, one may prove by recurrence that
[TABLE]
with
[TABLE]
where the sum is over all , and (non void) multi-indices such that . For , we note , and the same formula applies coordinate by coordinate. This result is known in the literature as the Faà di Bruno’s formula, but we do not need in this work to use the explicit formula for the coefficients , see Constantine and Savits [10].
We consider a semigroup of linear operators which satisfies
[TABLE]
Let be a time horizon which is fixed in the sequel. We are interested in building approximation schemes for For and , we define
[TABLE]
We suppose that we are given a sequence of linear operators which will be used in order to construct our approximation schemes. The operator is supposed to be an approximation of , more precisely we assume that for every and
[TABLE]
for some In the case of the Euler scheme we have and (see Example 2.2 below). We denote
[TABLE]
and
[TABLE]
Thus, we produce a discrete semigroup We will use the following regularity hypothesis:
[TABLE]
Remark 2.1**.**
We could more generally assume that the left hand-side of () is upper bounded by for some : this would not modify the main results of Sections 2 and 3. However, this generalization is not relevant for usual semigroups that already satisfy this bound for . For simplicity, we only consider this case.
Example 2.2**.**
(Euler scheme for diffusion processes) We work with the diffusion process (1). We assume that and the derivatives of any order of and are bounded. In particular they have linear growth. By standard results on stochastic flows (see Proposition 2.1 and Theorem 2.3 of [14], Chapter 5), we have ().
We denote by the semigroup a diffusion process (1) and , the (discrete) semigroup of the Euler scheme of step . Then
[TABLE]
where is the infinitesimal operator of the semigroup and is the semigroup corresponding to the Euler scheme, with frozen coefficients and . By Theorem 4.4 [17], we can take a modification of the solution such that the flow is infinitely differentiable with derivatives which have finite moments of any order. Thus we have
[TABLE]
Thus, the property () is satisfied with
We come back to the general case and we present the basic decomposition that we will use. We use the linearity of the operators in order to get
[TABLE]
Iterating this equality, we get for every ,
[TABLE]
with (convention and for non commutative operators )
[TABLE]
Then, using () and () we get for ,
[TABLE]
Thus, we get an error of order for .
Formula (11) represents a discretization with step on the interval . In the sequel we will use similar developments on intervals with step . So, using the above formula with and with step instead of , we obtain
[TABLE]
with . We then have from (12) with ,
[TABLE]
Similarly, we get
[TABLE]
Formula (11) is appealing since it may lead to an approximation of order . The natural question is then how to simulate the terms . This raises two problems that we explain now.
Problem 1. It seems cumbersome (time consuming of complexity ) to compute the sum defining . To avoid this issue, we will use a randomization procedure (inspired from [7] in the framework of the parametrix method). We fix and we consider a random variable that follows a "discrete order statistics" on Precisely, follows the distribution
[TABLE]
We will use the notation
[TABLE]
Then
[TABLE]
Remark 2.3**.**
If we look to the equality between the terms in (16) and in (17), we see that (17) gives a way to compute the sum which appears in (16) by the Monte-Carlo method. This Monte Carlo avoids the “curse of dimensionality” since the discrete simplex of dimension , , has elements. Besides, the different terms in the sum (16) have values that may be very close each other leading to a bounded variance. This will be analyzed later on in Subsection 4.3 for SDEs and the Euler scheme.
Let us note that this randomization makes the approximation (11) effective. Otherwise, it would have a computational cost of for a precision in (see (12)), exactly as .
Problem 2. The basic element in the above formula is and we are not able to simulate directly this quantity, due to . To overcome this problem, we will use the fact that the short-time estimate () of the semigroup is more and more precise when increases since :
[TABLE]
Thus, we will construct by backward induction on , some approximations only based the approximation kernels , each of them involving at most iterations of these approximations (so we keep a complexity of order ). This is precised by the following lemma, which is the core of our computations. We will use the following numbers: for and we define
[TABLE]
Here, if is the ceiling function. We observe that is nonincreasing and therefore .
Lemma 2.4**.**
Let and . Suppose that we have already a sequence of operators for such that
[TABLE]
for some and . For , we define
[TABLE]
with
[TABLE]
Then, we have
[TABLE]
for some and with
[TABLE]
Remark 2.5**.**
Compare the definition of with the one of in (17): one just replaces by . So is replaced by , which is supposed to be "computable". 2. 2.
Recall that for , we have . Thus so is an approximation of order of . This is what we want to obtain. 3. 3.
The inductive construction suggested by Lemma 2.4 to get a -th order scheme for is finite. See the construction of the tree in (54).
Proof of Lemma 2.4.
For , we have so that Using (14) with , we obtain (23).
Suppose now that so that we have We write
[TABLE]
First we compare and . From (22) we have
[TABLE]
We get by expanding (choose times and times ) and using () and (20)
[TABLE]
with depending on the constants We have to check that, for every
[TABLE]
The first inequality is true if and thus if by using (18). The last inequality is equivalent to
[TABLE]
which holds since
[TABLE]
We obtain
[TABLE]
We deal now with the remainder. Using (14) with we obtain
[TABLE]
Since the proof is completed. ∎
Remark 2.6**.**
Lemma 2.4 gives a recursive way to construct approximation of order . When is not an integer, another natural choice may be to consider approximations of order , with . Of course, it is possible then to get an analogous recursive construction.
3 High order approximations of semigroups
Lemma 2.4 gives the recipe to construct high order approximations of semigroups by induction: from high order approximations on a time step , we produce high order approximations on a time step , we go on this construction to get high order approximations for . To describe precisely this construction, we need to introduce basic mathematical objects. In Subsection 3.1, we introduce "trees", "random trees" and "random grids" that will be used to define suitably our approximation schemes. Then, Subsection 3.2 presents a sequence of abstract operators and the composition operations associated to some given random tree. All these definitions are motivated by the approximation schemes that we describe in Subsection 3.3, but for the moment we keep an abstract framework because this allows a precise and simple presentation.
3.1 Trees, random trees and random grids
The approximations that we construct in this paper involve a quite intricate combinatorics. This can be understood from Lemma 2.4: an approximation of order at a level is constructed from approximations of different orders at level . To describe this recursion, we will use trees, see (58) and (56) thereafter. Then, to make this approximation more explicit and non-recursive, we will then use what we call random trees, i.e. trees labeled with particular random variables. In the case of SDE and the Euler scheme approximations, these random trees can be seen as a way to represent the random grid on which the Euler scheme is constructed. In this paper, we will use as much as possible the letter for trees and for random trees.
3.1.1 Trees
We will use the Neveu notation [20]. Let be the set of finite sequences of non-negative integers. For and we denote and We also denote the length of
Definition 3.1**.**
(Trees) A tree is a subset such that:
, 2. 2.
, 3. 3.
* for every *
Convention: Throughout the paper, we use a different symbol for the ancestor (root) of a tree and for the void set:
[TABLE]
We think to as a genealogical tree: each represents an individual (we call it also node or vertex) which is characterized by his genealogy: is the -th son of So the first property imposes that the root belong to the tree (it is the universal ancestor), the second property says that any node of the tree (except the root) has a father, and the third property imposes to number the sons of a node increasingly, without jumping any number: put it otherwise, if a third son exists, then a second one has to exist also. Last, let us mention that a tree can be infinite: throughout the paper, we will only consider finite trees.
We introduce some more notation related to . For , we denote by the number of sons of that is
[TABLE]
We denote by the sub tree of rooted in that is We also denote . This means that we root at the point . Notice that this is not a tree because it does not contain , for , nor the ancestor. We note , the depth of the tree . Finally we define the extreme points (leaves) of
[TABLE]
3.1.2 Random trees
Let be a finite tree and such that . To every vertex (i.e. such that ) we associate a random variable
[TABLE]
which we may considered as the (random) birthdays of the sons of . We denote
[TABLE]
We assume :
- •
The random variables are independent each other.
- •
is an order statistics, i.e. they are uniformly distributed on
Definition 3.2**.**
(Random trees) If the above hypothesis are verified, we call a random tree.
Remark 3.3**.**
Let us precise the relation between the random trees and its subtrees , with . For with we will assume where is the random variable associated to . So, is the family of uniform random variables obtained from to which is added one independent random variable uniformly distributed on .
3.1.3 Random grids
We associate to a random tree a random grid. We fix and and we recall
[TABLE]
and we use here (and in the sequel) the convention
[TABLE]
Then, we construct by recurrence the random grid on in the following way. For convenience, we drop in the notation the dependence in even if this grid is constructed by using . If (contains just the ancestor) then and we define
[TABLE]
This is the usual uniform grid of step on . Otherwise, we have and we define by recurrence
[TABLE]
where . This means that we consider the uniform grid of step on and moreover we refine the intervals according to the random grid (see Remark 3.3 to see how the random trees and are related). Notice that, if then We denote and we define
[TABLE]
the reordering of . We notice that for every one has for some We finally give an alternative representation of the random grid .
Lemma 3.4**.**
Let be a random tree. Let us define and for , we define
[TABLE]
Then, we have
[TABLE]
Proof.
We prove this result by recurrence on the depth . The result is clear for . Suppose the result true for any random tree and assume . Let and the random variables associated to (see Remark 3.3). For , we have
[TABLE]
We set . Since for any , we get
[TABLE]
From , we deduce that
[TABLE]
By using the recurrence hypothesis and (28), this set is equal to . ∎
3.2 Operators
We consider again a sequence of operators , (or more generally where is an abstract vector space). For we denote
[TABLE]
Given a random tree we construct by recurrence in the following way (again we drop the dependence on in the notation). If we define
[TABLE]
Suppose now that and let . We define by recurrence
[TABLE]
see Remark 3.3 for the dependence between the random trees. Finally, we define in the following way. If we define
[TABLE]
and if we define by recurrence
[TABLE]
Notice that the recurrence formula (33) is the same as (31), but the initial condition (32) is different from (30).
We consider now a deterministic tree (in contrast with which is a random tree) and define by recurrence . If , then
[TABLE]
and if
[TABLE]
where is the uniform law of the order statistics
Our aim now is to give an explicit computational formula for In order to do it, we need to introduce one more notation concerning families of trees (forests). Let , where is a family of indices and is a tree for all . Given , we construct
[TABLE]
with
[TABLE]
So, the tree is obtained by rooting to the ancestor the tree at the node , for each . We note the random tree obtained by labeling the nodes of with random variables, according to Definition 3.2.
Using this notation, we are able to construct the family of trees associated to a tree by recurrence, in the following way. If (this means the tree is composed just by the ancestor) then we define - so the finite family associated to the tree has just one element which is the tree . Suppose now that . Then, we define
[TABLE]
where is the shorthand notation for . We are now able to give the first result from this section.
Proposition 3.5**.**
Let be a tree and let be the family of trees associated to in (38). We label each tree with random variables, so that is a random tree in the sense of Definition 3.2. Then, with defined in (32) and (33), we have
[TABLE]
with
[TABLE]
Proof.
Take first . By definition, we have On the other hand we have and So the equality (39) holds true.
Suppose now that (39) holds if and let us prove it for Using the recurrence formula (35) first and the recurrence hypothesis then we get
[TABLE]
Let We have
[TABLE]
We recall that is defined in (37), and by construction we have Then, we use the recurrence formula (33) in the definition of and we obtain
[TABLE]
Moreover, we have
[TABLE]
so the term in (3.2) is equal to
[TABLE]
We conclude that
[TABLE]
Our aim now is to compute in an explicit way For a tree and for a subset of leaves , we define : we cut the extreme nodes which belong to . Notice that is no more a tree: for example, if and then is not a tree: the first and second axioms of Definition 3.1 are satisfied, not the third. We also stress that may be the void set in the case and . Thus, we look to as to a set (not a tree) which may be void as well (remember the convention (25)).
Suppose that Our first concern is to precise how is decomposed on each of the subtrees . We define
[TABLE]
We stress that, if no descendant of belongs to , then we have (void set). We also have
[TABLE]
We define now recursively. First, if (void set) or if (ancestor) we define
[TABLE]
Otherwise we have , and we define
[TABLE]
with and defined in (42).
Before going further, we construct the grid in a similar way with defined in (28). We denote So represents the number of sons of the ancestor which are not in (so, that are alive after killing the individuals from ). We also denote , the indices of the surviving sons. Then, we define (with the convention
[TABLE]
if , and . Here is the set defined in (42). So, we use the refinement procedure for only, and not for every . In the case (void set) coincides with . As for Lemma 3.4, we can show that
[TABLE]
Note that we need to add the union with for the case , i.e. when . We denote
[TABLE]
the reordering of . We notice that for every one has for some . Thus, we produce a sequence associated to , and we have
[TABLE]
Proposition 3.6**.**
Let defined in (32) and (33) and defined in (44). Then
[TABLE]
The above sum includes (void set) and
Before giving the proof of the above proposition, we need to get a more detailed description of the set and of the decomposition given in (42). Let be such that , so that . Then, for any , we denote
[TABLE]
where is defined by (42). We define now the converse operation: given a sequence of sets we define
[TABLE]
In order to precise the structure of we consider the sets of indices defined by
[TABLE]
We stress that for the set is not void and does not contain the ancestor Then the set defined in (50) is given by
[TABLE]
and we define
[TABLE]
Lemma 3.7**.**
Let be a tree such that . Then, we have for any and any ,
[TABLE]
Proof.
We just check the first equality. Let We have to prove that for each we have , where is the th coordinate of the application defined by (49). From (51), we have . Thus, if and otherwise. The second equality is verified in a similar way. ∎
Proof of Proposition 3.6..
If then and with is the void set and So (48) holds.
If then, using the recurrence hypothesis
[TABLE]
Let We have , and according to (44)
[TABLE]
Since every may be decomposed in this way by Lemma 3.7, we get
[TABLE]
3.3 Tree representation of the approximation schemes
We define now a family of trees which describes our approximation schemes. For and , let us define the tree as follows:
[TABLE]
with given in (18), (19) and the convention (void set). These trees are defined by recurrence, and it is not clear at a first glance that the induction ends. This true by the next lemma.
Lemma 3.8**.**
Let . Let us denote for ,
[TABLE]
We have and
[TABLE]
In particular, the recursion defining in formula (54) ends for every .
Proof.
Since , we have for any , which gives . Let us first observe that for , we have and thus by (54). Therefore, the implication will prove that the recursion ends. Let us take then and such that . The last inequality implies and then . Thus, we have to check that
[TABLE]
Since , it is sufficient to prove
[TABLE]
After simplifications, this inequality is equivalent to
[TABLE]
which clearly holds true for every since . ∎
Now, we explain how we associate an approximation scheme to a finite tree. In the following we will work with the specific trees constructed above, but for the moment we consider a general finite tree . We recall that is the number of sons of the root , and for , is the subtree that is rooted at the node . For a finite tree , we define the approximation scheme as follows by induction.
If then and we put
[TABLE]
If we define by recurrence
[TABLE]
Since and the tree is finite, this induction clearly ends.
Proposition 3.9**.**
(Tree representation of the approximations of order )
For every , we have
[TABLE]
where is the approximation defined in (21). Consequently, we have
[TABLE]
with defined in (24). In particular, taking (recall that we obtain
[TABLE]
Proof.
We consider the sets defined in (55) and prove the result by induction on . For we have and so that . Let . From (54), we have . Using Lemma 3.8 and the induction hypothesis, we get We also have so the recurrence formulas (56) for and (21) for coincide, proving the claim.∎
We put the above formula in an alternative form which is more enlightening and easier to handle. We define
[TABLE]
Then, formula (56) is equivalent to
[TABLE]
This is precisely the operator defined in (35). We are now able to give the main result in this section, which is a consequence of Propositions 3.5 and 3.9.
Theorem 3.10**.**
Suppose that Hypotheses () and () hold true. Let be given and let be the tree constructed in (54) for . Let be the family of trees associated to in (38) and let be given in (40). Then, we define
[TABLE]
and we have
[TABLE]
Remark 3.11**.**
The result presented in this section also holds in the more abstract framework described in the introduction, with semigroups defined on a vector space with seminorms . Under () and (), we have similarly .
4 Probabilistic representation of the approximation semigroup for some Markov processes
All the results presented in the previous sections apply for an abstract semigroup with a family of approximation schemes corresponding to the abstract operators . If one wants to use a Monte Carlo algorithm, one needs to use some probabilistic representation for in order to compute the approximation schemes. This probabilistic representation may be very different according to the problem at hand. Nonetheless, a crucial common issue is the variance of the estimator. More precisely, the approximation proposed in (60) has to be seen as an addition of correction terms that can be calculated independently. Instead, it is very important to try to calculate jointly the terms appearing in for . This is the sum of terms with rather close values since . If one would use independent samples to compute each term, the correcting term would have roughly times the variance of the initial basic Monte-Carlo estimator, which would make the approximation (60) poorly efficient. Fortunately, it is in general possible to do much better.
The goal of this section is to precise the probabilistic representation of the approximation schemes, when considering diffusion processes or Piecewise Deterministic Markov Processes (PDMP). In these cases, it is possible to specify a probabilistic representation of all the schemes on the same probability space, and such that the variance of is bounded.
4.1 Probabilistic representation of
We start by presenting a general framework. We consider a Polish space and consider a kernel such that the application is measurable. Moreover, we consider an independent random variable and define
[TABLE]
We will assume that for some and the estimates () and () hold true. We consider now a grid and denote . We also consider a sequence of independent copies of and define the random vector fields
[TABLE]
and the approximating flow defined by and
[TABLE]
To a tree and to a subset we associate defined in (44) and the grid defined in (45). It is easy to check that we have the probabilistic representation
[TABLE]
Therefore, (60) (with given in (40)) can be rewritten as
[TABLE]
It is clear that (and consequently may be computed using Monte-Carlo simulation, and Theorem 3.10 gives
[TABLE]
Remark 4.1**.**
The above estimate involves which requires much regularity for the test function . However, under some supplementary regularity and non degeneracy assumptions one may prove convergence in total variation distance. Precisely, one may consider measurable and bounded test functions and replace by in the estimation of the error. This has been done in Bally and Rey [8] for usual approximation schemes and the uniform grid (which corresponds in our framework to , i.e. to ). The supplementary hypothesis are the following. First, one has to assume that satisfies the so called Doeblin condition: there exists and such that for every measurable set one has where is the Lebesgue measure. Moreover one has to assume some non degeneracy condition on the gradient of with respect to . However the proof is technical and non trivial, so we do not consider this possible extension in the present paper.
4.2 Probabilistic representation of
We now specify, on some examples, how to sample jointly the random variables for all the schemes for .
4.2.1 Approximation schemes for SDEs
We deal with approximation schemes for the dimensional diffusion process which solves the SDE (1):
[TABLE]
Here denotes the Stratonovich integral and designates the drift coefficient that one obtains when passing from the Itô integral to the Stratonovich integral. We assume that the coefficients and are , bounded with bounded derivatives of any order.
We start with the Euler scheme. It corresponds to
[TABLE]
and being distributed as standard normal random variable on . The finest discretization is , and one therefore needs independent random variables . The grids with are sub-grids of . For some indices , it goes directly from to while the uniform discretization of is contained in . One takes then for the corresponding normal variable for .
We now present the Ninomiya and Victoir scheme [21]. We use the following notation: for a vector field , we define to be the solution of the ODE
[TABLE]
and denote . Then, we set
[TABLE]
Let be a Bernoulli random variable such that and let be an independent standard normal random variable on . Then represents the Ninomiya and Victoir scheme. We denote
[TABLE]
One may show, adapting for example the proof of Theorem 1.18 in [3], that
[TABLE]
and more generally that () holds with and . Therefore, the tree will be different from the one of the Euler scheme and shorter. To sample the Ninomiya and Victoir scheme on the grids , we take a sequence of independent copies of . We define the corresponding flow by
[TABLE]
For each grid with , we again take each time that the discretization goes from to , and take for the associated Bernoulli variable.
4.2.2 Approximation schemes for PDMPs
We consider the infinitesimal operator
[TABLE]
Here is a measurable space, is a finite measure on , are globally Lipschitz continuous functions, is measurable, bounded and Lipschitz continuous with respect to , uniformly with respect to . We set . We denote by the semigroup associated to the infinitesimal operator . The probabilistic representation of is given in the following way.
Let be a Poisson process with intensity , and a sequence of independent random variables such that
[TABLE]
and being independent of . Then, we define as the solution of
[TABLE]
where is the time of the -th jump of . It is well known that under our hypothesis the above equation has a unique solution: between two jump times it follows the deterministic curve given by , and at time it makes the jump if . This process satisfies . This is one particular possible description of PDMPs. There is a huge literature concerning this type of process and their applications, see e.g. [15].
We now define the approximation scheme. Let be the solution of
[TABLE]
which is the solution of (67) for . Then, we define
[TABLE]
where is the semigroup associated to . On the discretization time-grid , this amounts to consider
[TABLE]
where is defined recursively by and
[TABLE]
Here, the generic Polish space introduced in Subsection 4.1 is . Let us note that other approximation schemes are possible. The interest of (69) is that all the schemes with are sampled from the same random variables and and are likely to have very similar jumps, which is interesting to reduce the variance of .
Let us assume now that , and are , bounded with bounded derivatives (uniformly in ). We check that () holds in this case. To do so, we introduce the flow associated to , see equation (66). We have (see e.g. [15], Lemma 7.3.3)
[TABLE]
By differentiating this equation, we get by induction on (we clearly have for ) that
[TABLE]
using the Faà di Bruno formula and Gronwall’s lemma. This property () has been studied for more general jump SDEs very recently by Bally, Goreac and Rabiet [6]. The next lemma proves that () also holds with with and .
Lemma 4.2**.**
We assume that , and are , bounded with bounded derivatives, uniformly in . Then, we have
[TABLE]
Proof.
From (70) and , we get
[TABLE]
which leads to
[TABLE]
Since , we get from (71) . We define which has bounded derivatives, uniformly in . Let be the infinitesimal generator of (68). We have similarly
[TABLE]
We obviously have Since , we also have , which yields to . ∎
4.3 Estimates of the variance on the Euler scheme for SDEs
The aim of this section is to estimate the variance of the algorithm given in (63), (64) in the case of the Euler scheme for SDEs. We consider the valued diffusion process solution of the SDE (1). Given a random grid
[TABLE]
we construct the corresponding Euler scheme by and
[TABLE]
In (64), we have constructed an approximation scheme based on a linear combination of We use here all the notation introduced there. We denote
[TABLE]
Remark 4.3**.**
The important point here is that all the Euler schemes for are defined on the same probability space and constructed with the same Brownian motion . Thus, all the values of are close. When summing according to (72), we may then expect that is small with a small variance. This is precised in the next proposition.
Theorem 4.4**.**
Suppose that . Then, we have for any ,
[TABLE]
In particular, we have and thus .
The proof needs some preparation. We use the alternative representation of the random grids and with given by Lemma 3.4 and (45). We recall that is the ordered grid :
[TABLE]
We write with . For , there exists such that . We check that these indices are distinct, and we assume without loss of generality that . These are the "extreme times". We also denote
[TABLE]
Example 4.5**.**
We consider , with and . The grid is drawn below to scale, with , .
[math]s_{0}$$s_{1}$$s_{2}$$s_{3}$$s_{4}$$s_{5}$$s_{6}$$s_{7}$$s_{8}$$s_{9}
On this example, we have , , , , and . The three other grids needed in the computation of (72) are
[TABLE]
Notice that the grid contains, by construction, all the points in the uniform grid on , that is with . But, if , the grid is not refined between and and does not contain with . We deduce the next lemma.
Lemma 4.6**.**
Let . We note with and set . Then, .
We also recall that, in order to construct our scheme (see (63)), we have considered a kernel and a sequence of independent random variables and and we have defined the vector fields with (see (62)). In the case of the Euler scheme, we have a special representation of these random variables and of these operators. We define
[TABLE]
This corresponds to the quantity defined in (62) with and Moreover, for we define
[TABLE]
with the convention if . This represents the flow of the approximation scheme which runs from to , and that is common to all the grids for . We also define the flow between and :
[TABLE]
We now specify if we use or not the refined grid on the interval . For the case where the grid is refined (i.e. when ), we define
[TABLE]
This is the Euler scheme which starts from and runs from to using the uniform step. Instead, for the coarse discretization which goes from to directly in one single step (i.e. when ), we set
[TABLE]
Now, we are able to define the flow of the whole Euler scheme on the grid . If , we use in order to go from to . Instead, if we use . Thus, we define, for
[TABLE]
From Lemma 4.6, we get
[TABLE]
As a consequence, we have
[TABLE]
with and defined in (80). We are now in the framework of Appendix A. The above formula has to be understood in the following way: represents the approximating flow associated to the grid which runs from to So when we arrive in This is the last "extreme time". After this, we run with up to
Lemma 4.7**.**
With the notation above, we define families of elements of as follows. We set and for , we define
[TABLE]
where and has to be understood as the concatenation symbol. Then, we have
[TABLE]
Lemma 4.7 is obvious but important for simulation purposes: by branching, it is possible to simulate at the same time all the values of as explained in Subsection 5.1. More precisely, there is no need to store : adding the sign to the state space makes the branching dynamics Markovian.
Proof of Theorem 4.4.
Our aim now is to check that and verify the hypothesis of Proposition A.1. Standard estimates concerning Euler schemes (see the short sketch below) give
[TABLE]
and the same estimate holds for Then, as a consequence of Proposition A.1 we obtain
[TABLE]
Notice that where . Thus, from Lemma A.3 we get easily that and then
[TABLE]
by using the Faà di Bruno formula. Then (73) follows. Moreover, we have
[TABLE]
One checks easily by induction that , which gives (73).
We now give a sketch of the proof of (74). We consider the grid given at the beginning of this section. The corresponding Euler scheme on is defined by
[TABLE]
where for . Then, we have Using Burkholder-Davis-Gundy inequality and the fact that the coefficients are bounded, we get . Moreover, the first derivatives satisfy
[TABLE]
Since and are bounded, using Burkholder-Davis-Gundy inequality and Gronwall’s lemma we get For higher order derivatives, the proof is similar.∎
Remark 4.8**.**
We have a better estimate for (76) when is constant. In this case, we have from Lemma A.3
[TABLE]
which leads to get instead of (73). When , we even have and thus .
Remark 4.9**.**
Since , we get with if , when is a constant function. Thus, the computation of some terms in the sum (60) is useless: we can drop the terms for any such that . More precisely, also satisfies .
For example, the tree is such that and its calculation is useless for an approximation of order for ODEs ().
5 Numerical results
5.1 Implementation
First, we have to calculate the tree given by Equation (54) in function of the desired order of convergence. To calculate this tree, we only have to know and the coefficient that characterizes the order of convergence of the elementary scheme (see hypothesis). For the Euler scheme, we have . For example, the tree corresponding to the approximations of order and constructed with the Euler scheme are given in Figure 2. To compute these trees, we use the induction formula (54). To help the reader, we have indicated in the node the convergence order (i.e. the value of in (54)) needed in the induction. For example, , and are the value indicated for the sons of the ancestor of the tree .
The second step consists in calculating the forest . According to Proposition 3.5, each tree of this forest represents a combination of elementary schemes. For example, using the Neveu notation, we have for the Euler scheme
[TABLE]
Let us note that the number of trees in the forest increases rapidly with : for the Euler scheme (), we have , , . Nonetheless, these forests can be calculated once and for all.
The last step consist in calculating for all the trees . Then, we get the approximation by using (60). The key point here is to sample all the Euler schemes from the same Brownian path, as explained in Section 4. Figure 3 gives an illustration of the times grids that are involved in the calculation of , with .
To implement the Euler schemes involved in , it is possible to do it “by hands”, i.e. to generate the random tree and then to simulate simultaneously the schemes. This is easy to do for rather small trees , but the drawback is that it requires to write a routine for each . Thus, it is easy to do this direct implementation up to order three, but then it becomes rather cumbersome since the number of routines needed is rather large. Instead of this, it is possible to write a recursive routine that works for any . This routine starts from one initial value and calculates at the same time the schemes and branches each time it finds a leaf. It also calculates inductively the weight associated to each scheme. This routine works as follows. It takes in arguments a tree , a step , and a set of initial values with weights . If , it samples independent increments and calculate for each , the Euler scheme on the coarse grid with time step starting from and the Euler scheme on the fine grid with time step starting from . It returns the set of values
[TABLE]
Otherwise, we have with . We draw a uniform random variable on (see Remark 5.1). Then, we apply to all the initial values times the Euler scheme with time step , conserving their weights. They are used as argument to apply inductively the function with and . This generates a set of values and weights to which we apply times the Euler scheme with time step , and then we apply again inductively the function with and . We repeat this times, and finally apply times the Euler scheme with time step . This inductive algorithm consists precisely in implementing the formula given in Lemma 4.7.
Remark 5.1**.**
To sample a uniform random variable on for , we can proceed as follows. If , we simply draw a uniform r.v. on . For , we proceed by induction and draw a uniform random variable on . Then, we draw a uniform random variable on . This can be done by sampling an independent random variable that is uniform on and then set . Last, we sort the , which produces a vector that is uniformly distributed on .
Now that we have an algorithm that is able to calculate for any tree , we just have to approximate all these quantities for all the trees and then to sum these contributions according to (60). To decide how many samples we use to approximate , we fix a desired precision , calculate the empirical variance of on a small sampling and then take such that , so that all the terms have roughly the same statistical error with a 95% confidence interval half-width equal to .
5.2 Numerical results for an ODE
To visualize numerically the orders of convergence provided by (60) for the Euler scheme, it is more convenient to work with ODEs. In this case, the variance of the terms is very small and it is possible to observe the five first order of convergence. In the particular case of a linear ODE , we can go further, but we can check also that the value of is deterministic and does not depend on the uniform random variables ’s. Thus, we have considered the following example
[TABLE]
with and . The exact value is given by . We have drawn on Figure 4, for , the values of in function of with , , and , where is the estimator of given by equation (60) and . The corresponding values of the slopes are , , and which is in line with what is expected. All the values given on this example are with an half-width of the 95% confidence interval that does not exceed . Our run for the approximation of order already gives with a value that is accurate up to : the exact value is already in the 95% confidence interval.
Last, let us mention that for this ODE, we have used the same approximation rule as for the SDE and calculated all the terms of (60). However, as noticed in Remark 4.9, it is possible to avoid the calculation of many terms.
5.3 Numerical results for an SDE
We now want to illustrate the orders of convergence for the approximation given by (60) for the Euler-Maruyama scheme. We consider the following SDE
[TABLE]
with , , . In Figure 5, we have plotted the approximation of with with the orders in function of . We still denote by the estimator of given by (60), using the approximation of order with time-steps. The half-width of the 95% confidence interval is about . The approximation of order is already at this level of precision for , and we have indicated this value as a reference line for the other schemes. The convergence are again in line with what is expected.
5.4 Numerical results for a PDMP
We consider the TCP process with infinitesimal generator
[TABLE]
starting from , and our goal is to approximate , with . Since the jumps are only downward, the jump intensity is bounded by on . We are thus in the framework of paragraph 4.2.2, and use the scheme described in (69). We denote again by the estimator of given by (60), using the approximation of order with . In Figure 6, we have plotted the approximation of with with the orders in function of . The half-width of the 95% confidence interval is about . The approximation of order is already at this level of precision for , and we have indicated this value as a reference line for the other schemes. The plot is very similar to the one obtained in Figure 5. This demonstrates numerically that the approximations described by (60) are relevant for a wide range of processes and applications.
5.5 A rough complexity analysis
Now, let us do a rough complexity analysis to understand which order of approximation to use in practice. To make this derivation, we make the assumption for sake of simplicity that the variance corresponding to the term is equal to for all , . Thus, in this analysis, we will use the same number of samples for all these terms. We also suppose that we want to achieve a precision of order , with a standard error which is exactly . Then, we have the following.
For the approximation of order , we use one Euler scheme with time step and the standard error is , where is the number of samples. We take and and the calculation time (counted as the number of Euler iterations used) is . 2. 2.
For the approximation of order , we have two terms corresponding to and . The first one requires calculations of Euler iterations. The second one requires between and Euler iterations: due to the branching implementation, we only calculate iterations when and iterations when . For simplicity, we will only consider in this computational cost analysis the worst case and count iterations. Since the convergence is of order , we take . The standard error is , and we take . Thus, the calculation time is . 3. 3.
For the order 3, we have in addition to calculate for and that requires respectively and Euler iterations. We take to have an approximation of order . The standard error is , and we take . Thus, the calculation time is . 4. 4.
For the order 4, we have in addition to calculate for and , , and : they require respectively , (see Figure 3), , and . The overall cost is . We then take and to have a standard error . Thus, the calculation time is .
With this rough cost analysis, we would use:
- •
the approximation of order 2 rather than the approximation of order 1 if , i.e. ,
- •
the approximation of order 3 rather than the approximation of order 2 if , i.e. ,
- •
the approximation of order 4 rather than the approximation of order 3 if , i.e. .
This analysis shows that in practice the order 3 may be already sufficient for the precision that is usually needed. However, this cost analysis has to be tempered, because the assumption of a unit variance for each term is rather pessimistic. For ODEs or SDEs with constant diffusion coefficient, we already know from our theoretical results (see Remark 4.8) that the variance of may be much smaller. Also, for SDEs, we see from Table 1 that, globally, the terms that are needed for the calculation of order 4 have a smaller variance than the one needed for the order 3, which have also smaller variance than the one needed for the order 2. Of course, there is exception: for example in Table 1, the standard deviation associated to is of same magnitude as the one associated to or even . This is why it is better in practice to estimate first the variance of each term and then determine how many samples are needed to achieve a given precision. For the example of Figure 5, to get a precision of , the approximation of order 2 has required 88s (), the order 3 about 89s (), the order 4 about 214s () and the order 5 about 345s (). Thus, the scheme of order 3 is already competitive for this precision with respect to the order 2.
Appendix A Technical results for the variance analysis
We introduce some notation. We consider smooth random fields, that is functions which are measurable with respect to and such that, for each the function is of class . For such a random field we denote
[TABLE]
Moreover, we will say that a sequence of random fields are independent if there are some independent algebras such that is measurable. We will use this property as follows. Suppose that is measurable and and are measurable. Then, for every and every
[TABLE]
In the sequel we consider a sequence of smooth random fields and , and moreover, a vector field . We assume that and are independent. We fix and, for a set we define
[TABLE]
Moreover, given a multi-index we define
[TABLE]
Proposition A.1**.**
Suppose that for every , there exists such that
[TABLE]
Then, for every and every multi-index we have
[TABLE]
for some depending on , and
Remark A.2**.**
This proposition says the following: if at each step the error is of order , then after steps we have an error of order This may seem a little surprising, and one may have expected an error of order , but this is due to the way how terms are summed with
Proof.
Step 1. We use the Faà di Bruno formula (see (8)) and the inequality between geometric and arithmetic means to upper bound the terms defining . We then obtain for random functions
[TABLE]
Besides, for two random fields and , we write
[TABLE]
Using the inequality between geometric and arithmetic means for the product on and then the Cauchy-Schwarz inequality, we get
[TABLE]
Step 2. We prove (82) for In this case or so that
[TABLE]
with
[TABLE]
and
[TABLE]
Using (81) and the fact that is independent of we get (see (79))
[TABLE]
so that Moreover, using again (79) first and then (83), we get
[TABLE]
so (82) is proved for .
**Step 3. **Suppose (82) is true for for every and every We prove it for We do it first for (without derivatives) because it is simpler. We write
[TABLE]
Note that we have made a slight abuse of notation here: the notation is used in fact for , not for . Since is independent of we have from (79)
[TABLE]
Then, by using the induction hypothesis, we get
[TABLE]
We prove now (82) for a general multi-index and make the same abuse of notation for . Using (8) and (9) for and we obtain
[TABLE]
with
[TABLE]
and
[TABLE]
By assumption are independent of . Therefore, is independent of . We use (79) first and then the induction hypothesis and (83) to obtain
[TABLE]
Moreover
[TABLE]
Notice that . Using again (79) and the recurrence hypothesis, we get
[TABLE]
Lemma A.3**.**
Let denote the flow of the SDE (1) and the flow of the Euler scheme. We assume that and are , bounded and with bounded derivatives. Then, we have
[TABLE]
with if , if is a constant function and in the general case.
Proof.
We show this result by induction on . We only focus on the general case, the cases or constant can be then easily deduced. For , this result is stated for example in Proposition 1.2 [2]. For simplicity of notation, we do the proof in dimension with . We note the -th derivative of . For , we have and
[TABLE]
Since is bounded, we have . We write
[TABLE]
Since is bounded and Lipschitz, we get by using the Burkholder-Davis-Gundy inequality and then Jensen inequality
[TABLE]
with a constant that does not depend on . We check then again with the BDG inequality that since is bounded and since is bounded and (84). Thus, we have .
We suppose now the result true for and that we have shown that
[TABLE]
for each , with a constant that does not depend on . We have and by the Faà di Bruno formula
[TABLE]
with . Note that in this sum is equal to [math] for and otherwise there is at least one , such that . This gives by using the induction hypothesis (85) and Hölder type inequalities. Since , and are bounded and for any , we get by using BDG and Gronwall inequalities. Therefore, also satisfies .
We now repeat the same arguments as for : from
[TABLE]
we get .
∎
Acknowledgements
Aurélien Alfonsi benefited from the support of the “Chaire Risques Financiers”, Fondation du Risque.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Ankush Agarwal and Emmanuel Gobet. Finite variance unbiased estimation of stochastic differential equations, 2018.
- 2[2] A. Alfonsi, B. Jourdain, and A. Kohatsu-Higa. Pathwise optimal transport bounds between a one-dimensional diffusion and its Euler scheme. Ann. Appl. Probab. , 24(3):1049–1080, 2014.
- 3[3] Aurélien Alfonsi. High order discretization schemes for the CIR process: application to affine term structure and Heston models. Math. Comp. , 79(269):209–237, 2010.
- 4[4] Aurélien Alfonsi, Masafumi Hayashi, and Arturo Kohatsu-Higa. Parametrix methods for one-dimensional reflected SD Es. In Modern problems of stochastic analysis and statistics , volume 208 of Springer Proc. Math. Stat. , pages 43–66. Springer, Cham, 2017.
- 5[5] Patrik Andersson and Arturo Kohatsu-Higa. Unbiased simulation of stochastic differential equations using parametrix expansions. Bernoulli , 23(3):2028–2057, 2017.
- 6[6] Vlad Bally, Dan Goreac, and Victor Rabiet. Regularity and stability for the semigroup of jump diffusions with state-dependent intensity. Ann. Appl. Probab. , 28(5):3028–3074, 2018.
- 7[7] Vlad Bally and Arturo Kohatsu-Higa. A probabilistic interpretation of the parametrix method. Ann. Appl. Probab. , 25(6):3095–3138, 2015.
- 8[8] Vlad Bally and Clément Rey. Approximation of Markov semigroups in total variation distance. Electron. J. Probab. , 21:Paper No. 12, 44, 2016.
