Canonical RDEs and general semimartingales as rough paths
Ilya Chevyrev, Peter K. Friz

TL;DR
This paper develops a theory of rough differential equations that includes jumps, introduces a new metric for solution continuity, and shows that general semimartingales can be lifted to rough paths, leading to new limit theorems.
Contribution
It extends rough path theory to include jump processes and semimartingales, providing new tools and results for stochastic differential equations with discontinuities.
Findings
A new metric ensures solution continuity with respect to rough paths and jumps.
Semimartingales can be canonically lifted to rough paths.
Extended BDG inequality leads to novel limit theorems for semimartingale-driven equations.
Abstract
In the spirit of Marcus canonical stochastic differential equations, we study a similar notion of rough differential equations (RDEs), notably dropping the assumption of continuity prevalent in the rough path literature. A new metric is exhibited in which the solution map is a continuous function of the driving rough path and a so-called path function, which directly models the effect of the jump on the system. In a second part, we show that general multidimensional semimartingales admit canonically defined rough path lifts. An extension of L\'epingle's BDG inequality to this setting is given, and in turn leads to a number of novel limit theorems for semimartingale driven differential equations, both in law and in probability, conveniently phrased via Kurtz-Protter's uniformly-controlled-variations (UCV) condition. A number of examples illustrate the scope of our results.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Canonical RDEs and general semimartingales as rough paths
Ilya Chevyrev label=e1][email protected] [
Peter K. Friz label=e2][email protected] [ University of Oxford\thanksmarkm1, TU Berlin\thanksmarkm2 and WIAS\thanksmarkm3
I. Chevyrev
Mathematical Institute
University of Oxford
Andrew Wiles Building
Radcliffe Observatory Quarter
Woodstock Road
Oxford OX2 6GG
United Kingdom
P. K. Friz
Institut für Mathematik
Technische Universität Berlin
Strasse des 17. Juni 136
10623 Berlin
Germany
and
Weierstraß–Institut für Angewandte
Analysis und Stochastik
Mohrenstrasse 39
10117 Berlin
Germany
Abstract
In the spirit of Marcus canonical stochastic differential equations, we study a similar notion of rough differential equations (RDEs), notably dropping the assumption of continuity prevalent in the rough path literature. A new metric is exhibited in which the solution map is a continuous function of the driving rough path and a so-called path function, which directly models the effect of the jump on the system. In a second part, we show that general multidimensional semimartingales admit canonically defined rough path lifts. An extension of Lépingle’s BDG inequality to this setting is given, and in turn leads to a number of novel limit theorems for semimartingale driven differential equations, both in law and in probability, conveniently phrased via Kurtz–Protter’s uniformly-controlled-variations (UCV) condition. A number of examples illustrate the scope of our results.
60H99,
60H10,
Càdlàg rough paths,
stochastic and rough differential equations with jumps,
Marcus canonical equations,
general semimartingales,
limit theorems,
keywords:
[class=MSC]
keywords:
\arxiv
arXiv:1704.08053 \startlocaldefs
and
t1Affiliated to TU Berlin and supported by DFG research unit FOR2402 when this project was commenced. Currently supported by a Junior Research Fellowship of St John’s College, Oxford. t2Partially supported by the European Research Council through CoG-683166 and DFG research unit FOR2402
1 Introduction
Itô stochastic integrals are well-known to violate a first order chain rule of Newton–Leibniz type, as is manifest from Itô’s formula. In a number of applications, is is important to have a chain rule which, in the context of continuous semimartingales, was achieved in a satisfactory way by Stratonovich stochastic integration, which - loosely speaking - replaces left-point evaluation (in Itô–Riemann sums) by a symmetric mid-point evaluation. In the case of stochastic integration against general semimartingales (Lévy processes as an important special case), one can check that the Stratonovich integral no longer gives a chain rule – a more sophisticated approach is necessary to take care of jumps and the mechanism for doing this was developed by Marcus [39, 40]. The resulting “Marcus canonical integration” and “Marcus canonical (stochastic differential) equations” (in the terminology of [1]) was then investigated in a number of works, including [23, 29, 22, 3, 2, 31, 30, 24], see also [1, 8] and the references therein.
On the other hand, continuous stochastic integration has been understood for some time in the context of rough path theory, see [38, 12, 17], [20, Ch. 14]. Loosely speaking, given a multidimensional continuous semimartingale , the Stratonovich integral can be given a robust (pathwise) meaning in terms of , a.e. realization of which constitutes a geometric rough path of finite -variation for any . In contrast to the popular class of Hölder rough paths (usually sufficient to deal with Brownian motion, see e.g. [18]), -variation has the advantage that it immediately allows for jumps. This also prompts the remark that Young theory, somewhat the origin of Lyons’ rough paths, by no means requires continuity. Extensions of rough path theory to a general -variation setting (for possibly discontinuous paths) were then explored in [47], [19] and [9]. However, none of these works provided a proper extension of Lyons’ main result in rough path analysis: continuity of the solution map as a function of the driving rough path.
The first contribution of this paper is exactly that. We introduce a new metric on the space of càdlàg rough paths, and a type of (Marcus) canonical rough differential equation, for which one has the desired stability result. (Experts in la théorie générale des processus will recognize our topology as a -variation rough paths variant of Skorokhod’s strong topology.) In fact, we reserve the prefix “Marcus” to situations in which jumps only arise in the -dimensional driving signal, and are handled (in the spirit of Marcus) by connecting and by a straight line. (As a straight line has no area, this creates no jump in the area.) A “general” rough path (level , over ), however, can have jumps of arbitrary value , and there are many (different) ways to implement Marcus’s idea of continuously (parametrized over a fictitious time interval) connecting and . This is really a modelling choice, no different than choosing the driving signal and/or the driving vector fields. The notion of a “path function” helps us to formalize this, and indeed one may view as the correct/extended rough driver.
In the second part of the paper, we show how general (càdlàg) semimartingales fit into the theory. In particular, we show that the canoncial lift of a semimartingale indeed is a.s. a (geometric) rough path of finite -variation for any (several special cases, including Lévy processes, were discussed in [47, 19] but the general case remained open). Our result is further made quantitative by establishing a BDG inequality for general local martingale rough paths. (We thus extend simultaneously the classical -variation BDG inequality [36], and its version for continuous local martingale rough paths [17]). This BDG inequality turns out to be a powerful tool, especially in conjunction with uniform tightness (UT) and uniformly controlled variation (UCV) of semimartingale sequences. (Introduced by [26] and [32] respectively, these conditions are at the heart of basic convergence theorems for stochastic integrals in Skorokhod topology; we work only with UCV in this article, but note that UCV and UT are equivalent under extra assumptions, e.g., convergence in law [33, Thm 7.6]. See Section 4.2 for the definition of UCV, and also [12] for some links to continuous semimartingale rough paths.)
As an example of an application to general semimartingale theory, we are able to state a criterion for convergence in law (resp. in probability) of Marcus SDEs, which is an analogue of the celebrated criterion for Itô SDEs due to Kurtz–Protter [32, Thm. 5.4] (we emphasize however that neither criterion is a simple consequence of the other). Loosely speaking, the result asserts that if are -valued semimartingales such that satisfies UCV and in law (resp. in probability) for the Skorokhod topology, then the solutions to Marcus SDEs driven by (along fixed vector fields) converge in law (resp. in probability) to the Marcus SDE driven by (see Theorem 4.18 for a precise formulation). Our theorem (which crucially involves rough paths in the proof, but not in the statement) entails a pleasantly elegant approach to the Wong–Zakai theorem for SDEs with jumps (Kurtz–Protter–Pardoux [31], with novel interest from physics [8]) and in fact gives a number of novel limit theorems for Marcus canonical SDEs (see Theorem 4.29). We remark further that homeomorphism and diffeomorphism properties of solution flows are straightforward, in contrast to rather lengthy and technical considerations required in a classical setting (see, e.g., [1, p. 423] and the references therein). At last, we discuss the impact of more general path functions, noting that the “Marcus choice” really corresponds to the special case of the linear path function.
The paper is organized as follows. In Section 2 we collect some necessary preparatory material, including basic properties of path functions. In Section 3 we give meaning to canonical RDEs, for which drivers are (rough path, path function) pairs , and introduce the metric for which the direct analogue of Lyons’ universal limit theorem holds. Section 4 is then devoted to applications to càdlàg semimartingale theory, particularly in connection with the UCV condition and Wong–Zakai type approximations. We briefly comment in Section 5 on the further scope of the theory.
2 Preparatory material
2.1 Wiener and Skorokhod space
Throughout the paper, we denote by and the space of continuous and càdlàg functions (paths) respectively from an interval into a metric space .
Unless otherwise stated, we equip and respectively with the uniform metric
[TABLE]
and the Skorokhod metric
[TABLE]
where denotes the set of all strictly increasing bijections of to itself, and . When we omit the interval from our notation, we will always assume it is . We recall that if is Polish, then so is the Skorokhod space with topology induced by , also known as the -topology.
We always let denote a partition of , and notation such as denotes summation over all points in (possibly without the initial/final point depending on the indexing). We let denote the mesh-size of a partition.
For , we define the -variation of a path by
[TABLE]
We use superscript notation such as to denote subspaces of paths of finite -variation. For continuous only is interesting, for otherwise is constant.
Remark 2.1*.*
At least when (or , see below) there is an immediate -variation metric and topology. Due to the fact that convergence in topology to a continuous limit is equivalent to uniform convergence, a discontinuous path cannot be approximated by a sequence of continuous paths in the metric . The same will be true for a /-variation (rough path) metric below. That said, we will propose below a useful /-variation (rough path) metric under which the space of continuous rough paths is not closed.
2.2 Rough paths
For , we let denote the step- free nilpotent Lie group over , embedded into the truncated tensor algebra (, which we equip with the Carnot-Carathéodory norm and the induced (left-invariant) metric . Recall that the step- free nilpotent Lie algebra is the Lie algebra of . The space , with , is the classical space of (continuous, weakly) geometric -rough paths as introduced by Lyons.
Unless otherwise stated, we always suppose a path starts from the identity . We denote the increments of a path by . We consider on the inhomogeneous -variation metric
[TABLE]
Unless otherwise stated, we shall always assume that and satisfy .
We let denote the closure in under the metric of the lifts of smooth paths . Recall in particular that is precisely the space of absolutely continuous paths.
We let denote a collection of vector fields in with and . For a geometric -rough path , we let denote the solution to the RDE
[TABLE]
We let denote the associated flow map , which we recall is an element of . For further details on the theory of (continuous) rough paths theory, we refer to [20].
For the purpose of his paper we have (cf. [19])
Definition 2.2**.**
Let . Any is called a general (càdlàg, weakly) geometric -rough path over . Define and say is Marcus-like if for all
[TABLE]
where is taken in .
As we will see later, any canonical lift of a general -dimensional semimartingale (with area given by ) gives rise to a Marcus-like general geometric -rough path for . The model case of Lévy processes was studied in [46, 19].
2.3 Path functions
We briefly review and elaborate on the concept of a path function introduced in [9]. Let be a metric space.
Definition 2.3**.**
A path function on is a map defined on a subset for which and for all .
For a path , we say that is a jump time of if . We call the pair admissible if for all jumps times of . We say that two admissible pairs and are equivalent, and write , if and is a reparametrization of .
We denote by the set of all admissible pairs , and by the set of all equivalence classes of admissible pairs. For a fixed path function , let denote the set of all such that is admissible.
We will often simply say that is a path function on and keep implicit the fact there is an underlying domain of definition . We point out that situations where arise naturally when studying solution maps of canonical càdlàg RDEs, see Theorem 3.13 and the discussion before it.
In the case that is a Lie group with identity element (taken in this article to always be ), we shall often assume that is left-invariant, which is to say that there exists a subset such that and
[TABLE]
In this case, it is equivalent to view as a map such that and for all , for which . Whenever we write with only one argument as , we shall always mean that it is left-invariant.
Example 2.4** (log-linear and Marcus path function).**
The prototypical example of a (left-invariant) path function on , which we shall often refer to in the paper, is the log-linear path function
[TABLE]
where is taken in . Since , we have is admissible for all . When we see a familiar special case: since , one has , and then This is precisely the “Marcus interpolation” of a càdlàg path before and after its jump. See parametric plots in Figures 2 and 2.
For we now construct a continuous path as follows. Fix a convergent series of strictly positive numbers . Let be the jump times of ordered so that , and if . Let be the number of jumps of .
Let and define the strictly increasing (càdlàg) function
[TABLE]
Note that if and only if for some . Moreover, note that the interval is of length .
Define by
[TABLE]
Denote by the increasing linear bijection from to . We finally define
[TABLE]
We note that one can recover via the time change
[TABLE]
for which it holds that
[TABLE]
Remark 2.5* (Intrinsic definition of ).*
The construction of involves an ad-hoc choice, namely the sequence and the increasing bijection . If is constructed similarly, but via a sequence , followed by another reparametrisation given by , then and are reparametrizations of one another.
Remark 2.6*.*
The construction above is similar to ones appearing in [19, 47], and is a simplification of the construction in [9]. The primary difference is that in [9] the added fictitious time for the jump depended further on the size of the jump . This extra dependence was used to show continuity of the map from , which we will not require here.
2.4 A generalisation of Skorokhod’s topology
For and , let be constructed in the same procedure as but using the series instead of .
Lemma 2.7**.**
For all , the limit
[TABLE]
exists, is independent of the choice of series , and induces a pseudometric on the set of equivalence classes .
Proof.
To show that the limit exists, note that for every , there exists such that and . Since , it follows that for every
[TABLE]
from which the existence of the limit follows. The fact that (3) is independent of the series and is zero if is straightforward. ∎
Definition 2.8**.**
Define the pseudometric on by
[TABLE]
(Usually no confusion will arise by using the abusive notation on the left-hand side.)
Note that for any fixed , induces a genuine metric on the space . We note also that the strong (i.e., ) topology on the space is a special case of the topology induced by the metric , as demonstrated by the following result.
Proposition 2.9**.**
For and the linear path function, it holds that induces the topology on the space .
Proof.
It is straightforward to verify that in this case is equivalent to the metric (see [45, Sec. 12.3.1]) which induces the topology. ∎
Remark 2.10*.*
The reader may wonder if convergence in the (Skorokhod ) metric implies, as in the classical setting, convergence in the (Skorokhod -type) pseudometric . In essence, the answer is yes, however this requires “reasonable” path functions, see Lemma 2.20.
Remark 2.11* (Restriction of time interval).*
It is trivial to see that uniform convergence of paths on implies convergence on any subinterval of , while this fails for both Skorokhod and metrics. The observation generalizes to our setting and, in particular, the metric does not behave well under restriction. Indeed, while for any the jumps of still belong to , so that is well-defined, it does not hold that implies that .
We now collect several useful definitions and lemmas concerning path functions.
Lemma 2.12**.**
Let for which
[TABLE]
where the limit is taken over some enumeration of jump times of . Let be a sequence in such that . Then for every continuity point of .
Remark 2.13*.*
Note that condition (5) is satisfied whenever is either endpoint continuous or has finite -variation (see Definitions 2.14 and 2.17 below). In particular, since càdlàg paths are uniquely determined by their continuity points, it follows from Lemma 2.12 that is a genuine metric on the space introduced in Definition 2.14.
Proof.
Suppose is a continuity point of . Then using (5) and the definition of , it holds that for every there exists such that for all sufficiently large and sufficiently small we have
[TABLE]
from which the conclusion follows. ∎
Definition 2.14**.**
For , we define the -variation of as
[TABLE]
and let denote all of finite -variation.
Moreover, a path function is called -approximating if there exists a function such that for all
[TABLE]
We say that is a -variation modulus of .
Remark 2.15*.*
Due to the invariance of -variation norms under reparametrizations, and our previous Remark 2.5, we see that there is no ambiguity in the definition of and that is well-defined.
The following lemma gives a simple criterion for a pair to have finite -variation.
Lemma 2.16** ([9] Lemma A.5).**
Let and set . Then for every , it holds that
[TABLE]
where the summations are over the jump times of .
In particular, if has a -variation modulus , then for all ,
[TABLE]
where .
Definition 2.17**.**
A path function is called endpoint continuous if
* whenever ,* 2. 2.
* for all , and* 3. 3.
* is continuous with equipped with the uniform topology.*
Moreover, we say that a function is a uniform modulus of if for all , , and for all
[TABLE]
Remark 2.18*.*
In general, it is hard to find an explicit uniform modulus of a path function (or even show that one exists). But evidently if is restricted to for a compact , then a uniform modulus exists whenever is endpoint continuous.
Example 2.19**.**
Let be the log-linear path function on . Then clearly is endpoint continuous and there exists a constant such that for all and
[TABLE]
so that the constant is a -variation modulus of .
Lemma 2.20**.**
Suppose has a uniform modulus . Then for all , it holds that .
Proof.
Suppose there exists such that and . Then for all sufficiently small there exists such that and , and the conclusion follows. ∎
3 Canonical RDEs driven by general rough paths
To ease notation, we assume throughout this section that all path spaces, unless otherwise stated, are defined on the interval and take values in . For example will be shorthand for .
3.1 Notion of solution
Following the notation of Section 2.2, let and fix a family of vector fields in for some and . For , we would like to solve the RDE
[TABLE]
Our notion of solution to this equation will depend on a path function defined on a subset , and therefore the fundamental input to an RDE will be a pair .
Definition 3.1** (Canonical RDE).**
Consider and let be the solution to the continuous RDE
[TABLE]
We define the solution to the canonical RDE
[TABLE]
by (where is given by (2)).
In the particular case that is the log-linear path function from Example 2.4, we denote the RDE simply by
[TABLE]
Remark 3.2*.*
While the continuous RDE solution clearly depends (up to reparametrization) on the choice of representative as well as the choice of , it is easy to see that is independent of these choices, and is therefore well-defined on .
Remark 3.3*.*
Observe that every continuity point of is also a continuity point , and is therefore also a continuity point of .
Remark 3.4*.*
For the log-linear path function , the solution agrees precisely with the solution to the rough canonical equation considered in [19, Def. 37, Thm. 38]. Furthermore, we shall see in Section 4 that all semimartingales admit a canonical lift to a càdlàg geometric -rough path, and that, for the log-linear path function, the solution agrees with the Marcus solution of the associated SDE (see Proposition 4.16 below).
3.2 Skorokhod-type -variation metric
We now introduce a metric on for which the RDE solution map is locally Lipschitz continuous. We first define an auxiliary metric on which is independent of any path function. Recall the inhomogeneous -variation metric from Section 2.2.
Definition 3.5**.**
For and , define
[TABLE]
Remark 3.6* (Topologies induced by and ).*
Note that and induce the same topology on . Indeed, it is sufficient to show that for all and , which follows from writing and applying dominated convergence.
Furthermore, for , and induce the same topology on . Indeed, it again suffices to show that for all and , which follows from and interpolation [20, Lem. 8.16].
However, note that and do not induce the same topology on . This can be seen from the fact that is dense in under (see Proposition 3.10 part (iii)), or from the following direct example: consider the -valued Cantor function , where is the Cantor distribution, and shifts (with for ), where . Clearly . However, choosing irrational, one can show that and are mutually singular measures (see, e.g., [13]), so that .
We note that for the case , the metric already appears in the works of Simon [44] and Williams [46] where in particular a continuity statement for RDE solutions in the Young regime appears in terms of (cf. Remark 3.19).
A drawback of the metric is that the space of continuous rough paths is closed under . In particular, this implies that is unable to describe situations in which continuous drivers approximate a discontinuous one (e.g. the Wong–Zakai theorem in [31]). We are thus motivated to introduce the following metric whose relation with is analogous to that of with .
Definition 3.7**.**
For , define the metric
[TABLE]
where is defined as at the start of Section 2.4.
Remark 3.8*.*
Note that the limit (7) exists, is independent of the choice of series , and induces a well-defined metric on , all of which follows from the same argument as in Lemma 2.7 and Remark 2.13.
Remark 3.9*.*
In light of Remark 3.6, it may seem possible to define an equivalent topology as that induced by (at least on ) by replacing by in (4) for the definition of (and thus avoid introducing altogether). However one can readily check that doing so will induce a completely different topology even on (in fact the same remark applies to replacing by when defining in Definition 2.8).
We record several basic properties of the metric space . For a path function , let denote the closure of in the metric space .
Proposition 3.10**.**
Let and a path function defined on a subset .
- (i)
The space is a separable metric space. 2. (ii)
It holds that is dense in . 3. (iii)
It holds that . 4. (iv)
If , the closure of in is precisely . In particular, . 5. (v)
For every , .
Proof.
(i) Recall that is a separable space, and therefore so is (see Remark 3.6). Since the metrics and coincide on , it follows that is separable.
(ii) For every and , it holds that . One can readily see that , from which the claim follows.
(iii) By point (ii), it suffices to show that is dense in under . This in turn follows from the fact that any can be reparametrized to be in (where the isometry is via the weak derivative) and thus lies in .
(iv) Recall by Wiener’s characterization [20, Thm. 8.22] (which relies on ) that if and only if
[TABLE]
As a consequence, for any , it holds that if and only if . Therefore, for any sequences in and in for which , it holds that , from which the conclusion follows.
(v) Since , the conclusion follows from (ii). ∎
We now record an interpolation estimate which will be helpful later. It turns out to be simpler to state in terms of a homogeneous version of the distance . See [20, Ch. 8] for the definition and basic properties of the metrics and . For we define
[TABLE]
as well as
[TABLE]
(which are well-defined metrics on by the same argument as in Lemma 2.7 and Remark 2.13). Observe that the estimate [20, Prop. 8.15] implies
[TABLE]
Moreover, to move from to , it holds that the identity map
[TABLE]
is Lipschitz on bounded sets in the direction, and -Hölder on bounded sets in the direction [20, Thm. 8.10].
Finally, the following result now follows directly from the usual interpolation estimate for the homogeneous metric [20, Lem. 8.16].
Lemma 3.11**.**
Let . For all it holds that
[TABLE]
As a consequence, we obtain the following useful embedding result.
Proposition 3.12**.**
Let and a -approximating, endpoint continuous path function defined on a subset . Then the identity map
[TABLE]
is uniformly continuous on sets of bounded -variation.
Proof.
This is a combination of Remark 2.18 and Lemmas 2.16, 2.20, and 3.11. ∎
3.3 Continuity of the solution map
An advantage of the metric is it allows us to directly carry over continuity statements about the classical (continuous) RDE solution map to the discontinuous setting. Recall the RDE (6)
[TABLE]
which is well-defined for any admissible pair .
Consider the driver-solution space . Given that is defined on , we obtain a natural path function defined on the following (necessarily strict)subset of
[TABLE]
and which is given by
[TABLE]
where we recall from Section 2.2 is the solution map for the (continuous) RDE driven along . Therefore, an admissible pair yields an admissible pair as the solution to the RDE. The following is now a consequence of Lyons’ classical rough path universal limit theorem.
Theorem 3.13** (Continuity of solution map).**
For vector fields in with and , the solution map of the RDE (6)
[TABLE]
is locally Lipschitz. In particular,
[TABLE]
implies that
[TABLE]
Furthermore the flow map
[TABLE]
is uniformly continuous on sets of bounded -variation (see Section 2.2 for the definition of ).
Remark 3.14*.*
Note that one cannot replace by for any (fixed) in the final statement of Theorem 3.13. This is a manifestation of the fact that does not behave well under restrictions to subintervals (cf. Remark 2.11).
Remark 3.15*.*
Though we don’t address this here, the Lipschitz constant appearing in Theorem 3.13 can be made to depend explicitly on and the -variation of .
Remark 3.16*.*
Note that in the second statement of Theorem 3.13, one cannot replace by in “for all continuity points of ”. Note also that this type of convergence is the one considered in the Wong–Zakai theorem of [31].
Proof.
The claim that the solution map is locally Lipschitz and that the associated flows converge follows from the corresponding result for continuous rough paths (see, e.g., [20, Thm. 10.26]). To make this explicit, consider with path function and then with path function . Write also and . By definition
[TABLE]
On the other hand, for we have . Write for the unique RDE solution to and note that , by the very definition of the path function . Note also that
[TABLE]
for every time change . It is then a direct consequence of the (local Lipschitz) continuity of the Itô-Lyons map, in the setting of continuous rough paths, that
[TABLE]
As a consequence,
[TABLE]
Finally, take on both sides to see that
[TABLE]
The claim of a.e. pointwise convergence follows from Lemma 2.12, while uniform continuity of follows as above (cf. [20, Thm. 11.12]). ∎
Remark 3.17*.*
An important feature of solutions to continuous RDEs (6) is that they can be canonically treated as geometric -rough paths , and the solution map remains locally Lipschitz for the metric (see [20, Sec. 10.4]). This allows one to use the solution as the driving signal in a secondary RDE
[TABLE]
At least with the notion of canonical RDEs considered in this article, we cannot expect this functorial nature to be completely preserved due to the fact that in general no path function can be defined on to capture the information of how traversed a jump (which, in our context, clearly impacts the solution of (9)).
However, at the cost of retaining the original driving signal , we can readily solve the secondary RDE (9). Indeed, for vector fields on and on , we consider the vector fields on defined by , where is the -th coordinate of . Then the natural solution to (9) is given by the larger RDE
[TABLE]
As a consistency check, one can readily see that, in the continuous setting, the second component of the solution to (10) coincides with the solution of (9).
If we further assume that drivers converge in Skorokhod topology, then more can be said about convergence of RDE solutions.
Proposition 3.18**.**
Let notation be as in Theorem 3.13.
- (i)
Suppose that (8) holds and that in the Skorokhod topology. Then in the Skorokhod topology. 2. (ii)
Suppose that is an endpoint continuous, -approximating path function defined on (a subset of) . Then on sets of bounded -variation, the solution map
[TABLE]
is continuous.
Proof.
(i) By Theorem 3.13, it suffices to show that is compact in the Skorokhod space . Recall that for a Polish space , a subset is compact if and only if is compact and
[TABLE]
where
[TABLE]
see, e.g., [4, Thm. 12.3]. In particular, using that and applying the first inequality in Lemma 2.16 to , we see that , where
[TABLE]
It now follows from (8) that
[TABLE]
from which we see that . Since , it follows that is compact in as required.
(ii) This follows directly from taking and applying (i) and Proposition 3.12. ∎
Remark 3.19*.*
We suspect that under suitable conditions on a fixed path function , the solution map remains locally uniformly (or even Lipschitz) continuous in the classical -variation metric defined by (1) (and thus trivially for the metric ). However there seems to be no easy way to derive this as a consequence of our main Theorem 3.13. We suspect that this could be done by carefully relating canonical with “Itô-type” non-canonical RDEs (see [19, Def. 37]), followed by a proper stability analysis of the latter. The study of such non-canonical equations was recently carried out in [21].
3.4 Young pairing and translation operator
In this section we extend the Young pairing and rough path translation operator to the càdlàg setting.
Given a path in (smooth), its canonical level- rough path lift is given by . This extends immediately to a -rough path , for , with replaced by the a priori level- information . For of finite -variation with , all other cross-integrals remain defined. This is the Young pairing of a -rough path with a -variation path called . The mapping is continuous, and the general construction, for continuous (rough) paths, is found in [20, Sec. 9.4]. The important operation of translating rough paths in some -direction, formally , can be algebraically formulated in terms of the Young pairing and has found many applications [18].
We first illustrate the difficulty of continuously extending and to càdlàg paths by showing that the addition map is not continuous as a map where we equip and with and respectively (which is reminiscent of the fact that is not a topological vector space).
Example 3.20**.**
Consider the sequences , , and , of continuous piecewise linear paths from to , where is constant on , moves linearly from [math] to over and remains constant on , while and do the same, except from [math] to and over the intervals and respectively.
One can see that converges to in , and and both converge to in , where is the linear path function on . However converges to while converges to in , where is the “Hoff” path function which moves first in the -th coordinate, and then in the -th coordinate.
Note that the limiting path in the above example is unambiguously defined (namely ) whereas the corresponding path function is not. In the following definition we circumvent this problem by choosing a priori the (log-)linear path function.
Definition 3.21**.**
Let and such that . For integers , let and be the log-linear path functions on and respectively.
For and , consider the continuous -valued path . We define by
[TABLE]
where we recall that is defined by (2) for the càdlàg path . In the case that , we let denote the log-linear path function on and define by .
Remark 3.22*.*
Due to the choice of log-linear path function, note that and have finite - and -variation respectively, so that and are well-defined as continuous rough paths. Moreover, we have the relation (see [20, Thm. 9.33]).
We now record a simple result in the case which will be helpful in the remainder of the paper. Recall from Definition 2.2 that is called Marcus-like if , i.e., where is taken in and for all .
Proposition 3.23**.**
Let notation be as in Definition 3.21. Suppose that is Marcus-like. Then for every it holds that is a weakly geometric Marcus-like -rough path with the anti-symmetric part of its second level determined for and by
[TABLE]
where the integrals are well-defined Young integrals (all other components of the anti-symmetric part are given canonically by and ). Furthermore
[TABLE]
Proof.
By definition of and the choice of log-linear path functions, we see that is indeed Marcus-like. Observe that satisfies the linear RDE in driven by itself . It follows by [19, Thm. 38] and the definition of that satisfies the canonical rough equation (in the sense of [19, Def. 37]), so that
[TABLE]
where we have used that is Marcus-like. In particular, the last term on the RHS only contributes to the symmetric part of , so that (12) follows by identifying in the appropriate components with well-defined Young integrals. The inequality (13) follows from standard (continuous) rough path estimates [20, Thm. 9.26, 9.33] along with the fact that is Marcus-like (so that upon appropriately restricting domains, we may assume , , and are -approximating). ∎
4 General multidimensional semimartingales as rough paths
4.1 Enhanced -variation BDG inequality
The main result of this subsection is the BDG inequality Theorem 4.7 for enhanced càdlàg local martingales. The proof largely follows a classical argument found in [36, 20] with the exception of Lemma 4.2 which constitutes our main novel input.
Let be an -valued semimartingale with and -valued bracket . Consider the -valued (“area”) process
[TABLE]
as an Itô integral, and the “Marcus lift” (terminology from [19]) of to a -valued process given by
[TABLE]
where is taken in the truncated tensor algebra .
Note that, if where is (another) semimartingale (w.r.t. the same filtration) and is adapted, of bounded variation, then the respective Marcus lifts of and are precisely related by the translation operator; that is, . This can be seen by combining (12) with the fact that the second level of is given precisely by the Marcus canonical integral ; see [19, Prop. 16] for details.
Recall that a function is called moderate if it is continuous and increasing, if and only if , and there exists such that for all .
Lemma 4.1** (Uniform enhanced BDG inequality).**
For any convex moderate function there exist such that for any -valued local martingale
[TABLE]
Proof.
For a process , denote . Following the proof of [20, Thm. 14.8], it suffices to show that
[TABLE]
However we have
[TABLE]
so one can apply the classical BDG inequality (e.g., [35, Thm. 2.1]) to and , and to and , which is also a convex moderate function, to obtain
[TABLE]
∎
The following lemma is the crucial step in establishing finite -variation of the lift of a local martingale.
Lemma 4.2** (Interpolation).**
For every there exists such that for every -valued local martingale
[TABLE]
Proof.
For define the increasing sequence of stopping times by and for
[TABLE]
(where of the empty set is ). Define further . Observe that (cf. [9, p. 10])
[TABLE]
Fix and denote . For consider the sequence of local martingales . Denote by the lift of , which coincides with .
It holds that
[TABLE]
and moreover for all . Thus by the uniform enhanced BDG inequality (Lemma 4.1) with , ,
[TABLE]
It follows that for
[TABLE]
and likewise for
[TABLE]
Taking and , the conclusion follows from the estimate (15). ∎
Corollary 4.3**.**
For every -valued semimartingale , , and , it holds that a.s.
[TABLE]
Proof.
Note that can be decomposed into a process of finite variation and a local martingale with jump sizes bounded by a positive constant (see, e.g., [25, Prop. 4.17, p. 42]). Denoting by the lift of , it follows from a localization argument and Lemma 4.2, that a.s.. The conclusion follows by observing that . ∎
Lemma 4.4** (Chebyshev inequality).**
For all , there exists a constant such that for every -valued local martingale and ,
[TABLE]
Proof.
This crucially uses that has finite -variation for every local martingale , and follows in exactly the same manner as [36, Lem. 1] or [20, Lem. 14.10]. ∎
Lemma 4.5** (Burkholder [6] Lemma 7.1).**
Suppose and are non-negative random variables, is a moderate function, and and such that ,
[TABLE]
and
[TABLE]
Then
[TABLE]
Lemma 4.6**.**
Let be an -valued local martingale and an adapted, non-decreasing process such that a.s., for all . Then for every moderate function (not necessarily convex), there exists such that
[TABLE]
Proof.
We follow closely the proof of [36, Prop. 2] and [20, Thm. 14.12]. Since is Marcus-like, i.e., , there exists a constant such that for all
[TABLE]
Let , , , and define the stopping times
[TABLE]
Define the local martingale with lift and note that
[TABLE]
On the event , we have , and so from (16)
[TABLE]
By Lemma 4.4, it follows that
[TABLE]
On the event , it holds that , whilst on , we have and thus
[TABLE]
It follows that
[TABLE]
and thus we have for all
[TABLE]
The conclusion now follows by applying Lemma 4.5. ∎
Theorem 4.7** (-variation rough path BDG).**
For every convex moderate function and there exists such that for every -valued local martingale
[TABLE]
Proof.
This again follows very closely the proof of [36, Prop. 2]. We may suppose (otherwise all concerned quantities are infinite). Let and . Note that is of integrable variation since
[TABLE]
so there exists a unique previsible process such that is a martingale ( is the dual previsible projection of [43, Thm. 21.4] and is a special case of the Doob–Meyer decomposition). Finally, define the martingale ; recall that is called the Davis decomposition of .
Observe that . Indeed, by construction, and, if is a stopping time, then either is totally inaccessible, in which case since is previsible, or is previsible, in which case , where is the underlying filtration (cf. [42, p. 80-81]). Hence, by Lemma 4.6, we have
[TABLE]
Since and
[TABLE]
we have
[TABLE]
Furthermore, since is convex, it follows from the Garsia-Neveu lemma (using the argument provided by [36, p. 306]) that
[TABLE]
and thus
[TABLE]
Finally, as , we obtain
[TABLE]
∎
It was seen in [19, Thm. 20] that every (level ) càdlàg -rough path admits a unique minimal jump extension as a level- rough path, . (This generalizes Lyons’ fundamental extension theorem to the jump setting.) Moreover, it is clear from the proof of [19, Thm. 20]
[TABLE]
(with a multiplicative constant that depends on ). Applied with and Marcus lift of an -valued local martingale, we obtain the following form of the BDG inequality.
Corollary 4.8** (-variation level- rough path BDG).**
For every , convex moderate function , and , there exists such that for every -valued local martingale
[TABLE]
This is a useful result in the study of expected signatures, which is, loosely speaking, the study of , with and component-wise expectation in the linear space . Since the norm of , the projection to , is bounded (up to a constant) by , we see that the very existence of the expected signature is guaranteed by the existence of all moments of . In a Lévy setting with triplet , this clearly holds whenever has moments of all orders. One can also apply this with a stopping time , e.g., the exit time of Brownian motion from a bounded domain. In either case, the expected signature is seen to exist (see also [19, Part III] and [37] for more on this).
A motivation for the study of expected signatures comes from one of the main results of [10, Sec. 6] which provides a solution to the moment problem for (random) signatures, i.e., determines conditions under which the sequence of expectations uniquely determines the law of the full signature of (see [19, Thm. 54] where the moment problem was discussed for the Lévy case, and [7, 10] for other families of random geometric rough paths).
4.2 Convergence of semimartingales and the UCV condition
As an application of the BDG inequality, we obtain a convergence criterion for lifted local martingales in the rough path space with a fixed path function , which is the main result of this subsection.
We first recall the uniformly controlled variation (UCV) condition for a sequence of semimartingales . For and , we define
[TABLE]
Note that is a continuous function on the Skorokhod space and with whenever .
Definition 4.9** (UCV, [33] Definition 7.5).**
We say that a sequence of semimartingales satisfies UCV if there exists such that for all there exist decompositions and stopping times such that for all
[TABLE]
Recall the following result of Kurtz–Protter [32, Thm. 2.2] (see also [33] Theorem 7.10 and p. 30).
Theorem 4.10**.**
Let be càdlàg adapted processes (with respect to some filtrations ). Suppose converges in law (resp. in probability) to in the Skorokhod topology as , and that is a sequence of càdlàg semimartingales satisfying UCV. Then is a semimartingale (with respect to some filtration ) and converge in law (resp. in probability) to in the Skorokhod topology as .
We can now state the main result which allows us to pass from convergence in the Skorokhod topology to convergence in rough path topology (see also Corollary 4.12).
Theorem 4.11**.**
Let be a sequence of semimartingales such that converges in law (resp. in probability) to a semimartingale in the Skorokhod topology. Suppose moreover that satisfies the UCV condition. Then the lifted processes converge in law (resp. in probability) to the lifted process in the Skorokhod space . Moreover, for every , is a tight collection of real random variables.
Proof.
Since the stochastic area is given by the Itô integral (14), the convergence in law (resp. in probability) of to is an immediate consequence of Theorem 4.10.
Let for which we can apply the UCV condition to . We next claim that is tight. Indeed, for choose so that . Let be the decomposition from the UCV condition along with the stopping times . Then there exists such that for all
[TABLE]
and
[TABLE]
where in the final line we used the enhanced BDG inequality Theorem 4.7. Using the fact that proves that is tight as claimed.
To conclude, observe that where is a process of bounded variation for which
[TABLE]
Since is tight and is a continuous function of (for the Skorokhod topology), it follows that is tight as required. ∎
Corollary 4.12**.**
Follow the notation of Theorem 4.11. Let and an endpoint continuous, -approximating path function defined on such that a.s.. Then for every , in law (resp. in probability) in the metric space .
Remark 4.13*.*
As we shall see in Proposition 4.15, a simple way to apply Corollary 4.12 is to assume that comes from the lift of a (left-invariant) path function (denoted by the same symbol) which is endpoint continuous and -approximating for some (so that a canonical lift indeed exists), and does not create area, i.e.,
[TABLE]
Since is Marcus-like, i.e., , it indeed follows that so that is well defined (which corresponds to interpolating the jumps of using ); the same of courses applies to .
Proof of Corollary 4.12.
Consider first the case of convergence in probability. By Theorem 4.11, is tight, so for every we can find such that
[TABLE]
The conclusion now follows from Proposition 3.12. For the case of convergence in law, the proof follows in a similar way upon applying the Skorokhod representation theorem [27, Thm. 3.30] to the space and the sequence . ∎
As an application of Corollary 4.12, along with the fact that piecewise constant approximations satisfy UCV [32, Ex. 3.7], we obtain the following Wong–Zakai-type result (which shall be substantially generalized in Section 4.4 using different methods).
Remark 4.14*.*
The following result resembles the Wong–Zakai theorem of [31], where it was shown that ODEs driven by approximations of the form converge to an SDE of the Marcus type. Here we are able to complement [31] by showing this for the case of genuine piecewise linear (and a variety of other) approximations. Moreover the deterministic nature of our rough path approach is able to handle anticipating initial data (see Remark 4.19).
Proposition 4.15** (Wong–Zakai with no area).**
Let be a semimartingale with Marcus lift , and let be a sequence of deterministic partitions such that . Let be the piecewise constant approximations of along the partition , and let be their (Marcus) lifts.
Let be an endpoint continuous, -approximating path function for some such that does not create area, i.e., (17) holds. By an abuse of notation, let denote also the canonical lift of , treated as a path function defined on .
- (1)
Consider the admissible pairs and in . Then for every , in probability as . 2. (2)
Let be the piecewise-* interpolation of along the partition . Let denote the flow associated to the RDE (6). Then converges in probability to as (as -valued random variables).*
Proof.
(1) follows immediately from Corollary 4.12 and the fact that satisfies UCV [32, Ex. 3.7].
For (2), note that, since does not create area, coincides (up to reparametrization) with the canonical lift of . The conclusion now follows from (1) and Theorem 3.13. ∎
We now record a relation between canonical RDEs and Marcus SDEs.
Proposition 4.16**.**
Let be a semimartingale and its Marcus lift. Then for vector fields in for some , it holds that the canonical RDE
[TABLE]
(i.e., the path function is the taken to be log-linear) coincides a.s. with the Marcus SDE
[TABLE]
Proof.
Recall that, by definition, the Marcus SDE satisfies [31]
[TABLE]
where denotes the flow at time along the vector field from , i.e., the solution at to , .
Recall likewise that, by definition, the rough canonical equation (in the sense of [19, Def. 37]) satisfies
[TABLE]
which agrees with the solution to the canonical RDE (18) from Definition 3.1 (see [19, Thm. 38]; we point out that [31] and [19] are not restricted to the case of finite activity). It remains to verify that a.s.
[TABLE]
To this end, observe that
[TABLE]
where . It follows from Lemma 4.35 that for a (deterministic) sequence of partitions with , we have a.s.
[TABLE]
from which the conclusion readily follows. ∎
Remark 4.17* (Marcus SDEs with piecewise constant driver).*
Observe that a simple special case of Proposition 4.16 (which does not require any probabilistic considerations) is a piecewise constant path , which is constant between the points of a partition . In this case, the solution to
[TABLE]
agrees, for all , with the ODE solution
[TABLE]
where is the piecewise linear path obtained from by connecting with a straight line consecutive points for all .
We are now ready to state the precise criterion for convergence in law (resp. in probability) of Marcus SDEs which was advertised in the introduction and which is analogous to the same criterion for Itô SDEs [32, Thm. 5.4].
Theorem 4.18**.**
Let be a collection of vector fields on for some . Let be a collection of (random) initial conditions in and be a collection of semimartingales such that satisfies UCV and in law (resp. in probability) as (as -valued random variables). Then the solutions to the Marcus SDEs
[TABLE]
converge in law (resp. in probability) as (in the Skorokhod topology) to the solution of the Marcus SDE
[TABLE]
Proof.
This is an immediate consequence of Corollary 4.12, Proposition 4.16, and the deterministic continuity of the solution map (part (ii) of Proposition 3.18). ∎
Remark 4.19*.*
We have not been explicit about filtrations, but of course, every semimartingale above is adapted to some filtration . In the same vain, as is standard in the context of SDEs, the initial datum is assumed to be -measurable, so that (19) makes sense as a bona fide integral equation (as recalled in the proof of Proposition 4.16).
Situations where is independent of the driving noise are then immediately handled. If, on the other hand, depends in some anticipating fashion on the driving noise, then classical SDE theory (Marcus or Itô) breaks down and ideas from anticipating stochastic calculus are necessary (such as composing the stochastic flow with anticipating initial data; in the Marcus context this would be possible thanks to [31, Thm. 3.4]). Our (essentially deterministic) rough path approach bypasses such problems entirely. We shall not pursue further application of rough paths to “anticipating Marcus SDEs” here, but note that this could be done analogously to [11].
4.3 Examples
We now give a list of examples to which Corollary 4.12, Proposition 4.15, and Theorem 4.18 apply. The main criterion of application is of course the UCV condition. For further examples of sequences of semimartingales satisfying UCV, see [32, Sec. 3]. We note that in the framework of Theorem 4.18, the UCV condition cannot in general be ommited (but see Theorem 4.29 below) as seen, e.g., in homogenization theory [34, 15, 28] and non-standard approximations to Brownian motion [41, 16].
Example 4.20** (Piecewise constant approximations).**
Let be a càdlàg semimartingale and be its piecewise constant approximation (see Figure 5) along a sequence of deterministic partitions such that . Then by Theorem 4.18, the solutions to
[TABLE]
converge in probability (for the Skorokhod topology) to the solution of
[TABLE]
Moreover, if is continuous, then converges in probability for the uniform topology on to (which is now also the solution to the Stratonovich SDE).
Example 4.21** (Piecewise linear approximations).**
Let be a càdlàg semimartingale and now let be its piecewise linear (i.e., Wong–Zakai) approximation (see Figure 5) along a sequence of deterministic partitions such that . Consider the solutions to random ODEs
[TABLE]
and the Marcus SDE
[TABLE]
Then, by Proposition 4.15, it holds that in probability.
Moreover, if is continuous, then in light of the last part of Example 4.20 and Remark 4.17, converges in probability for the uniform topology on to (which, we emphasize again, is now the solution to the Stratonovich SDE), which agrees with the classical Wong–Zakai theorem for continuous semimartingales.
Example 4.22** (Donsker approximations to Brownian motion).**
Consider an -valued random walk with iid increments and finite second moments, rescaled so that in law. Here we treat as either piecewise constant or interpolated using any sufficiently nice path function which does not create area, i.e., satisfies (17) (e.g., piecewise linear). Then satisfies UCV, so by Corollary 4.12 we again have convergence of the Marcus SDEs (or random ODEs in case of continuous interpolations)
[TABLE]
in law for the uniform topology on to the Stratonovich limit
[TABLE]
This is a special case of [9, Ex. 5.12] (see also Example 4.23) which improves the main result of Breuillard et al. [5] in the sense that no additional moment assumptions are required (highlighting a benefit of -variation vs. Hölder topology).
Example 4.23** (Null array approximations to Lévy processes).**
Generalizing Example 4.22, consider a null array of -valued random variables , i.e., , and, for every , are independent. Consider the associated random walk
[TABLE]
Suppose in law for a Lévy process (see [27, Thm. 13.28] for necessary and sufficient conditions for this to occur), which in particular implies that for some
, 2. 2.
, and 3. 3.
* for every which is identically zero on a neighbourhood of zero,*
where , , and are determined by the Lévy triplet of (in particular is the Lévy measure of ). As a consequence, it is immediate to verify that satisfies UCV. By Theorem 4.18, the solutions to
[TABLE]
converge in law (for the Skorokhod topology) to the solution of the Marcus SDE
[TABLE]
If, once more, are interpolated using any sufficiently nice path function (which in particular does not create area (17)), an application of Corollary 4.12 implies that (now solutions to random ODEs) converge in law to (now the solution to the random RDE driven by ). Note that if is allowed to create area and are further assumed iid for every , then this is precisely the case addressed in [9, Ex. 5.12] (though in this case one must consider a non-Marcus lift of similar to the upcoming Theorem 4.29).
Example 4.24** (Martingale CLT).**
Let be a sequence of -valued càdlàg local martingales. Suppose that, as ,
[TABLE]
where is continuous and deterministic. Then , where is a continuous Gaussian process with independent increments and [14, Thm. 1.4, p. 339], and moreover the UCV condition is satisfied [33, p. 26]. Therefore solutions to
[TABLE]
converge in law for the uniform topology on to the solution of the Stratonovich SDE
[TABLE]
4.4 Wong–Zakai revisited
In this subsection we significantly expand Proposition 4.15 by showing convergence in probability of very general (area-creating) interpolations of càdlàg semimartingales. If the interpolation creates area, we in general no longer expect to converge to the Marcus lift of (which is the reason one cannot apply Proposition 4.15), and therefore we first modify the lift appropriately.
Throughout the section, we fix a (left-invariant) -approximating path function for some (which we take to be defined on the entire space only for simplicity). Consider the two maps
[TABLE]
defined uniquely by
[TABLE]
(so that is the area generated by the path ).
Let us also fix a càdlàg semimartingale and a sequence of deterministic partitions such that . Consider the following assumption.
Assumption 4.25**.**
There exists a càdlàg bounded variation process such that
[TABLE]
Before stating the Wong–Zakai theorem, we give several examples of for which Assumption 4.25 is satisfied.
Example 4.26** (No area).**
If does not create area for all , so that (e.g., when is the linear interpolation on ), then evidently Assumption 4.25 is satisfied with .
Example 4.27** (Hoff-type process).**
Suppose that , so that . Let travel to first linearly along the -coordinate and then linearly along the -coordinate:
[TABLE]
Then , so that Assumption 4.25 is satisfied with .
Example 4.28** (Regular ).**
Suppose more generally that is twice differentiable at [math], so that by Lemma 4.31
[TABLE]
We see in this case that Assumption 4.25 is satisfied with
[TABLE]
For a partition , let be the piecewise- interpolation of along , and its canonical lift. The following is the main result of this subsection.
Theorem 4.29** (Wong–Zakai).**
Suppose that Assumption 4.25 is satisfied. Let be the modified level-2 lift of defined by
[TABLE]
where is the Marcus lift of . Suppose further that is endpoint continuous.
- (1)
Consider the admissible pair . Then for every , it holds that
[TABLE] 2. (2)
Let denote the flow associated to the RDE (6). Then converges in probability to (as a -valued random variable).
Remark 4.30*.*
Note that the jumps of must necessarily be of the form . Hence , so that indeed and is well-defined (this is an abuse of notation since is a path function on , but because of finite -variation, it can be canonically lifted to a path function ).
For the proof of the theorem, we require several lemmas.
Lemma 4.31**.**
Let be a -variation modulus of (see Definition 2.14). Then
[TABLE]
Proof.
This is immediate from the property
[TABLE]
∎
The following lemma essentially involves no probability.
Lemma 4.32**.**
Suppose is endpoint continuous. Then for a.e. sample path , it holds that for all there exists , such that for all partitions with , there exists , such that for all , there exists such that
[TABLE]
Proof.
Since is càdlàg, for every we can find sufficiently small and a partition so that and
[TABLE]
Then whenever , for every , there exists at most one such that . Now using the fact that is -approximating and endpoint continuous, the claim readily follows. ∎
Lemma 4.33**.**
The family of real random variables is tight.
Remark 4.34*.*
As the proof of Lemma 4.33 will reveal, the biggest difficulty is overcome by the enhanced BDG inequality for càdlàg local martingales (Theorem 4.7). We wish to emphasize that the lemma is even helpful in the context of a rough paths proof of the Wong–Zakai theorem for continuous semimartingales with piecewise linear interpolations (so that ), since an analogous tightness result is still needed in this case and is non-trivial (cf. [20, Thm. 14.15]).
We also mention that part (1) of Proposition 4.15 in particular shows that is tight, which can significantly simply the proof of Lemma 4.33 (at least if one restricts attention to the family ). However we give a direct proof of the general result here.
Proof of Lemma 4.33.
We can decompose (non-uniquely) where is of bounded variation and is a local martingale with jumps bounded by some (e.g., [25, Prop. 4.17, p. 42]). Using the localizing sequence , we may further suppose that is bounded.
For a partition , consider the piecewise constant path which is constant on and for every . Consider also the piecewise constant semimartingale which is constant on and for every . Let be the (Marcus) lift of . By definition of , we have for all
[TABLE]
It follows that
[TABLE]
where .
Observe that is a reparametrization of (where we use the same abuse of notation as in Remark 4.30), so in particular
[TABLE]
Moreover, defining the piecewise constant local martingale , its lift , and the piecewise constant path of bounded variation in the same way as , we note that . Following Lemma 2.16, it suffices to show that the families
[TABLE]
are tight. This in turn follows respectively from Lemma 4.31, the enhanced BDG inequality Theorem 4.7, and the fact that . ∎
Proof of Theorem 4.29.
Note that (2) follows directly from (1) and Theorem 3.13. To show (1), note that a.s. (by Corollary 4.3), so following Lemma 4.33 and interpolation (Lemma 3.11), it suffices to show that to show that
[TABLE]
Let and denote the stochastic area of and respectively. We have for
[TABLE]
and so by the Campbell–Baker–Hausdorff formula
[TABLE]
Likewise
[TABLE]
Recalling further that and , it follows that for all
[TABLE]
Combining Lemma 4.32 with Assumption 4.25, we see that the proof is complete once we show
[TABLE]
which in turn follows from Lemma 4.35. ∎
4.5 Appendix: Vanishing areas
Lemma 4.35**.**
Let be a càdlàg semimartingale, a locally bounded previsible process, and a sequence of deterministic partitions of such that . Define as Itô integrals. Then
[TABLE]
Remark 4.36*.*
In the case , observe that Lemma 4.35 is an immediate consequence of the convergence in part (1) of Proposition 4.15 (where is taken as the piecewise linear interpolation).
Proof.
As in the proof of Lemma 4.33, we can decompose , where is of bounded variation and is a local martingale with bounded jumps (e.g., [25, Prop. 4.17, p. 42]). Using the localizing sequence
[TABLE]
we may further suppose that and are uniformly bounded and that is bounded on by and is constant on . We now write
[TABLE]
For a fixed càdlàg sample path , for every we can find sufficiently small and a partition so that and
[TABLE]
Then whenever , for every , there exists at most one such that .
Since is a process of finite variation, if then
[TABLE]
and if no is in , then the upper bound is . Denoting by the interval in containing , we then have
[TABLE]
from which it follows that the LHS converges to zero a.s. as . It remains to show that
[TABLE]
in probability as . By Itô isometry
[TABLE]
As before, if is in , then
[TABLE]
and if no is in , then the upper bound is . Hence
[TABLE]
from which it follows that the LHS converges to zero a.s. as . As is bounded (so in particular bounded in ), we obtain by dominated convergence
[TABLE]
Finally, applying the classical BDG inequality to the discrete-time martingale
[TABLE]
we see that (20) holds in , and thus in probability as desired. ∎
5 Beyond semimartingales
We have seen in the previous section that (general) multi-dimensional semimartingales give rise to (càdlàg) geometric -rough paths. Marcus lifts of general semimartingales provide us with concrete and important examples of driving signals for canonical RDEs, providing a decisive and long-awaited [47] rough path view on classical stochastic differential equations with jumps. We now discuss several examples to which the theory of Section 3 can be applied which fall outside the scope of classical semimartingale theory.
5.1 Semimartingales perturbed by paths of finite -variation
Keeping focus on -valued processes and their (canonical) lifts, we remark that any process with a decomposition , where is a semimartingale and is a process with finite -variation for some , admits a canonical Marcus lift given by . Note that due to the deterministic nature of the (Young) integrals used to construct , we require no adaptedness assumptions on . We summarize the existence of the lift in the following proposition, which is an immediate consequence of Proposition 3.23.
Proposition 5.1**.**
Let and consider an -valued random variable , and an -valued semimartingale . Write for the area of and define its Marcus lift . Then the process admits a canonical lift, given by , which is a Marcus-like geometric -rough path for any .
We mention that the class of paths with such a decomposition contains some well-studied processes.
Example 5.2** (PII).**
The important class of processes with independent increments (PII) goes beyond semimartingale theory. In fact, every such process can be decomposed (non-uniquely) as , where is a PII and a semimartingale and is a deterministic càdlàg path. Moreover, is a semimartingale if and only if has finite variation on compacts. Provided that the process has finite -variation for some , we immediately see that admits a lift to a (càdlàg) geometric -rough path for any .
We note that there is a natural interest in differential equations driven by PIIs (under the assumptions of Proposition 5.1 this is meaningful!) since the resulting (pathwise) solutions to canonical RDEs
[TABLE]
will be (time-inhomogeneous) Markov processes. A further study and characterization of such processes seems desirable. (For instance, they may not be nicely characterized by their generator. Consider the case with .)
5.2 Markovian and Gaussian càdlàg rough paths
One can use Dirichlet forms to construct Markovian rough paths which are not lifts of semimartingales. In the continuous setting this has been developed in detail in [17]. Including a non-local term in the Dirichlet form will allow to extend this construction to the jump case, but we will not investigate this here. We also note that Gaussian càdlàg rough paths can be constructed; as in the continuous theory, the key condition is finite -variation of the covariance (cf. [18, Sec. 10.2]) but without assuming its continuity.
5.3 Group-valued processes
This point was already made in the context of Lévy rough paths [19, 9], which substantially generalizes the notion of the Marcus lift of an -valued Lévy process (such processes, for example, arise naturally as limits of stochastic flows, see [9, Sec. 5.3.1]). In the same spirit, one can defined “genuine semimartingale rough paths” as -valued process with local characteristics modelled after Lévy (rough path) triplets. (Remark that the Lie group is, in particular, a differentiable manifold so that the theory of manifold-valued semimartingales applies. The issue is to identify those which constitute (geometric) rough paths, which can be done analyzing the local characteristics, as was done in the Lévy case in the afore-mentioned papers. In the same spirit, the afore-mentioned Dirichlet-form construction also extends to the group setting.)
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] {bbook} [author] \bauthor \bsnm Applebaum, \bfnm David \binits D. ( \byear 2009). \btitle Lévy processes and stochastic calculus, \bedition second ed. \bseries Cambridge Studies in Advanced Mathematics \bvolume 116. \bpublisher Cambridge University Press, Cambridge. \bdoi 10.1017/CBO 9780511809781 \bmrnumber 2512800 \endbibitem
- 2[2] {barticle} [author] \bauthor \bsnm Applebaum, \bfnm David \binits D. and \bauthor \bsnm Kunita, \bfnm Hiroshi \binits H. ( \byear 1993). \btitle Lévy flows on manifolds and Lévy processes on Lie groups. \bjournal J. Math. Kyoto Univ. \bvolume 33 \bpages 1103–1123. \bdoi 10.1215/kjm/1250519132 \bmrnumber 1251218 \endbibitem
- 3[3] {barticle} [author] \bauthor \bsnm Applebaum, \bfnm David \binits D. and \bauthor \bsnm Tang, \bfnm Fuchang \binits F. ( \byear 2001). \btitle Stochastic flows of diffeomorphisms on manifolds driven by infinite-dimensional semimartingales with jumps. \bjournal Stochastic Process. Appl. \bvolume 92 \bpages 219–236. \bdoi 10.1016/S 0304-4149(00)00091-0 \bmrnumber 1817587 \endbibitem
- 4[4] {bbook} [author] \bauthor \bsnm Billingsley, \bfnm Patrick \binits P. ( \byear 1999). \btitle Convergence of probability measures, \bedition second ed. \bseries Wiley Series in Probability and Statistics: Probability and Statistics. \bpublisher John Wiley & Sons, Inc., New York \bnote A Wiley-Interscience Publication. \bdoi 10.1002/9780470316962 \bmrnumber 1700749 \endbibitem
- 5[5] {barticle} [author] \bauthor \bsnm Breuillard, \bfnm Emmanuel \binits E., \bauthor \bsnm Friz, \bfnm Peter \binits P. and \bauthor \bsnm Huesmann, \bfnm Martin \binits M. ( \byear 2009). \btitle From random walks to rough paths. \bjournal Proc. Amer. Math. Soc. \bvolume 137 \bpages 3487–3496. \bdoi 10.1090/S 0002-9939-09-09930-4 \bmrnumber 2515418 \endbibitem
- 6[6] {barticle} [author] \bauthor \bsnm Burkholder, \bfnm D. L. \binits D. L. ( \byear 1973). \btitle Distribution function inequalities for martingales. \bjournal Ann. Probability \bvolume 1 \bpages 19–42. \bmrnumber 0365692 \endbibitem
- 7[7] {barticle} [author] \bauthor \bsnm Cass, \bfnm Thomas \binits T. and \bauthor \bsnm Ogrodnik, \bfnm Marcel \binits M. ( \byear 2017). \btitle Tail estimates for Markovian rough paths. \bjournal Ann. Probab. \bvolume 45 \bpages 2477–2504. \bmrnumber 3693967 \endbibitem
- 8[8] {barticle} [author] \bauthor \bsnm Chechkin, \bfnm Alexei \binits A. and \bauthor \bsnm Pavlyukevich, \bfnm Ilya \binits I. ( \byear 2014). \btitle Marcus versus Stratonovich for systems with jump noise. \bjournal J. Phys. A \bvolume 47 \bpages 342001, 15. \bdoi 10.1088/1751-8113/47/34/342001 \bmrnumber 3251981 \endbibitem
