On Multidimensional stable-driven Stochastic Differential Equations with Besov drift
Paul-Eric Chaudru de Raynal (LMJL), St\'ephane Menozzi (LaMME, HSE)

TL;DR
This paper proves the well-posedness of multidimensional stable-driven SDEs with singular Besov space drifts, using PDE smoothing properties and duality techniques, extending the understanding of such stochastic systems.
Contribution
It establishes the existence and uniqueness of solutions for multidimensional stable-driven SDEs with singular Besov drifts, a novel result in this context.
Findings
Well-posedness of the martingale problem established.
Defined a meaningful notion of weak solution for these SDEs.
Utilized PDE smoothing properties and duality in Besov spaces.
Abstract
We establish well-posedness results for multidimensional non degenerate -stable driven SDEs with time inhomogeneous singular drifts in with and in , where and stand for Lebesgue and Besov spaces respectively. Precisely, we first prove the well-posedness of the corresponding martingale problem and then give a precise meaning to the dynamics of the SDE. This allows us in turn to define an ad hoc notion of weak solution, for which well-posedness holds as well. Our results rely on the smoothing properties of the underlying PDE, which is investigated by combining a perturbative approach with duality results between Besov spaces.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On Multidimensional stable-driven Stochastic Differential Equations with Besov drift
Paul-Éric Chaudru de Raynal
Nantes Université, CNRS, Laboratoire de Mathématiques Jean Leray, LMJL, F-44000 Nantes, France
and
Stéphane Menozzi
Université d’Evry Val d’Essonne, Paris Saclay, Laboratoire de Mathématiques et Modélisation d’Evr, UMR CNRS 8070, 23 Boulevard de France 91037 Evry, France
and Laboratory of Stochastic Analysis, HSE, Prokrovsky Boulevard 11, Moscow, Russian Federation
Abstract.
We establish well-posedness results for multidimensional non degenerate -stable driven SDEs with time inhomogeneous singular drifts in with and in , where and stand for Lebesgue and Besov spaces respectively. Precisely, we first prove the well-posedness of the corresponding martingale problem and then give a precise meaning to the dynamics of the SDE. This allows us in turn to define an ad hoc notion of weak solution, for which well-posedness holds as well. Our results rely on the smoothing properties of the underlying PDE, which is investigated by combining a perturbative approach with duality results between Besov spaces.
1. Introduction
1.1. Statement of the problem and related literature
We are here interested in providing a well-posedness theory for the following formal -dimensional stable driven SDE. For a fixed , :
[TABLE]
where in the above equation is a -dimensional symmetric -stable process, for some in (thus including Brownian noise).
The main point here comes from the fact that the drift is only supposed to belong to the space \mathbb{L}^{r}({\color[rgb]{0,0,0}[0,T]},{\mathbb{B}}_{p,q}^{-1+\gamma}({\mathbb{R}}^{d},{\mathbb{R}}^{d})), where denotes a Besov space. In a nutshell, when , for any non integer , Besov spaces coincide with Hölder spaces ; when , this somehow indicates that the Hölder modulus blows up at rate . The parameters and are related to the integrability of such a modulus. We refer to Section 2.6.4 of [Tri83] and Section 3.1 below for rigorous definition.
The parameters (p,q,\gamma,{\color[rgb]{0,0,0}r}) s.t. , will have to satisfy some constraints to be specified later on in order to give a meaning to (1). Importantly, assuming the parameter to be strictly less than implies that can even not be a function, but just a distribution, so that it is not clear that the integral part in (1) has any meaning, at least as this. This is the reason why, at this stage, we talk about “formal -dimensional stable SDE” or “formal SDE (1)”. There are many approaches to tackle such a problem which mainly depend on the choice of the parameters and the dimension . Let us now try to review some of them.
The Brownian setting: . There already exists a rather large literature about singular/distributional SDEs of type (1). Let us first mention the work by Bass and Chen [BC01] who derived in the Brownian scalar case the strong well-posedness of (1) when the drift writes (still formally) as , for a spatial function being -Hölder continuous with and for a multiplicative noise associated with , i.e. the additive noise in (1) must be replaced by . The key point in this setting is that the underlying generator associated with the SDE writes as {\color[rgb]{0,0,0}L=(1/2)\partial_{x}\big{(}a^{2}\partial_{x}\big{)}}. From this specific divergence form structure, the authors manage to use the theory of Dirichlet forms of Fukushima et al. (see [FOT10]) to give a proper meaning to (1). Importantly, the formal integral corresponding to the drift has to be understood as a Dirichlet process. Also, in the particular case where the distributional derivative of is a signed Radon measure, the authors give an explicit expression of the drift of the SDE in terms of the local time (see Theorem 3.6 therein). In the multi-dimensional Brownian case, Bass and Chen have also established weak well-posedness of SDE of type (1) when the homogeneous drift belongs to the Kato class, see [BC03].
Many authors have also recently investigated SDEs of type (1) in both the scalar and multidimensional Brownian setting for time inhomogeneous drifts in connection with some physical applications. From these works, it clearly appears that handling time inhomogeneous distributional drift can be a more challenging question. Indeed, in the time homogeneous case, denoting by an antiderivative of , one can observe that the generator of (1) can be written in the form (1/2)\exp(-2{\mathbf{F}}(x))\partial_{x}\big{(}\exp(2{\mathbf{F}}(x))\partial_{x}\big{)} and the dynamics can again be investigated within the framework of Dirichlet forms (see e.g. the works by Flandoli, Russo and Wolf, [FRW03], [FRW04]). The crucial point is that in the time inhomogeneous case such connection breaks down.
In this time inhomogeneous framework, we can mention the work by Flandoli, Issoglio and Russo [FIR17] for drifts in fractional Sobolev spaces. The authors establish therein the existence and uniqueness of what they call virtual solutions to (1): such solutions are defined through the diffeomorphism induced by the Zvonkin transform in [Zvo74] which is precisely designed to get rid of the bad drift through Itô’s formula. Namely, they investigated the smoothness properties of the underlying PDE (with the drift as source term and null terminal condition) which formally writes111The PDE investigated in [FIR17] slightly differs from the one introduced here (additional potential term therein). However, the resulting Zvonkin transforms are somehow equivalent. We have thus chosen to present their result according to the approach we adopted here for the sake of clarity.,
[TABLE]
We said formal because above, it is not clear that the product is meaningful, since product between two distributions can only be defined under suitable constraints.
In their work, they managed to prove that this product indeed makes sense and that this PDE admits a unique mild solution (see Definition 8 therein), i.e.
[TABLE]
where denotes the usual heat semi-group, which belongs to a suitable function space, allowing them to define the Zvonkin transform . The authors then introduce the notion of virtual solution. Namely, this is a process defined on a stochastic basis such that:
[TABLE]
The main advantage of such a definition is that the bad drift does not explicitly appears. It actually would from a formal expansion of solving (2) through Itô’s formula. This correspondence also gives that, for reasonable smooth drifts, classical solutions also are virtual solutions. Existence and uniqueness for virtual solutions are then established from the fact that can be shown, from the well-posedness of the mild solution (3), to be a -diffeomorphism for small enough.
The SDE
[TABLE]
has indeed itself a unique weak solution from the smoothness of solving (2) and is a virtual solution for which uniqueness in law holds. As a consequence of this approach only few things can be said about the original dynamics.
In this last perspective, we can also refer to the work of Zhang and Zhao [ZZ17], who established in the time homogeneous case the well-posedness of the martingale problem for the generator associated with (1), which can in their framework additionally contain a non trivial smooth enough diffusion coefficient (see also Remarks 6 and 15 below). Therein, they obtained as well as some Krylov type density estimates in Bessel potential spaces for the solution. Also, they managed to derive a more precise description of the limit drift in the formal dynamics in (1), which is interpreted as a suitable limit of a sequence of mollified drifts, i.e. for a sequence of smooth functions converging to in a suitable sense.
The key point in these works, who heavily rely on PDE arguments, is to establish: (i) that the product in (2) is meaningful as a distribution (thus with the same regularity as ); (ii) that the semi-group associated with the noise maps the quantity in the bracket in right hand side of (3) onto a suitable function space, say at least to define properly. Let illustrate how such constraints translate, assuming for a while that is time homogeneous and belongs to the Besov-Hölder space . From the smoothing effect of the noise (parabolic bootstrap), one expects that the semi-group maps onto (Schauder estimates). The second constraint (ii) then gives . On the other hand, as belongs to , the first constraint (i) gives, from Bony’s paraproduct rule, that the sum of the regularity indexes of and must be strictly positive: .
This is indeed the threshold appearing in [FIR17] and [ZZ17] as well as the one previously obtained in [BC01]. This is precisely the threshold that will guarantee well-posedness of the corresponding Martingale problem and weak well-posedness of the associated dynamics, with the Definitions of Section 1.3 below (see Definitions 2 and 4), for a drift F\in{\mathbb{L}}^{\infty}({\color[rgb]{0,0,0}[0,T]},{\mathbb{B}}_{\infty,\infty}^{-1+\gamma}{\color[rgb]{0,0,0}({\mathbb{R}}^{d},{\mathbb{R}}^{d})}) in the present work.
To bypass such a limit (i.e. ), one therefore has to use a suitable theory in order to first give a meaning to the product . This is, for instance, precisely the aim of either rough paths, regularity structures or paracontrolled calculus. However, as a price to pay to enter this framework, one has to add some structure to the drift assuming that this latter can be enhanced into a rough path structure. In the scalar Brownian setting, and in connection with the KPZ equation, Delarue and Diel [DD16] used such specific structure to extend the previous results for an inhomogeneous drift which can be viewed as the generalized derivative of with Hölder regularity index greater than (i.e. assuming that belongs to {\mathbb{L}}^{\infty}([0,T],{\mathbb{B}}^{(-{\color[rgb]{0,0,0}2}/3)^{+}}_{\infty,\infty}{\color[rgb]{0,0,0}({\mathbb{R}},{\mathbb{R}})}). Importantly, in [DD16] the authors derived a very precise description of the meaning of the formal dynamics (1): they show that the drift of the solution may be understood as stochastic-Young integral involving the mollification of the distribution by the transition density of the underlying noise. As far as we know, it appears to us that such a description is the most accurate that can be found in the literature on stochastic processes (see [CG16] for a pathwise version and Remark 18 in [DD16] for some comparisons between the two approaches). With regard to the martingale problem, the result of [DD16] has then been extended to the multidimensional setting by Cannizzaro and Choukh [CC18], but nothing is said therein about the dynamics.
The pure jump case: . In the pure jump case, there are a few works concerning the well-posedness of (1) in the singular/distributional case. Even for drifts that are functions, strong uniqueness was shown rather recently. Let us distinguish two cases: the sub-critical case , in this case the noise dominates the drift (in term of self-similarity index ) and the super-critical case where the noise does not dominate. In the first case, we can refer for bounded Hölder drifts to Priola [Pri12] who proved that strong uniqueness holds (for time homogeneous) functions in (1) which are -Hölder continuous provided . In the second case, the strong well-posedness has been established under the same previous condition by Chen et al. [CZZ17]. Those results are multi-dimensional. In any cases, the threshold obtained does not allow the Authors to consider singular/distributional drift, so that these results are not comparable with the present work.
On the other hand, in the current distributional framework, and in the scalar case, the martingale problem associated with the formal generator of (1) has been recently investigated by Athreya, Butkovski and Mytnik [ABM20] for and a time homogeneous F\in{\mathbb{B}}_{\infty,\infty}^{-1+{\color[rgb]{0,0,0}\gamma}}{\color[rgb]{0,0,0}({\mathbb{R}},{\mathbb{R}})} under the condition: -1+{\color[rgb]{0,0,0}\gamma}>(1-\alpha)/2\Leftrightarrow\gamma>(3-\alpha)/2. After specifying how the associated dynamics can be understood, viewing namely the drift as a Dirichlet process (similarly to what was already done in the Brownian case in [BC01]), they eventually manage to derive strong uniqueness under the previous condition. Note that results in that direction have also been derived by Bogachev and Pilipenko in [BP15] for drifts belonging to a certain Kato class in the multidimensional setting.
Again, the result obtained by Athreya, Butkovski and Mytnik relies on the Zvonkin transform and hence requires to have a suitable theory for the associated PDE. In our pure-jump time inhomogeneous framework, it formally writes
[TABLE]
where is the generator of a non-degenerate -stable process.
Considering again a time homogeneous in , one may reproduce the analysis done before in the Brownian setting to deduce that the solution is expected to be in (as the smoothing effect of the semi-group associated to the generator is now of order ). The two constraints exposed in the Brownian setting ((i) to define the product ; (ii) to define the gradient ) translate into for (ii) and for (i). This is precisely the threshold that will guarantee weak well-posedness and pathwise uniqueness in the scalar case for a drift F\in{\mathbb{L}}^{\infty}({\color[rgb]{0,0,0}[0,T]},{\mathbb{B}}_{\infty,\infty}^{-1+\gamma}{\color[rgb]{0,0,0}({\mathbb{R}}^{d},{\mathbb{R}}^{d})}) in the present work.
1.2. Aim of the paper
In the current work, we aim at investigating a rather large framework by considering the -dimensional case , with a distributional, potentially singular in time, inhomogeneous drift (in {\mathbb{L}}^{r}([0,T],{\mathbb{B}}^{-1+\gamma}_{p,q}{\color[rgb]{0,0,0}({\mathbb{R}}^{d},{\mathbb{R}}^{d})})) when the noise driving the SDE is a symmetric -stable process, in . This setting thus includes both the Brownian and pure-jump case. In the latter case, we will also be able to consider driving noises with singular spectral measures. As previously done for the aforementioned results, our strategy relies on the idea by Zvonkin. The core of the analysis therefore consists in obtaining suitable a priori estimates on an associated underlying PDE of type (2) or (5). Namely, we will provide a Schauder like theory for the mild solution of such PDE for a large class of data. This result is also part of the novelty of our approach since these estimates are obtained thanks to a rather robust methodology based on heat-kernel estimates on the transition density of the driving noise together with duality results between Besov spaces viewed through their thermic characterization (see Section 3.1 below and Triebel [Tri83] for additional properties on Besov spaces and their characterizations). This approach does not distinguish the pure-jump and Brownian setting provided the heat-kernel estimates hold. It has for instance also been successfully applied in various frameworks, to derive Schauder estimates and strong uniqueness for a degenerate Brownian chain of SDEs (see [CdRHM18a], [CdRHM18b]) or Schauder estimates for super-critical fractional operators [CdRMP20].
More precisely, our first main result consists in deriving the well-posedness of the martingale problem introduced in Definition 2 under suitable conditions on the parameters and , see Theorem 3.
Then, under slightly reinforced conditions on and , we are able to reconstruct the dynamics for the canonical process associated with the solution of the martingale problem, see Theorem 4, specifying how the Dirichlet process associated with the drift writes. In the spirit of [DD16], we in particular exhibit a main contribution in this drift.
Inspired by the dynamics exhibited for the Martingale solution, we define next an ad hoc notion of weak solution in Definition 4 and prove the associated well-posedness result, see Theorem 6. Therein, we also manage to derive pathwise uniqueness in the scalar case, extending partially the previous results of [ABM20].
Eventually, we manage in Proposition 7 to collect all the results we have at hand to specify the dynamics of both the Martingale and weak solution. These results could be useful to investigate the numerical approximations of those singular SDEs (see equations (16) and (17)) and the recent work by De Angelis et al. [DGI19]
Let us conclude by mentioning that, while finishing the preparation of the present manuscript, we discovered a brand new preprint of Ling and Zhao [LZ19] which somehow presents some overlaps with our results. Therein, the Authors investigate a priori estimates for the elliptic version of the PDE of type (2) or (5) with (homogeneous) drift belonging to Hölder-Besov spaces with negative regularity index (i.e. in {\color[rgb]{0,0,0}{\mathbb{B}}_{\infty,\infty}^{-1+\gamma}}{\color[rgb]{0,0,0}({\mathbb{R}}^{d},{\mathbb{R}}^{d})}) and including a non-trivial diffusion coefficient provided the spectral measure of the driving noise is absolutely continuous. As an application, they derive the well-posedness of the associated martingale problem and prove that the drift can be understood as a Dirichlet process. They also obtained quite sharp regularity estimates on the density of the solution and succeeded in including the limit case .
In comparison with their results, we here manage to handle the case of an inhomogeneous and singular in time drift which can also have additional space singularities, since the integrability indexes of the parameter for the Besov space are not supposed to be (recall that we assume F\in{\mathbb{L}}^{r}([0,T],{\mathbb{B}}_{p,q}^{-1+\gamma}{\color[rgb]{0,0,0}({\mathbb{R}}^{d},{\mathbb{R}}^{d})})). Although we did not include it, we could also handle in our framework an additional non-trivial diffusion coefficient under their standing assumptions, we refer to Remarks 6 and 15 below concerning this point. It also turns out that we obtain more accurate version of the dynamics of the solution which is here, as mentioned above, tractable enough for practical purposes. We eventually mention that, as a main difference with our approach, the controls in [LZ19] are mainly obtained through Littlewood-Paley decompositions whereas we rather exploit the thermic characterization and the parabolic framework for the PDE. In this regard, we truly think that the methodology to derive the a priori estimates in both works can be seen as complementary.
1.3. Overview of the paper
We state in this part our main assumptions and results, together with rigorous definition about the meaning of solution of the formal SDE (1).
Framework. As already said, we consider the problem of the solvability, in a sense to be specified later on, of the formal SDE (1) with drift , with and . Concerning the driving noise in (1), we will denote by its generator. When , where stands for the usual Laplace operator on . In the pure-jump stable case , for all :
[TABLE]
where, writing in polar coordinates , the Lévy measure decomposes as with a symmetric non degenerate measure on the sphere . Precisely, we assume:
- (UE)
There exists s.t. for all :
[TABLE]
Observe in particular that a rather large class of spherical measures satisfy (7). From the Lebesgue measure, which actually leads, up to a normalizing constant, to (usual fractional Laplacian of order corresponding to the generator of the isotropic stable process), to sums of Dirac masses in each direction, i.e. , with standing for the canonical basis vectors, which for then yields corresponding to the cylindrical fractional Laplacian of order associated with the sum of scalar symmetric -stable processes in each direction. In particular, it is clear that under (UE), the process admits a smooth density in positive time (see e.g. [Kol00]). Correspondingly, generates a semi-group that will be denoted from now on by . Precisely, for all (space of bounded Borel functions), and all :
[TABLE]
where stands for the density of . Further properties associated with the density , in particular concerning the integrability properties of its derivatives, are stated in Section 3.1.
**Notion of solution of the formal SDE (1). **In order to foster an appropriate notion of solution for the formal SDE (1), let us start with an informal discussion which can be found as well in the work of Cannizzaro and Choukh [CC18]. We reproduce it for the sake of clarity. Let and (space of càdlag function) when . The Stroock and Varadhan formulation of the Martingale Problem associated with an operator reads as: find a probability measure on the space equipped with the canonical filtration so that
- (a)
, 2. (b)
For all in (where here stands for a rich enough function space):
[TABLE]
is a (square integrable if ) martingale under .
Above, the class has to be chosen so that it is sufficiently rich to characterize a Markov process through the martingale formulation, see e.g. [EK86]. In our current setting, the main issue comes from the fact that the operator involves a distributional part (the drift term ) so that even if the products in are well defined, this term could only be a distribution with same regularity as . To avoid such a consideration the idea consists in taking as the set of maps for which is indeed a function , i.e. should be the set of solutions of the Cauchy Problem :
[TABLE]
for and in large enough classes and . Having this in mind, one may thus rewrite the associated Martingale Problem as: find a probability measure on the space equipped with its canonical filtration so that
- (a)
, 2. (b)
For all in ,
[TABLE]
with the solution (in a sense to be specified) of the Cauchy Problem , is a (square integrable if ) martingale under .
Remark 1* (On the link between both Martingale formulation (9) and (11) in a favorable setting).*
Let us try to briefly illustrate the link between both formulations in a favorable setting. Assume first that is bounded, continuous in time and is -Hölder continuous in space. A natural choice for is so that belongs to . This thus means that should be this latter function space while should be the space . As a consequence, both formulations are equivalents. Indeed, under these conditions, for any , Schauder estimates imply that there exists a unique element that solves in a classical sense. Therefore (9) implies (11) and, conversely, (11) implies (9) choosing for any : and .
**Definition of the Martingale solutions and associated well-posedness results. **Let us come back to our general case, . The main point is now to notice that, in order to obtain the martingale formulation (11), we do not need to work with classical solutions. Up to a regularization argument and thanks to Itô’s formula, it is indeed enough to work with mild solutions in with bounded gradient. In our setting, the natural candidate for the mild solution formally writes
[TABLE]
for e.g. continuously differentiable function and continuous in time and bounded in space function . This form requires the last term on the right hand side to be defined as a function, whereas the drift is only assumed to be a distribution. We are thus again led to find appropriate conditions that guarantee that: (i) is well defined; (ii) the product is mapped onto the space of continuous in time and continuously differentiable in space functions with bounded gradient by the semi-group .
This suggests that the gradient should belong to a suitable Lebesgue-Besov space that depends both on the driving noise and the drift. The following lemma, whose proof can be found in Subsection 3.4, is useful to investigate wether the above integral makes sense as a function and also gives some hints concerning the function space in which the solution should be sought:
Lemma 1**.**
Let , and be such that they satisfy a good relation, i.e.
[TABLE]
Define then
[TABLE]
Let in and belongs to for any . Then, the map
[TABLE]
with the semi-group generated by , belongs to . Moreover, the product makes sense as an element of .
We point out that the parameter would correspond to the parabolic bootstrap, which thus also reflects the impact of the integrability indexes (but not ).
We are now in position to define the mild solution of the formal Cauchy problem and then state the associated well-posedness theorem.
Definition 1**.**
Let , and . For any given fixed , we say that is a mild solution of the formal Cauchy problem
[TABLE]
if it belongs to with in for any where is given by (12) and satisfies
[TABLE]
with the semi-group generated by .
Notice that, due to definition of the space in which the solution should be sought, Lemma 1 implies that the above mild formulation is meaningful. We now have the following well-posedness result proved in Section 3 (see in particular Subsection 3.4).
Theorem 2**.**
Let , and satisfy a good relation (GR). For all in and with , where is given by (12), the formal Cauchy problem admits a unique solution in the sense of Definition (1). Moreover it satisfies that for all in , in :
[TABLE]
We can therefore define the associated Martingale Problem and the corresponding well-posedness result proved in Section 2 (see subsection 2.2, points to ).
Definition 2**.**
Let and when . For any \alpha\in(0,2{\color[rgb]{0,0,0}]}, we say that a probability measure on equipped with its canonical filtration is a solution of the Martingale Problem associated with for if
- (i)
, 2. (ii)
, with ,
[TABLE]
is a (square integrable if ) martingale under where is the mild solution of the Cauchy Problem in the sense of Definition 1.
Theorem 3**.**
Let , and satisfy a good relation (GR). Then, the Martingale Problem associated with for , is well-posed in the sense of Definition 2. Moreover, the canonical process under is strong Markov.
Note that, according to the notations of the previous paragraph, we chose above to work with , where stands for the class of Schwartz functions. This is mainly motivated by our approach based on Besov spaces as this class is continuously embedded into any Besov spaces , , , see e.g. paragraph 2.3.3 in [Tri83]. Note that it is as well dense in provided .
Building the dynamics. One may then wonder if something can be said about the dynamics of the underlying process. In other words, the next step consists in linking the Martingale Problem and the formal SDE (1). For sufficiently smooth drifts, the starting point to build a dynamics consists in recovering the noise from the canonical process associated with the martingale solution. Since in the current framework the difficulty consists in specifying the drift, it therefore seems natural to have a noise at hand to precisely recover the drift. This is precisely what leads to consider an enlarged martingale problem which includes the noise. Similar issues were e.g. discussed in Kurtz [Kur10].
For singular drifts this procedure was performed in the specific setting of [DD16]. We here extend this connection to our general setting and explain how one can relate the Martingale solution with the dynamics of (1), and how this latter has to be understood. To do so, we will however be led to slightly reinforce the good relation (GR). Namely, we say that , and satisfy a good relation for the dynamics if the following relation holds:
[TABLE]
Note that when , the above condition and the previous one are equivalent.
Having such a condition at hand, our main strategy consists in following the ideas of [DD16]. Namely, we show that the canonical process can be decomposed into two parts: a stable driving noise, i.e. an -stable process with generator and a drift term, defined as a kind of non-linear stochastic Young integral of a regularized version of the initial drift by the density of the driving process. To the best of our knowledge, non-linear Young integrals have been introduced in [CG16] and further extended from a probabilistic perspective in [DD16]. The rigorous definition of this last object is the following.
Definition 3**.**
Let , be a filtered probability space and let be a progressively measurable process on it. Let be a continuous and progressively measurable map in the sense that for any ,
[TABLE]
is measurable and
[TABLE]
is continuous. For , we call -stochastic Young integral of with respect to the pseudo increment the limit in
[TABLE]
when it exists.
Having such tools at hand, the strategy consists in building simultaneously the Martingale solution and the noise as the solution of a kind of enlarged Martingale Problem whose entries for the underlying canonical process are, respectively, associated with the solution of the martingale problem and the corresponding driving noise. This will allow to build the drift as the difference between them. Indeed, having such a canonical process at hand, we decompose the increment of the process as
[TABLE]
where . Clearly, the first difference in the above right hand side stands for a drift term, while the second stands for a martingale part.It thus suffices to relate both parts with (i) the original drift and (ii) the -stable noise previously built. This is the purpose of the next result proved in Section 4 (see in particular Subsection 4.3 and Proposition 13 below).
Theorem 4**.**
Let , and satisfy a good relation for the dynamics (GR-D). It then holds that there exists a probability measure on when and when such that the canonical process, denoted by , satisfies
- (i)
The law of under is a solution of the Martingale problem associated with , and the law of under is a Brownian motion if and an -stable process if .
- (ii)
The dynamics of the canonical process reads
[TABLE]
where for any , ,
[TABLE]
with the (smooth) density of and where the integral in (16) is understood as an -stochastic Young integral, for any , in the sense of Definition 3.
In fact, we prove a stronger result concerning the dynamics. We show that it is possible to define a stochastic Young integral against the dynamics, leading in turn to use Itô calculus. This is done for a suitable class of progressively measurable processes satisfying appropriate Hölder regularity conditions. For any , any , we set
[TABLE]
As a corollary of the proof of Theorem 4, we have the following result.
Corollary 5** (Associated -stochastic Young integral, ).**
Under the above assumptions, one can define a stochastic Young integral w.r.t. the quantities in (16). Namely, for any , for which there exists satisfying , one has
[TABLE]
for any , for all and where the first term in the above right hand side is defined as a -stochastic Young integral.
Further properties and weak formulation. The previously described dynamics for the Martingale solution strongly suggests that a notion of weak solution associated with the formal SDE (1) can somehow be considered. This leads to the following definition.
Definition 4**.**
We call weak solution of the formal SDE (1) a pair of adapted processes on a filtered probability space such that is an -stable process and satisfies
[TABLE]
for any in and where for any , ,
[TABLE]
with the (smooth) density of and where the integral in (20) is understood as an -stochastic Young integral, in the sense of Definition 3.
We say that weak uniqueness holds for (1) if for any two weak solutions , and , with the same initial condition, then .
We then have the following well-posedness result whose proof is postponed to Section 5 (see Subsection 5.1) and thoroughly exploits that we can apply some stochastic calculus arguments to the dynamics and the smoothness of the underlying PDE (see Corollary 5 and Theorem 2 above).
Theorem 6**.**
Let , and satisfy a good relation for the dynamics (GR-D). Then,
- (i)
the formal SDE (1) admits a unique weak solution in the sense of Definition 4;
- (ii)
if , pathwise uniqueness holds, i.e. the paths of two weak solutions defined on the same probability basis coincide a.s. whenever they start from the same initial condition.
Moreover, one can define an associated -stochastic Young calculus i.e. Corollary 5 hold with therein.
Remark 2* (About the connections between the Martingale and weak solutions).*
- •
Although we obtain well-posedness for both the Martingale problem and the weak formulation associated with the formal SDE (1), we do not claim that equivalence holds between them. In fact, what we are able to prove is that weak existence implies the existence of a Martingale solution and that uniqueness for the Martingale problem implies weak uniqueness, see Remark 17. It is nevertheless more involved to prove that existence of a Martingale solution gives the weak existence and that weak uniqueness implies uniqueness for the Martingale problem. The main issue relies on the fact that the very definition of the Martingale problem does not allow one to define an associated stochastic calculus, while the very definition of a weak solution does. Without the help of Itô’s formula, we are not able to go from the Martingale solution to the weak one, which prevents us from obtaining the equivalence between both formulations.
- •
Pay attention that, in the above result (ii), we do not claim that strong uniqueness holds. This mainly comes from a measurability argument. In [ABM20], the Authors built the drift as a Dirichlet process and then recover the noise part of the dynamics as the difference between the solution and the drift allowing them in turn to work under a more standard framework (in term of measurability), and thus to use the Yamada-Watanabe Theorem. Here, we mainly recover the noise in a canonical way, through the martingale problem, and then build the drift as the difference between the solution and the noise. Such a construction allows us to give a precise meaning to the drift and the loss of measurability can be seen as the price to pay for it. Nevertheless, at this stage, one may restart with the approach of Athreya et al. [ABM20] to define an ad hoc noise as the difference between the process and the drift (which reads as a Dirichlet process), identify the objects obtained with the two approaches and then obtain suitable measurability conditions to apply the Yamada-Watanabe Theorem.
We emphasize that the above results give, to the best of our knowledge, the most accurate description of the dynamics of the formal SDE (1) we found in the literature on SDE with distributional drift. We emphasize as well that such a description includes all those we found and discussed in the Introduction. A crucial point is that, moreover, the above dynamics (for whenever the Martingale solution or the weak solution) coincide with the classical one when the drift is time-space Hölder continuous. All those facts are collected in the following proposition whose proof is given in Section 5 (see Subsection 5.2).
Proposition 7**.**
Either the Martingale solution or the weak solution of the formal SDE (1) is:
- (i)
A virtual solution of the formal SDE (1);
- (ii)
A Dirichlet process;
- (iii)
It holds that for any smooth approximating sequence such that
[TABLE]
[TABLE]
with for the weak solution;
- (iv)
If is time-space -Hölder continuous, the previous construction coincide with the “usual" drift:
[TABLE]
We eventually mention that the previous explicit representation and properties of the drift could also be useful in order to derive numerical approximations for the SDE (16). We can, to this end, mention the recent work by De Angelis et al. [DGI19] who considered in the Brownian scalar case some related issues.
Organization of this paper. The paper is organized as follows. Section 2 is dedicated to the proof of Theorem 3 (well-posedness of the Martingale problem). To do so, we collect some material (i.e. intermediate results) on the PDE, which is used all along this work. These intermediate results are proved in Section 3 and allow, in turn, to prove Theorem 2. As mentioned before, Section 3 is dedicated to the PDE analysis. In Section 4, we mainly reconstruct the dynamics associated with the Martingale solution. Proof of Theorem 4 can be found therein. Having this dynamics at hand, we then investigate the weak solution of the formal SDE (1) and further properties of the obtained drift in Section 5. This last section thus contains the proofs of Theorem 6 and Proposition 7.
Notations. Throughout the document, we denote by some positive constants depending on the non-degeneracy constant in (UE) and on the set of parameters . The notation is used when the constants also depend in a non-decreasing way on time . Other possible dependencies are also explicitly indicated. We also sometimes shorten {\mathbb{L}}^{r}([0,T],{\mathbb{B}}_{p,q}^{-1+\gamma}{\color[rgb]{0,0,0}({\mathbb{R}}^{N},{\mathbb{R}}^{M})}), for in , with the notation or when there are no ambiguities.
2. Well-posedness of the Martingale problem : proof of Theorem 3
In this Section, we mainly prove Theorem 3. To do so, we need some additional material on the PDE, which is collected in Subsection 2.1 below. This material is of crucial importance in our work as it will be used to prove Theorems 2, 4, 6 and Proposition 7 as well. All these PDE results are proved in Section 3 below (see Subsection 3.4). The proof of Theorem 3 is then derived in Subsection 2.2.
2.1. The underlying PDE
As underlined in Definitions 1 and 2, it turns out that the well-posedness of the Martingale Problem associated with , , heavily relies on the construction of a suitable theory for the Cauchy problem (see Definition 1) for some data and belonging to some appropriate function spaces to be specified later on. We recall that, because of the scalar product therein, the aforementioned PDE is only stated formally. Only the mild formulation, given by (14), is licit thanks to Lemma 1.
Hence, as a key intermediate tool we need to introduce what we will later on call the mollified Cauchy problem. Namely, denoting by a sequence of smooth functions such that when , the mollified Cauchy problem reads as
[TABLE]
Remark 3* (Smooth approximating sequence of the drift).*
When , such a sequence can be obtained from functions (or in the Schwartz class in space), see e.g. Theorem 4.1.3 in [AH96]. When , one can approximate by sequence of functions in for any , if and if (taking e.g. their convolution with a gaussian kernel with variance ), see e.g. Definition 2.2 and Lemma 1.3 in [ABLM21] again for the spatial part. Up to an abuse of notation, one can denote by these indexes and still use the analysis done below.
We start with the following control which, in some sense, is the counterpart of Theorem 2 for the mollified Cauchy problem.
Proposition 8**.**
Assume that the parameters , and satisfy a good relation (GR). Let be smooth functions where has as well at most linear growth. Let denote the sequence of classical solutions of the mollified PDE (2.1) i.e. of \big{(}\mathscr{C}(F_{m},L^{\alpha},f,g,T)\big{)}_{m\geq 1}. Then, there exist positive constants C:=C(\|F\|_{{\mathbb{L}}^{r}({\mathbb{B}}_{p,q}^{{\color[rgb]{0,0,0}-1+\gamma}})}), C_{T}:=C(T,\|F\|_{{\mathbb{L}}^{r}({\mathbb{B}}_{p,q}^{{\color[rgb]{0,0,0}-1+\gamma}})}), depending on the known parameters and in (UE), s.t. for all :
[TABLE]
where can be chosen as small as desired, is a non-decreasing function and where, from the definition in (12), .
Moreover, the sequence converges toward the solution , in the sense of Definition 1, of the Cauchy problem uniformly on compact subsets of .
Remark 4*.*
Let us mention that, when the terminal condition is bounded, then the solution is itself bounded, i.e. .
Remark 5* (On the spatial smoothness of the mollified PDE).*
From the conditions on and the definition of in (12), we carefully point out that:
[TABLE]
This reflects the spatial smoothness of the underlying PDE. In particular, the condition provides a pointwise gradient estimate for the solution of the mollified PDE. This key condition rewrites: . It will be implied assuming that , since in this case .
Remark 6* (On the corresponding parabolic bootstrap).*
Observe that, when , we almost have a Schauder type result, namely in (24) and we end up with the corresponding parabolic bootstrap effect for both the solution of and , up to the small exponent which can be chosen arbitrarily small.
Remark 7* (About additional diffusion coefficients).*
It should be noted at this point that we are confident about the extension of the results to differential operators involving non-trivial diffusion coefficient, provided this latter is Hölder-continuous in space. Sketches of proofs in this direction are given in the Remark 15 in Subsection 3.4. However, we avoid investigating this direction for the sake of clarity and in order to focus on the more (unusual) drift component.
The following Proposition and Corollary provide a Zvonkin type theory for the mollified and formal Cauchy problem given respectively by \mathscr{C}(F,L^{\alpha},{\color[rgb]{0,0,0}-F^{k}_{m}},0,T) and \mathscr{C}(F,L^{\alpha},{\color[rgb]{0,0,0}-F^{k}},0,T), where for any in , denotes the component of and denotes its mollification, see Remark 3.
Proposition 9** (Zvonkin type theory for the mollified PDE).**
Let , and satisfy a good relation (GR) and let in . There exists a positive constant s.t. for each and all , the sequence of classical solutions of \big{(}\mathscr{C}(F_{m},L^{\alpha},F_{m}^{k},0,T)\big{)}_{m\geq 1} satisfies:
[TABLE]
where is a non-decreasing function and for which the two last lines in (24) hold as well. Moreover, for any in , the sequence converges towards the solution ,in the sense of Definition 1, of the Cauchy problem , uniformly on compact subsets of .
Corollary 10** (Zvonkin type theory for the formal PDE).**
Let . Under the above assumptions, the formal Cauchy problem admits a unique solution in the sense of Definition 1 which moreover satisfies that there exists a positive constant s.t.
[TABLE]
where is a non-decreasing function.
Remark 8*.*
Of course, in order to use the Zvonkin type theory to derive strong well-posedness in the multidimensional setting some controls of the second order derivatives are needed. This is what Krylov and Röckner did in [KR05] in the Sobolev setting. Let us also specify that, in connection with Theorem 6, in the scalar setting weak and strong uniqueness are somehow closer since, from the PDE viewpoint, they do not require to go up to second order derivatives. Indeed, the strategy is then to develop for two weak solutions of (16), a regularized version of , which somehow makes appear a kind of “local-time" term which is handled through the Hölder controls on the gradients (see the proof of Theorem 6-(ii) in Subsection 5.1-(ii) and e.g. Proposition 2.9 in [ABP18]), whereas in the multidimensional setting, for strong uniqueness, the second derivatives get in.
2.2. From PDE to SDE results: proof of Theorem 3
It is quite standard to derive well-posedness results for a probabilistic problem through PDE estimates. When the drift is a function, such a strategy goes back to e.g. Zvonkin [Zvo74] or Stroock and Varadhan [SV79]. This approach has been made quite systematic in the distributional setting by Delarue and Diel in [DD16] who provide a very robust framework. To investigate the meaning and well-posedness of (1), we adapt their procedure to the current setting. Points (i) to (iii) allow to derive the rigorous proof of Theorem 3 provided Proposition 8, Proposition 9, Corollary 10 and Theorem 2 hold.
(i) Tightness of the sequence of probability measure induced by the solution of the mollified SDE (1). Here, we consider the regular framework induced by the mollified PDE (2.1). Note that in this regularized framework, for any , the Martingale Problem associated with , , is well posed. We denote by the associated solution. Let us generically denote by the associated canonical process. Note that the underlying space where such a process is defined differs according to the values of : when the underlying space is while it is when . Recall that we denoted it by .
Let where each {\color[rgb]{0,0,0}u_{m}^{k}}, is the solution of the mollified Cauchy problem \mathscr{C}(F_{m},L^{\alpha},-F^{{\color[rgb]{0,0,0}k}}_{m},0,T). Let us define for any in and for any the process
[TABLE]
Note that this process makes sense for any , thanks to regularity estimates on from Proposition 9. Next, applying Itô’s formula to {\color[rgb]{0,0,0}(X_{r}^{m}+u_{m}(r,X_{r}^{m}))_{r\in[v,s]}} we obtain
[TABLE]
In order to prove that actually forms a tight sequence of probability measures on , whose limit is denoted by , it is sufficient to prove that there exists and such that {\mathbb{E}}^{{\mathbb{P}}^{{\color[rgb]{0,0,0}2}}_{{\color[rgb]{0,0,0}m}}}[|X^{m}_{s}-X^{m}_{v}|^{\tilde{p}}]\leq c|v-s|^{1+\eta} or {\mathbb{E}}^{{\mathbb{P}}^{\alpha}_{{\color[rgb]{0,0,0}m}}}[|X^{m}_{s}-X^{m}_{0}|^{\tilde{p}}]\leq cs^{\eta}, for thanks to the Kolmogorov (resp. Aldous) Criterion. We refer e.g. for the latter to Proposition 34.9 in Bass [Bas11]. Writing
[TABLE]
the result follows in small time, thanks to Proposition 9 (choosing in the pure jump setting).
(ii) Identification of the limit probability measure. Let us now prove that the limit, denoted by , is indeed a solution of the martingale problem associated with , , in the sense of Definition 2. Let and be a continuous function with gradient in {\mathbb{B}}_{{\color[rgb]{0,0,0}\infty,\infty}}^{\theta-1}({\mathbb{R}}^{d},{\mathbb{R}}^{d}) and let for any be the classical solution of the Cauchy problem .
Applying Itô’s formula for each we obtain that
[TABLE]
where is defined by (25). Thus, using the convergence result of to the solution of on every compact subsets of from Proposition 8 together with a uniform control of the moment of (which also follows from (26) and above conditions on ), we deduce that
[TABLE]
is a -martingale (square integrable when ) by letting the regularization procedure tend to infinity.
(iii) Uniqueness of the limit probability measure. We now come back to the canonical space , and let and be two solutions of the Martingale Problem associated with , . Thus, for all we have, for the solution of the Cauchy problem
[TABLE]
so that the marginal laws of the canonical process are the same under and . We extend the result on thanks to regular conditional probabilities, see Chapter 6.2 in [SV79] . Uniqueness then follows from Corollary 6.2.4 of [SV79]. The strong Markov property follows from Theorem 4.2 in [EK86].
3. PDE analysis
This part is dedicated to the proofs of Proposition 8, Proposition 9, Theorem 2 as well as Corollary 10 and Lemma 1. It is thus the core of this paper as these results allow to recover, specify and extend, most of the previous results on SDEs with distributional drifts discussed in the introduction. Especially, as they are handled, the proofs are essentially the same in the diffusive () and pure jump () setting as they only require heat kernel type estimates on the density of the associated underlying noise. We first start by introducing the mathematical tools in Subsection 3.1. Then, we provide a primer on the formal Cauchy problem by investigating the smoothing properties of the Green kernel associated with the stable noise in Subsection 3.2. Uniform (w.r.t. the mollification) estimates of the solution of the Cauchy problem are investigated in Subsection 3.3. Eventually, we derive in Subsection 3.4 the proofs of Proposition 8, Proposition 9, Theorem 2 as well as Corollary 10 and Lemma 1. We importantly point out that, from now on and in all the current section, we assume without loss of generality that .
3.1. Mathematical tools
In this part, we give the main mathematical tools needed to prove Proposition 8 and Theorem 2.
Heat kernel estimates for the density of the driving process
Under (UE), it is rather well known that the following properties hold for the density of . For the sake of completeness we provide a complete proof.
Lemma 11** (Bounds and Sensitivities for the stable density).**
There exists C:=C({\bf({\color[rgb]{0,0,0}UE})}) s.t. for all , , and :
[TABLE]
where \big{(}q_{\alpha}(t,\cdot)\big{)}_{t>0} is a family of probability densities on such that , , and for all , there exists a constant s.t.
[TABLE]
Remark 9*.*
From now on, for the family of stable densities \big{(}q(t,\cdot)\big{)}_{t>0}, we also use the notation , i.e. without any specified argument stands for the density q{\color[rgb]{0,0,0}(t,\cdot)} at time {\color[rgb]{0,0,0}t=1}.
Proof.
We focus here on the pure jump case . Indeed, for the density of the driving Brownian motion readily satisfies the controls of (28) with replaced by a suitable Gaussian density.
Let us recall that, for a given fixed , we can use an Itô-Lévy decomposition at the associated characteristic stable time scale for (i.e. the truncation is performed at the threshold ) to write where and are independent random variables. More precisely,
[TABLE]
where is the Poisson random measure associated with the process ; for the considered fixed , and correspond to the *small jumps part * and large jumps part respectively. A similar decomposition has been already used in [Wat07], [Szt10] and [HM16], [HMP19] (see in particular Lemma 4.3 therein). It is useful to note that the cutting threshold in (30) precisely yields for the considered that:
[TABLE]
To check the assertion about we start with
[TABLE]
(see [Sat99]). Changing variable we get that for any and this shows the assertion (similarly we get the statement for ). The density of then writes
[TABLE]
where corresponds to the density of and stands for the law of . From Lemma A.2 in [HMP19] (see as well Lemma B.1 in [HM16]), belongs to the Schwartz class and satisfies that for all and all , there exist constants s.t. for all :
[TABLE]
where is chosen in order that * be a probability density.*
We carefully point out that, to establish the indicated results, since we are led to consider potentially singular spherical measures, we only focus on integrability properties similarly to [HMP19] and not on pointwise density estimates as for instance in [HM16]. The main idea thus consists in exploiting (30), (32) and (33). The derivatives on which we want to obtain quantitative bounds will be expressed through derivatives of , which also give the corresponding time singularities. However, as for general stable processes, the integrability restrictions come from the large jumps (here ) and only depend on its index . A crucial point then consists in observing that the convolution actually corresponds to the density of the random variable
[TABLE]
(where has density and is independent of ; to have such decomposition one can define each on a product probability space). Then, the integrability properties of , and more generally of all random variables appearing below, come from those of and .
One can easily check that Hence
[TABLE]
By independence of and , using the Fourier transform, one can easily prove that
[TABLE]
Moreover, This shows that the density of verifies (29). The controls on the spatial derivatives are derived similarly using (33) for and the same previous argument. The bound for the time derivatives follow from the Kolmogorov equation and (32) using the fact that for all (see again Lemma 4.3 in [HMP19] for details). ∎
Thermic characterization of Besov norm
In the sequel, we will intensively use the thermic characterisation of Besov spaces, see e.g. Section 2.6.4 of Triebel [Tri83]. The thermic terminology comes from the fact that such a norm involves convolution with a suitable heat kernel.
In the following, we denote by the Schwartz class. For , (smooth function with compact support) s.t. we set where and respectively denote the Fourier transform of and the inverse Fourier transform of .
The thermic characterisation of Besov spaces we will use in the current work reads as follows: for \vartheta\in{\mathbb{R}},{\color[rgb]{0,0,0}m}\in(0,+\infty],{\color[rgb]{0,0,0}l}\in(0,\infty], {\mathbb{B}}_{{\color[rgb]{0,0,0}l},{\color[rgb]{0,0,0}m}}^{\vartheta}({\mathbb{R}}^{d},{\mathbb{R}}):=\{f\in{\mathcal{S}}^{\prime}({\mathbb{R}}^{d}):\|f\|_{{\mathcal{H}}_{{\color[rgb]{0,0,0}l},{\color[rgb]{0,0,0}m}}^{\vartheta},\tilde{\alpha}}<+\infty\}, for any , with
[TABLE]
where in the above definition ” stands for the spatial convolution.
The parameter is an integer s.t. and denotes the isotropic stable heat kernel on (or the gaussian heat kernel if ). In Section 2.6.4. of [Tri83], the thermic characterization is presented either with the Gaussian heat kernel () or with the Cauchy-Poisson kernel (). It actually turns out from the main characterization of Besov spaces, see Section 2.5.1 in [Tri83], that many kernels can actually be used. We chose the isotropic stable one, with stability index associated with the one of the driving noise in (1), i.e. , since it precisely allows to benefit from the stability by convolution when the underlying stable semi-group, around which we perform the Duhamel expansion (see (53), (43)), is involved. This is particularly well adapted to the computations in the proof of Lemma 12, see Appendix A. In particular satisfies the bounds of Lemma 11 and in that case the upper-bounding density can be specified. Namely, in that case (28) holds with .
In the following we call thermic part the second term in the right hand side of (3.1) denoted by \mathcal{T}_{{\color[rgb]{0,0,0}l},{\color[rgb]{0,0,0}m}}^{\vartheta}[f].
Importantly, it is well known that {\mathbb{B}}_{{\color[rgb]{0,0,0}l},{\color[rgb]{0,0,0}m}}^{\vartheta}({\mathbb{R}}^{d},{\mathbb{R}}) and {\mathbb{B}}_{{\color[rgb]{0,0,0}l}^{\prime},{\color[rgb]{0,0,0}m}^{\prime}}^{-\vartheta}({\mathbb{R}}^{d},{\mathbb{R}}) where {\color[rgb]{0,0,0}l}^{\prime},{\color[rgb]{0,0,0}m}^{\prime} are the conjugates of {\color[rgb]{0,0,0}l},{\color[rgb]{0,0,0}m} can be put in duality. Namely, see e.g. Theorem 4.1.3 in [AH96] or Proposition 3.6 in [LR02], for ({\color[rgb]{0,0,0}l},{\color[rgb]{0,0,0}m})\in{\color[rgb]{0,0,0}[}1,\infty]^{2} and for (f,g)\in{\mathbb{B}}_{l,{\color[rgb]{0,0,0}m}}^{\vartheta}({\mathbb{R}}^{d},{\mathbb{R}})\times{\mathbb{B}}_{{\color[rgb]{0,0,0}l}^{\prime},{\color[rgb]{0,0,0}m}^{\prime}}^{-\vartheta}({\mathbb{R}}^{d},{\mathbb{R}}) which are also functions:
[TABLE]
Remark 10*.*
As it will be clear in the following, the first part of the r.h.s. in (3.1) will be the easiest part to handle (in our case) and will give a negligible contributions. For that reason, we will only focus on the estimation of the thermic part of the Besov norm below. See Remark 18 in the proof of Lemma 12 in Appendix A for details.
Remark 11*.*
One may wonder why we chose to work with the Thermic characterization of Besov spaces instead of the one deriving from the dyadic Littlewood-Paley decomposition. Note first of all that such characterizations are equivalent, we can e.g. refer to Section 2.5.1 of [Tri83] (main Theorem) or Chapters 3 and 5 in [LR02]. The main point here is that we are led to handle a mild (or Duhamel) formulation of a parabolic PDE which itself involves convolutions of distributions by a heat kernel (associated with the driving noise). Such a framework hence naturally leads to consider the thermic characterization and to choose therein a heat kernel which is also compatible with the one of the driving noise. Let us mention that the Littlewood-Paley was in this same context of Schauder type estimates successfully used by Zhang and his co-authors, see e.g. [HWZ20] in the degenerate non-local kinetic case. The thermic approach seems more natural and direct to us for our goal.
Auxiliary estimates
We here provide some useful estimates whose proofs are postponed to Appendix A. We refer to the next Section 3.2 for a flavor of those proofs as well as for applications of such results.
Lemma 12**.**
Let . Assume that for all in the map is in for some \beta\in{\color[rgb]{0,0,0}(0,1]}. Define for any in , for all , the spatial operator by
[TABLE]
and let be the density of defined in (32). Then, there exists a constant such that for any in , any , all in , for all in
[TABLE]
where is the conjugate of . Also, for any in all in , for all in it holds that for all ,
[TABLE]
up to a modification of C{\color[rgb]{0,0,0}:=C({\bf(UE)},T,\beta^{\prime})}.
3.2. A primer on PDE the formal Cauchy problem :
reading almost optimal regularity through Green kernel estimates
Equation (13) can be, still formally, rewritten as
[TABLE]
viewing the first order term as a source (depending here on the solution itself). In order to understand what type of smoothing effects can be expected for rough source we first begin by investigating the smoothness of the following equation:
[TABLE]
The parallel with the initial problem (13), rewritten in (41), is rather clear. We will aim at applying the results obtained below for the solution of (3.2) to {\color[rgb]{0,0,0}\Phi}={\color[rgb]{0,0,0}f}-F\cdot Du (where the roughest part of the source will obviously be ).
Given a map {\color[rgb]{0,0,0}\Phi} in we now specifically concentrate on the gain of regularity which can be obtained through the fractional operator for the solution of (3.2) w.r.t. the data {\color[rgb]{0,0,0}\Phi}. Having a lot of parameters at hand, this will provide a primer to understand what could be, at best, attainable for the target PDE (41)-(13).
The solution of (3.2) corresponds to the Green kernel associated with {\color[rgb]{0,0,0}\Phi} defined as:
[TABLE]
Since to address the well-posedness of the martingale problem we are led to contol, in some sense, gradients, we will here try to do so for the Green kernel introduced in (43) solving the linear problem (3.2) with rough source. Namely for a multi-index , we want to control {\color[rgb]{0,0,0}D_{x}^{\eta}}G^{\alpha}{\color[rgb]{0,0,0}\Phi}(t,x)
Avoiding harmonic analysis techniques, which could in some sense allow to average non-integrable singularities, our approach allows to obtain almost optimal regularity thresholds that could be attainable on . Thanks to the Hölder inequality (in time) and the duality on Besov spaces (see equation (37)) we have that:
[TABLE]
where , and are the conjugate exponents of , and . Let us first focus, for on the thermic part of \|{\color[rgb]{0,0,0}D_{x}^{\eta}}p_{\alpha}(s-t,\cdot-x)\|_{{\mathbb{B}}_{p^{\prime},q^{\prime}}^{1-\gamma}}. We have with the notations of Section 3.1:
[TABLE]
In the above equation, we split the time interval into two parts. On the upper interval, for which there are no time singularities, we use directly convolution inequalities and the available controls for the derivatives of the heat kernel (see Lemma 11). On the lower interval we have to equilibrate the singularities in and use cancellation techniques involving the sensitivities of {\color[rgb]{0,0,0}D_{x}^{\eta}}p_{\alpha} (which again follow from Lemma 11).
Let us begin with the upper part (i.e. the second term in (3.2)). Using the convolution inequality, we have from Lemma 11:
[TABLE]
Indeed, we used for the second inequality that equation (28) and the self similarity of give:
[TABLE]
recalling that and , for the last inequality.
Hence, the map s\mapsto\mathcal{T}_{p^{\prime},q^{\prime}}^{1-\gamma}[{\color[rgb]{0,0,0}D_{x}^{\eta}}p_{\alpha}(s-t,\cdot-x)]|_{[(s-t),1]} belongs to as soon as
[TABLE]
We now focus on the lower part (i.e. the first term in (3.2)). Still from (28) (see again the proof of Lemma 4.3 in [HMP19] for details), one derives that there exists s.t. for all and all ,
[TABLE]
Indeed, (48) is direct if (off-diagonal regime). It suffices to exploit the bound (28) for {\color[rgb]{0,0,0}D_{x}^{\eta}}p_{\alpha}(s-t,y-x) and {\color[rgb]{0,0,0}D_{x}^{\eta}}p_{\alpha}(s-t,z-x) and to observe that \big{(}|z-y|/(s-t)^{1/\alpha}\big{)}^{\beta}\geq 1. If now (diagonal regime), it suffices to observe from (33) that, with the notations of the proof of Lemma 11 (see in particular (32)), for all :
[TABLE]
Therefore, in the diagonal case, (48) follows from (LABEL:MIN_JUMP_GREENPART) and (32) writing
[TABLE]
for all (exploiting again that for the last inequality). From (48) we now derive:
[TABLE]
From the convolution inequality and Lemma 11 (see also (46)) we thus obtain:
[TABLE]
Hence,
[TABLE]
provided for the second inequality (which can be assumed since we can choose arbitrarily in ). The map s\mapsto\mathcal{T}_{p^{\prime},q^{\prime}}^{1-\gamma}[{\color[rgb]{0,0,0}D_{x}^{\eta}}p_{\alpha}(s-t,\cdot-x)]|_{[0,(s-t)]} hence belongs to under the same previous condition on than in (47). Let us eventually mention that the above arguments somehow provide the lines of the proof of Lemma 12 for . The proof in its whole generality is provided in Appendix A.
Remark 12* (Pointwise gradient estimate on ).*
The condition in (47) then precisely gives that the gradient of the Green kernel will exist pointwise (with uniform bound depending on the Besov norm of {\color[rgb]{0,0,0}\Phi}) as soon as:
[TABLE]
In particular, provided (52) holds, the same type of arguments would also lead to a Hölder control of the gradient in space of index . The previous computations somehow provide the almost optimal regularity that could be attainable for (through what can be derived from solving (3.2)). The purpose of the next section will precisely be to prove that these arguments can be adapted to that framework. The price to pay will be some additional constraint on the because we will precisely have to handle the product on the way.
Remark 13* (On the second integrability parameter “” in the Besov norm).*
Eventually, we emphasize that the parameter does not play a key role in the previous analysis. Indeed, none of the thresholds appearing depend on this parameter. Since for all we have that for all that the above analysis suggests that it could be enough to consider the case . Nevertheless, as it does not provide any additional difficulties, we let the parameter vary in the following.
3.3. Uniform estimates of the solution of the Cauchy problem
and associated (uniform) Hölder controls
It is known that, under (UE) and for if g\in{\mathbb{B}}_{\infty,\infty}^{\vartheta}{\color[rgb]{0,0,0}({\mathbb{R}}^{d},{\mathbb{R}})} is also bounded and , for any there exists a unique classical solution to the Cauchy problem . This is indeed the usual Schauder estimates for sub-critical stable operators (see e.g. Priola [Pri12] or Mikulevicius and Pragarauskas who also address the case of a multiplicative noise [MP14]). It is clear that the following Duhamel representation formula holds for . With the notations of (8):
[TABLE]
where the Green kernel is defined by (43) and where the remainder term is defined as follows:
[TABLE]
It is plain to check that, if we now relax the boundedness assumption on , supposing it can have linear growth, there exists such that
[TABLE]
We also refer to the section concerning the smoothness in time below for specific arguments related to a terminal condition with linear growth.
In the following, we will extend the previous bounds in order to consider singular sources as well. In order to keep the notations as clear as possible, we drop the superscript associated with the mollifying procedure for the rest of the section. Note also that the following analysis will allow us to drop the above condition , i.e. the above Duhamel representation holds in our setting with .
(i) Gradient bound. Let us first control the terminal condition. We have, integrating by parts and using usual cancelation arguments,
[TABLE]
We now turn to control the Green kernel part. Write
[TABLE]
From the very definition (12) of we have and . We can thus apply Lemma 12 (see eq. (39) with , , and therein) to obtain
[TABLE]
Recalling , we thus obtain
[TABLE]
Let us now focus on first gradient estimate of . Using the Hölder inequality and then Besov duality we have,
[TABLE]
so that the main issue consists in establishing the required control on the map for any in . Note that since for all in the map y\mapsto{\color[rgb]{0,0,0}u(s,y)} is in for any , we have in particular from the very definition of (see eq. (12)) and assumptions on that there exists such that , and for all in the map is in . One can hence apply Lemma 12 so that (see eq. (39) with , and therein)
[TABLE]
This map hence belongs to as soon as
[TABLE]
which follows from the assumptions on . We then obtain, after taking the norm of the above estimate, that
[TABLE]
recalling from (12) that .
(ii) Hölder norm of the gradient. As in the above proof we obtain gradient bounds depending on the spatial Hölder norm of , we now have to precisely estimate this quantity. The main difficulty is induced by the remainder term:
[TABLE]
using again the Hölder inequality and duality between the considered Besov spaces (see Section 3.1). Hence, the main issue consists in establishing the required control on the map
[TABLE]
for any in . Since , one can again apply Lemma 12 so that (see eq. (40) with , , and therein):
[TABLE]
The above map hence belongs to as soon as
[TABLE]
which readily follows from the very definition of (see eq. (12)) and the fact that . We then obtain
[TABLE]
Remark 14*.*
Note that assuming that is fixed, we readily obtain from (60) together with the constraint the initial constraint
[TABLE]
In comparison with the threshold obtained when investigating the smoothing effect of the Green kernel (see eq. (52) and the related discussion) this additional regularity allows to handle the product , in the sense that it allows to give a meaning to this product as a distribution. Indeed, as suggested by Lemma 12 (replacing therein the heat kernel by a smooth test function), a sufficient condition to define the product is to obtain estimate on the Hölder modulus of the map of order for some (see (39)). The threshold (62) precisely reflects that constraint and appears as the price to pay to define such a modulus.
Let us eventually estimate the Hölder moduli of the gradients of the first and second terms in the Duhamel representation (53). We first note that, for the Green kernel, the proof follows from the above lines. When doing so, we obtain that
[TABLE]
Concerning the terminal condition, we have on the one hand, when (off-diagonal regime), that:
[TABLE]
recalling that is a density for the first inequality. On the other hand, when (diagonal regime), we have using cancellations arguments
[TABLE]
Hence
[TABLE]
Putting together estimates (55), (56), (59), (61), (63) and (64) we deduce that
[TABLE]
In particular, when , when tends to [math].
(iii) Smoothness in time for and . We restart here from the Duhamel representation (53). Namely,
[TABLE]
where from (54), the remainder term writes:
[TABLE]
We now want to control for a fixed and the difference:
[TABLE]
For the first term in the r.h.s. of (66) we write:
[TABLE]
From the Fubini’s theorem and usual cancellation arguments we get:
[TABLE]
We indeed recall that, because of the symmetry of the driving process , and since , one has for all , . Recalling as well that we assumed , we therefore derive from Lemma 11:
[TABLE]
recalling from (12) that for the last inequality. Observe now that since , one has for all . Hence,
[TABLE]
which is the expected control. We now focus on the remainder term since the control of the Green kernel is easier and can be derived following the same lines of reasoning. Write
[TABLE]
From Lemma 12 (see eq. (39) with and ) it can be deduced (see computations in point (i) of the current section) that
[TABLE]
Let us now focus on
[TABLE]
We have
[TABLE]
Applying Lemma 12 (see eq. (39) with and therein), we get:
[TABLE]
Thus, from (3.3) (recall from (12) that ):
[TABLE]
Therefore, from (72) and (71), we derive:
[TABLE]
which in turn, plugged into (LABEL:DEV_SENSI_EN_TEMPS), gives:
[TABLE]
From (73), (69) and (68) we thus obtain:
[TABLE]
The Hölder control of the Green kernel follows from similar arguments. Indeed, repeating the above proof it is plain to check that there exists s.t. for all , :
[TABLE]
The final control of (24) concerning the smoothness in time then follows plugging (67), (74) and (75) into (66). The control concerning the time sensitivity of the spatial gradient would be obtained following the same lines.
3.4. Proofs of Theorem 2, Proposition 8, Proposition 9, Corollary 10 and Lemma 1
Points (i) to (iii) conclude the proof of Proposition 8 up to the convergence assertion. Let us notice that the previous analysis allows to obtain Proposition 9 up to this assertion as well. Indeed, in such a case, the map is the coordinate of and should thus be estimated in term of its norm. The associate control can be obtained following exactly the same strategy as the one we used to handle the remainder term in the Duhamel representation (53) except that we do not need to deal with the additional gradient of the solution. The fact that the constant therein are decreasing w.r.t. time follows from the fact that , see (3.3).
Eventually, end of the proof of Proposition 8 and the proof of Theorem 2 follows from compactness arguments together with the Schauder like control of Propositions 8 and the previous analysis. Uniqueness follows from the Schauder like control of Propositions 8 as well, as the underlying PDE is linear. The representation (14) holds through similar computations. Corollary 10 is derived in the same way through Proposition 9, whose proof is concluded following the same lines as for Proposition 8.
Lemma 1 follows from the above calculations as well. The fact that the product makes sense is an easy consequence of Lemma 12 (see estimate (40) with the heat kernel therein replaced by a smooth test function) together with the regularity of .
Remark 15* (About additional diffusion coefficients).*
Let us first explain how, in the diffusive setting, the diffusion coefficient can be handled. Namely, this would lead to consider for the PDE with mollified coefficients an additional term in the Duhamel formulation that would write:
[TABLE]
for an auxiliary parameter which will be taken equal to after potential differentiations in (76). Here, P_{{\color[rgb]{0,0,0}s-t}}^{\alpha,\xi,m} denotes the two-parameter semi-group associated with \big{(}\frac{1}{2}{\rm Tr}\big{(}a_{m}(v,\xi)D^{2}\big{)}\big{)}_{v\in[s,t]} (mollified diffusion coefficient frozen at point ). Let us focus on the second order term. Recall from the above proof of Proposition 8 that we aim at estimating the gradient pointwise, deriving as well some Hölder continuity for it. Hence, focusing on the additional term, we write for the gradient part:
[TABLE]
From the previous Proposition 8, we aim at establishing that has Hölder index and therefore . Assume for a while that . The goal is now to bound the above term through Besov duality. Namely, taking after having taken the gradient w.r.t. for the heat kernel, we get:
[TABLE]
Now, in the considered case . Recalling that for any for and small enough, we will indeed have that D_{x}p_{\alpha}^{\xi,m}(t,s,x,\cdot)\big{(}(a_{m,i,j}(s,\cdot)-a_{m,i,j}(s,\xi))\in{\mathbb{B}}_{1,1}^{2+\varepsilon-\theta} provided the bounded function itself has the same regularity, i.e. , the integrability of the product deriving from the one of the heat kernel. Since \|{\color[rgb]{0,0,0}\partial_{i,j}^{2}}u_{m}(s,\cdot)\|_{{\mathbb{B}}_{\infty,\infty}^{\theta-2-\varepsilon}}\leq C\|Du_{m}(s,\cdot)\|_{{\mathbb{B}}_{\infty,\infty}^{\theta-1-\varepsilon}}, see e.g. Triebel [Tri83], this roughly means that, the same Schauder estimate should hold with a diffusion coefficient . Similar thresholds also appear more generally in [ZZ17]. The general diffusive case for and satisfying the conditions of Theorem 3 can be handled similarly through duality arguments.
For the pure jump case, we illustrate for simplicity what happens if the diffusion coefficient is scalar. Namely, when L^{\alpha,\sigma}\varphi(x)={\rm p.v.}\int_{{\mathbb{R}}^{d}}\big{(}\varphi(x+\sigma(x)z)-\varphi(x)\big{)}\nu(dz)=-\sigma^{\alpha}(x)(-\Delta)^{\alpha/2}\varphi(x), where is a non-degenerate diffusion coefficient. Introducing L^{\alpha,\sigma,\xi}\varphi(x)={\rm p.v.}\int_{{\mathbb{R}}^{d}}\big{(}\varphi(x+\sigma(\xi)z)-\varphi(x)\big{)} , we rewrite for the Duhamel formula, similarly to (76):
[TABLE]
Focusing again on the non-local term, we write for the gradient part:
[TABLE]
Consider again the case . Since , we thus have that , where . Still by duality one has to control the norm of the term \big{(}\sigma_{m}^{\alpha}(s,y)-\sigma_{m}^{\alpha}(s,\xi)\big{)} in the Besov space . Since and , this will be the case provided the coefficient for small enough observing that 1-\gamma+\varepsilon{\color[rgb]{0,0,0}<}(\alpha-1)/2.
Note that, in comparison with the result obtained in [LZ19], the above threshold is precisely the one appearing in [LZ19] in this specific case. The general matrix case for is more involved. It requires in [LZ19] the Bony decomposition. We believe it could also be treated through the duality approach considered here but postpone this discussion to further research. In the scalar case, the analysis for general as in Theorem 3 could be performed similarly.
4. Dynamics of the formal SDE (1)
In this part, we aim at proving Theorem 4 and Corollary 5. We restrict here to the pure jump case , since the diffusive one was already considered in [DD16]. We adapt here their procedure to the current framework.
In subsection 4.1, we first recover the noise through an enlarged martingale problem (point (i) of Proposition 13 below), then recover a drift as the difference between the Martingale solution and the noise obtained before and estimate its contribution (point (ii) of Proposition 13 below). With this contribution at hand, we show that the drift decomposes as a principal part plus a remainder which has a negligible contribution (point (iii) of Proposition 13 below). Then, we recall in Subsection 4.2 how the general construction of the stochastic Young integral from [DD16] translates in our setting. Eventually, we derive in subsection 4.3 the dynamics associated with the solution of the Martingale Problem and define the class of processes to which an associated Itô’s formula holds. This last part thus conclude the proof of Theorem 4 and Corollary 5.
4.1. Shape of the drift
Proposition 13**.**
Let . For any initial point , one can find a probability measure on s.t. the canonical process satisfies the following properties:
- (i)
Under , the law of is a solution of the Martingale Problem associated with data (, and the law of corresponds to the one of a -dimensional stable process with generator .
- (ii)
For any , there exists a constant s.t. for any :
[TABLE]
- (iii)
Let ({\mathcal{F}}_{v})_{v\geq 0}:=\big{(}\sigma((X_{w},\mathcal{W}_{w})_{0\leq w\leq v})\big{)}_{v\geq 0} denote the filtration generated by the couple . For any , it holds that:
[TABLE]
*with , where is the mild solution of the Cauchy problem *(note that the dependence of on is precisely through the Cauchy problem ).
Furthermore, the following decomposition holds:
[TABLE]
Remark 16*.*
The above proposition gives a first information on the shape of the drift. Indeed, using the decomposition
[TABLE]
one can see that the infinitesimal increment of the canonical process involves a drift part (first term in the above r.h.s.) and a martingale part (second and third terms in the above r.h.s.). The main point being now that
[TABLE]
meaning that only the first term matters i.e. the infinitesimal dynamics involves, as a drift, the mollified version of the initial one along the density of the driving noise.
Proof.
- (i)
Coming back to point (i) in Section 2.2 we have that the couple \big{(}(X_{t}^{m},\mathcal{W}_{t}^{m})_{t\in[0,T]}\big{)}_{m\geq 0} is tight (pay attention that the stable noise feels the mollifying procedure as it is obtained through solvability of the Martingale Problem) so that it converges, along a subsequence, to the couple .
(ii)
Let . Let where each , in is chosen as the solution the Cauchy problem where is the coordinate of . We have
[TABLE]
Let us now notice that
[TABLE]
so that, from Itô’s formula
[TABLE]
From the smoothness properties of established in Proposition 8 (in particular and the gradient is uniformly bounded) we have
[TABLE]
recalling that for all in , so that , and using estimate (24). Note that \big{(}\mathcal{M}_{S}^{m}(v,s)\big{)}_{0\leq v<s\leq T} and \big{(}\mathcal{M}_{L}^{m}(v,s)\big{)}_{0\leq v<s\leq T} are respectively and martingales associated respectively with the “small” and “large” jumps. Let us first handle the “large” jumps. We have by the Burkholder-Davies-Gundy (BDG) inequality that
[TABLE]
where denotes the corresponding bracket given by the expression . Using the linear growth of w.r.t. its third variable (uniformly w.r.t. the second one) from (81) together with the fact that we obtain
[TABLE]
We then readily get from the compensation formula that
[TABLE]
We now deal with the “small” jumps and split them w.r.t. their characteristic scale writing
[TABLE]
In the off-diagonal regime (namely for ), we do not face any integrability problem w.r.t. the Lévy measure. The main idea consists then in using first the BDG inequality, then the compensation formula and (81), and eventually usual convexity arguments together with the compensation formula again to obtain
[TABLE]
In the diagonal regime (i.e. for ) we use the BDG inequality and (81) to recover integrability w.r.t. the Lévy measure and then use the additional integrability to obtain better estimate. Namely:
[TABLE]
Using the above estimates on the -moments of , and the statement follows passing to the limit in , thanks to Proposition 8.
(iii)
Letting ({\mathcal{F}}_{v}^{m})_{v\geq 0}:=\big{(}\sigma((X_{w}^{m},\mathcal{W}_{w}^{m})_{0\leq w\leq v})\big{)}_{v\geq 0}, restarting from ((ii)) and taking the conditional expectation w.r.t. yields
[TABLE]
Passing to the (weak) limit in , it can be deduced that from Proposition 8 that
[TABLE]
where is the mild solution of . From the mild definition of in Theorem 2 we obtain that for all :
[TABLE]
integrating by parts to derive the last inequality. We thus get:
[TABLE]
where we have again plugged the mild formulation of from (14). Let us first prove that the first term in the above has the right order. Thanks to Lemma 12 (with and therein) we obtain that:
[TABLE]
Let us now prove that the second term in the r.h.s. of (82) is a negligible perturbation. Setting
[TABLE]
we write:
[TABLE]
We thus have the following estimate:
[TABLE]
Let us now consider the thermic part of , which can be split again into a lower and an upper part, as in (3.2). We first deal with the upper part and then with the lower one. With the same previous notations333Pay attention that, in order to absorb some singularities we cannot here directly appeal to Lemma 12 but simply exploit some {\color[rgb]{0,0,0}{\mathbb{L}}}^{\infty} of D{\color[rgb]{0,0,0}\mathfrak{r}}(t,\cdot) in terms of .:
[TABLE]
using (59) and (46) for the last inequality. Hence,
[TABLE]
Observe that, for this term to be a remainder on small time intervals, we need:
[TABLE]
Recalling the definition of in (12), we obtain the condition:
[TABLE]
This stronger condition appears only in the case where one is interested in expliciting exactly the dynamics in terms of a drift which actually writes as the mollified version of the initial one along the density of the driving noise (regularizing kernel). Note that if one chooses to work in a bounded setting, i.e. for , (87) again corresponds to the condition appearing in Theorem 3.
Let us now deal with the lower part of the thermic characterization. Using a cancellation argument, restarting from (61) and (28), exploiting as well (3.3) and (48), we get for :
[TABLE]
recalling also (59) for the last inequality and denoting by the homogeneous Besov norm (Hölder modulus of order ). Hence:
[TABLE]
which precisely gives a contribution homogeneous to the one of (86).We eventually derive that, under the condition (87), the remainder in (84) is s.t. there exists for which
[TABLE]
∎
4.2. The non-linear stochastic Young integral
Having derived the shape of the drift, let us try to sum up how such a construction can be adapted in our setting. As in Section 4.4.1 of [DD16], we introduce in a generic way the process as for any , or (ii)A(t,{\color[rgb]{0,0,0}t+h})=\mathcal{W}_{t+h}-\mathcal{W}_{t} or (iii)A(t,{\color[rgb]{0,0,0}t+h})=\mathfrak{f}(t,X_{t},h). We then claim that the following estimates hold: there exists , such that for any there exists a constant such that
[TABLE]
Then, we aim at defining for any the stochastic integral , for processes in (see (18) for the definition) with such that , for any {\color[rgb]{0,0,0}1\leq}\ell<\alpha and , as an limit of the associated Riemann sum: for
[TABLE]
which justifies the fact that such an integral is called stochastic-Young integral by the Authors. To do so, the main idea in [DD16] consists in splitting the process as the sum of a drift and a martingale:
[TABLE]
and define -stochastic-Young integral w.r.t. each of these terms. We then have
Theorem 14** (Theorem 16 of [DD16]).**
There exists such that, given two subdivisions of , such that ,
[TABLE]
where denotes the step size of the subdivision and with \eta=\min\{{\color[rgb]{0,0,0}\varepsilon_{0}}-\varepsilon_{2},\varepsilon_{1},\varepsilon_{1}^{\prime}\}.
Proof.
The main point consists in noticing that the proof in [DD16] remains valid in our setting (for parameter therein) and that the only difference is the possible presence of jumps. To handle that, the key idea is then to split the martingale part (which in our current framework may involve jumps) into two parts: an -martingale (which includes the compensated small jumps) and an {\mathbb{L}}^{{\color[rgb]{0,0,0}\ell}}-martingale (which includes the compensated large jumps). The first part can be handled using the BDG inequality (and this is what is done in [DD16]) and the other part by using the compensation formula (such a strategy is somehow classical in the pure-jump setting and has been implemented to prove point (ii) in Proposition 13 above). ∎
Thus, we obtain that for any fixed in we are able to define an additive (on ) integral . The main point consists now in giving a meaning on this quantity as a process (i.e. that all the time integrals can be defined simultaneously). In the current pure-jump setting, we rely on the Aldous criterion, whereas in the diffusive framework of [DD16], the Kolmogorov continuity criterion was used. Thanks to Theorem 14, one has
[TABLE]
so that one can apply Proposition 34.9 in Bass [Bas11] and Proposition 4.8.2 in Kolokoltsov [Kol11] to the sequence \big{(}\int_{0}^{t}\psi_{s}A(s,s+ds)\big{)}_{s\leq t} and deduce that the limit is stochastically continuous.
4.3. Building the dynamics: Proofs of Theorem 4 and Corollary 5
We here follow Section 4.6 of [DD16]. Let us first emphasize that (4.2) hold in the case and mentioned above from Proposition 13 and Theorem 2 (the two last estimates are equals to [math], since the process is additive).
we can thus define the process \big{(}\int_{0}^{t}\psi_{s}dX_{s}\big{)}_{0\leq t\leq T} for any progressively measurable in (see (18)), , with . Setting
[TABLE]
the construction of the stochastic Young integral sketched above (see as well subsections 4.4 and 4.5 of [DD16]) allows to define as well the processes \big{(}\int_{0}^{t}\psi_{s}R(s,s+ds)\big{)}_{0\leq t\leq T} and \big{(}\int_{0}^{t}\psi_{s}M(s,s+ds)\big{)}_{0\leq t\leq T} and the following relation holds
[TABLE]
Thanks to Proposition 13 we have that, actually \big{(}\int_{0}^{t}\psi_{s}R(s,s+ds)\big{)}_{0\leq t\leq T}=\big{(}\int_{0}^{t}\psi_{s}\mathfrak{f}(s,X_{s},ds)\big{)}_{0\leq t\leq T}, so that the r.h.s. is well defined. Also, we have that \big{(}\int_{0}^{t}\psi_{s}(R(s,s+ds)-\mathscr{F}(s,X_{s},ds))\big{)}_{0\leq t\leq T}=\big{(}\int_{0}^{t}\psi_{s}\mathscr{R}(s,X_{s},ds)\big{)}_{0\leq t\leq T} is well defined and is null since the bound appearing in the increment of the l.h.s. is greater than one. Hence,
[TABLE]
On the other hand, we have that \big{(}\int_{0}^{t}\psi_{s}M(s,s+ds)\big{)}_{0\leq t\leq T} is well defined as well and that \big{(}\int_{0}^{t}\psi_{s}M(s,s+ds)-d\mathcal{W}_{t}\big{)}_{0\leq t\leq T}=\big{(}\int_{0}^{t}\psi_{s}\hat{M}(s,s+ds)\big{)}_{0\leq t\leq T} where
[TABLE]
is an martingale with moment bounded by so that it is null as well, meaning that when reconstructing the drift as above, we indeed get that only the “original” noise part in the dynamics matters. In other words, for any in , with ,
[TABLE]
5. Weak formulation and further properties of the drift
5.1. Weak solutions
In this part, we mainly prove Theorem 6. Note first that the existence of a weak solution is a consequence of Theorem 4. It thus only remain to prove weak uniqueness for any to prove Theorem 6-(i) (see the corresponding point below) and pathwise uniqueness for to prove Theorem 6-(ii) (see the corresponding point below as well). In any cases, we will need to expand a weak solution along the sequence of classical solution of the Cauchy problem , where is a smooth approximation of in the sense of Remark 3 and for some smooth functions through Itô’s formula. We are therefore led to check whenever the stochastic integrals \big{(}\int_{0}^{t}Du_{m}(s,Y_{s})dY_{s}\big{)}_{0\leq t\leq T} can be defined as an -stochastic Young integral in the sense of Definition 3. This is the purpose of the next two lemmas. In the first one, we prove that one may define a stochastic calculus w.r.t. the weak solution (i.e. w.r.t. quantities in (20)), proving thus the last assertion of Theorem 6. In the second one, we prove that one can expand the solution of the mollified PDE from Proposition 8 along the weak solution through Itô’s formula.
Lemma 15**.**
Assume that the parameters and satisfy a good relation for the dynamics (GR-D) Then, the processes \big{(}\int_{0}^{t}\psi_{s}dY_{s}\big{)}_{0\leq t\leq T} and \big{(}\int_{0}^{t}\psi_{s}\mathscr{F}(s,Y_{s},ds)\big{)}_{0\leq t\leq T} are well defined for any progressively measurable process in for all and \mathfrak{q}^{\prime}\in\big{(}[\alpha/(\alpha-1)],\infty\big{]}.
Proof.
Starting from the very definition (see Definition 4) of a weak solution , we readily get from (83) that
[TABLE]
From the construction in Subsection 4.2, this implies in turn that we can define the process \big{(}\int_{0}^{t}\psi_{s}dY_{s}\big{)}_{0\leq t\leq T} and thus the process \big{(}\int_{0}^{t}\psi_{s}\mathscr{F}(s,Y_{s},ds)\big{)}_{0\leq t\leq T} by the methodology of Subsection 4.3, for any progressively measurable process in for any and such that with . ∎
Lemma 16**.**
Assume that the parameters and satisfy a good relation for the dynamics (GR-D) and let be a weak solutions of (1) in the sense of Definition 4. Then,
[TABLE]
where denotes the solution of the Cauchy problem with a smooth approximation of in the sense of Remark 3 and are smooth functions, are well defined as -stochastic Young integral in the sense of Definition 3.
Proof.
Thanks to the previous lemma it remains to check that there exists in \big{(}0,[(\theta-1)/\alpha]\big{)} and in \big{(}[\alpha/(\alpha-1)],\infty\big{]} such that belongs to . From (96), we deduce
[TABLE]
The point is now to notice that, from Proposition 8, we have
[TABLE]
for any . Set now where we recall that \varepsilon_{2}\in\big{(}0,[(\theta-1)/\alpha]\big{)}. Thus, if
[TABLE]
we can use (97) together with previous estimate to obtain
[TABLE]
from which we deduce, thanks to Lemma 15, that both processes
[TABLE]
are well defined. It thus now remains to prove and to conclude the proof.
From the very definition of , rewrites . Hence, a sufficient condition for to hold is to prove that there exists \tilde{\rho}\in\big{(}0,2[(\theta-1)/\alpha]\big{)} such that (and thus to choose ) . Notice now from the very definition of in (12) and the good relation for the dynamics (GR-D) that we have . Hence, for any choice of the parameters satisfying (GR-D), one can find such a . More precisely, for any choice of the parameters satisfying (GR-D), there exists such that implies that holds. This allows to choose . It remains to check wether such a choice allows to obtain \mathfrak{q}^{\prime}\in\big{(}[\alpha/(\alpha-1)],\infty\big{]} so that holds. Choose and let , then, holds. Indeed, we have and . As we obtain , which concludes the claim. ∎
(i) Weak uniqueness in any dimension: proof of point (i) of Theorem 6. Having this result at hand, one can now expand any weak solution of the formal SDE (1) along the solution of the Cauchy problem through Itô’s formula for any smooth to obtain for any in ,
[TABLE]
where M_{{\color[rgb]{0,0,0}0},t}(\alpha,u_{m},Y), defined by (25) (up to the substitution of by therein), is a true martingale thanks to Proposition 8. From Proposition 7 and again Proposition 8 one can pass to the (weak) limit in (99) to get that for any in ,
[TABLE]
where is the solution of the Cauchy problem \mathscr{C}({\color[rgb]{0,0,0}F},L^{\alpha},g,f,T) with with , where is given by (12) and M_{{\color[rgb]{0,0,0}0},t}(\alpha,u,Y) is again a true martingale.
Taking now the expectation, one gets that for any in ,
[TABLE]
Choosing and we obtain that the left hand side does not depend on the specific choice of , so that uniqueness in law follows. This concludes the proof of point (i) in Theorem 6.∎
Remark 17*.*
Observe that the right hand side of (100) is a -martingale and one can use usual arguments to build a probability measure on the associated canonical space from and for which finite dimensional marginal coincide. Hence, the Martingale formulation, in the sense of Definition 2, holds. In other words, existence of a weak solution implies the existence of a Martingale solution. Also, as a consequence of the previous arguments, it is plain to check that uniqueness of the Martingale solution implies weak uniqueness.
(ii) Pathwise uniqueness in dimension one: proof of point (ii) of Theorem 6. The aim of this part is to prove Theorem 6-(ii), adapting to this end the proof of Proposition 2.9 in [ABM20] to our current inhomogeneous and parabolic (for the auxiliary PDE concerned) framework. Let us consider and two weak solutions of the formal SDE (1) in the sense of Definition 4. Let also be the solution of the Cauchy problem . Thanks to Lemma 16, one can apply Itô’s formula on , to obtain for any in the two corresponding Itô-Zvonkin transforms
[TABLE]
where is as in (25) with instead of therein and .
We point out that we here use the mollified PDE, keeping therefore the remainder term and dependence in for the martingale part. Of course, we will have to control the remainders, which is precisely possible from Proposition 7. From now on, we assume that . The case is indeed easier and can be handled following the arguments below.
As a starting point, we now expand, with
[TABLE]
the smooth approximation of . For fixed , thanks to Lemma 16, we can apply Itô’s formula to obtain:
[TABLE]
recalling that , using the definition of and denoting for all ({\color[rgb]{0,0,0}s},x_{1},x_{2},r)\in{\color[rgb]{0,0,0}[0,t]\times}{\mathbb{R}}^{3}:
[TABLE]
The point is now to take the expectations in (101). Since is a martingale, we then readily get . On the other hand, since , we also have from Proposition 7 that:
[TABLE]
It now remains to handle the compensator terms. For the large jumps, we readily write:
[TABLE]
observing that |h_{m}({\color[rgb]{0,0,0}s},x_{1},x_{2},r)|\leq 2\|Du_{m}\|_{{\mathbb{L}}^{\infty}({\mathbb{L}}^{\infty})}|x_{1}-x_{2}|. Also, from Proposition 9, uniformly in (as the terminal condition of the PDE is 0). In particular, for small enough one has and
[TABLE]
Hence,
[TABLE]
Therefore, if , we have for any either that X_{s}^{Z,m,1}-X_{s}^{Z,m,2}+h_{m}({\color[rgb]{0,0,0}s},X_{s}^{1},X_{s}^{2},r)\geq 1/n if {\color[rgb]{0,0,0}X}_{s}^{Z,m,1}-X_{s}^{Z,m,2}\geq 3/n, or X_{s}^{Z,m,1}-X_{s}^{Z,m,2}+h_{m}({\color[rgb]{0,0,0}s},X_{s}^{1},X_{s}^{2},r)\leq-1/n if {\color[rgb]{0,0,0}X}_{s}^{Z,m,1}-X_{s}^{Z,m,2}\leq-3/n. It is thus readily seen that \psi_{n}(X_{s}^{Z,m,1}-X_{s}^{Z,m,2},h_{m}({\color[rgb]{0,0,0}s},X_{s}^{1},X_{s}^{2},r))=0. We thus have:
[TABLE]
using for the last inequality the definition of which gives that there exists s.t. for all , . We now use the definition of and the smoothness of in order to balance the explosive contribution in and to keep an exponent of which allows to integrate the small jumps. From (102) and usual interpolation techniques (see e.g. Lemma 5.5 in [ABM20] or Lemma 4.1 in [Pri12]) we get:
[TABLE]
. The point is now to apply the above identity with large enough in order to get rid of the explosive term in (LABEL:BD_EXPLO) (i.e. {\color[rgb]{0,0,0}\eta_{1}}>1/2) and with sufficiently large in order to guarantee the integrability of the Lévy measure (i.e. {\color[rgb]{0,0,0}\eta_{2}>\alpha/2}). From the very definition of , see (12), the constraint is satisfied as soon as , which is precisely the condition ensured when the parameters satisfy a good relation for the dynamics, see (GR-D). In such a case, for any choice of the parameters , one can find such that , and .
Hence,
[TABLE]
using (105) and the definition of for the last but one inequality. Plugging (108), (104) into (101) (taking therein the expectations) and recalling that , eventually yields:
[TABLE]
Passing to the limit, first in recalling that {\mathbb{E}}[|{\color[rgb]{0,0,0}\Delta\mathcal{R}}_{0,t}^{m,n}|]\underset{m}{\rightarrow}0 uniformly in , gives (from the smoothness properties of in Proposition 8, see also point (ii) in Section 2.2):
[TABLE]
Take now the limit in and write from (105) (which also holds replacing by ):
[TABLE]
which readily gives from the Gronwall Lemma This concludes the proof for small enough. One may then iterate the argument on small time intervals to extend the result for any arbitrary and then on the whole positive real line.∎
5.2. Further properties of the drift
We here give the proof of Proposition 7. For both the martingale and weak solution, proof of point (i) is obvious from the definition of the dynamics. Point (ii) follows from applying Itô’s formula on the the sequence of classical solution of the Cauchy problem , where is a smooth approximation of in the sense of Remark 3, along the Martingale or the weak solutions, which is licit from Lemma 16 (note that this Lemma only use the very definition of the dynamics, so that it holds for the Martingale solution as well). We can then conclude this point by passing to the limit in together with Proposition 9. We now prove points (iii) and (iv).
Let satisfying
[TABLE]
We aim at proving that, for either the Martingale or the weak solutions one has for all in ,
[TABLE]
with in the case of a weak solution (while being not recalling, this last fact will be implicitly assumed in the following). We want to investigate:
[TABLE]
for any , \varepsilon_{2}\in\big{(}0,[(\theta-1)/\alpha]\big{)}, \mathfrak{q}^{\prime}\in\big{(}[\alpha/(\alpha-1)],\infty\big{]}. Coming back to the definition of such integrals, this means that we want to control
[TABLE]
We have the following decomposition:
[TABLE]
with the previous notations. Note that , uniformly w.r.t. and that from estimate (83) and by construction (see Subsection 4.2) for each , tends to some in as . One can hence swap both limits and therefore deduce that
[TABLE]
For the second term, we note that, using Minkowski’s and then Hölder’s inequalities due to the regularity of (using e.g. its norm, for some ) that
[TABLE]
so that . This proves (iii). To prove (iv), it suffices to notice that in such a case, the term defined above is 0.∎
Appendix A Proof of Lemma 12
We start with the proof of estimate (39). Having in mind the thermic characterization of the Besov norm (3.1), the main point consists in establishing suitable controls on the thermic part of (3.1) (i.e. the second term in the r.h.s. therein) viewed as the map
[TABLE]
Splitting the interval in function of the current time increment (meant to be small) considering (low and high cut-off), we write:
[TABLE]
For the high cut-off, the singularity induced by the differentiation of the heat kernel in the thermic part is always integrable. Hence using convolution inequalities we have
[TABLE]
From (11) and similarly to (46), we have
[TABLE]
We thus obtain
[TABLE]
To deal with the low cut-off of the thermic part, we need to smoothen the singularity induced by the differentiation of the heat kernel of the thermic characterization. Coming back to the very definition (111) of this term, we note that
[TABLE]
To smoothen the singularity, one then needs to establish a suitable control on the Hölder moduli of the product . We claim that for all in , for all in ({\mathbb{R}}^{{\color[rgb]{0,0,0}d}})^{2}:
[TABLE]
This readily gives, using convolution estimates and (46), that
[TABLE]
Putting together estimates (112) and (A) into (111) yields the estimate (39) in Lemma 12.
Remark 18* (On the control of the first term in the r.h.s. (3.1)).*
This term is easily handled by the norm of the product and hence on norm of times the norm of . This, in view of (46), clearly brings only a negligible contribution in comparison with the one of the thermic part.
To conclude with (39), it remains to prove (A). From (28) (see again the proof of Lemma 4.3 in [HMP19] for details), we claim that there exists s.t. for all and all ,
[TABLE]
Indeed, (116) is direct if (off-diagonal regime). It suffices to exploit the bound (28) for and and to observe that \big{(}|z-y|/(s-t)^{1/\alpha}\big{)}^{\beta^{\prime}}\geq 1. If now (diagonal regime), it suffices to observe from (33) that, with the notations of the proof of Lemma 11 (see in particular (32)), for all :
[TABLE]
Therefore, in the diagonal case (116) follows from (117) and (32) writing |\mathscr{D}^{\eta}p_{\alpha}(s-t,z-x)-\mathscr{D}^{\eta}p_{\alpha}(s-t,y-x)|\leq\int_{0}^{1}d\lambda|\mathscr{D}^{\eta}Dp_{\alpha}(s-t,y-x+\lambda(y-z))\cdot(y-z)|\leq 2C_{m}(s-t)^{-[({\color[rgb]{0,0,0}\eta+1})/\alpha]}q_{\alpha}(s-t,y-x)|z-y|\leq\tilde{C}_{m}(s-t)^{-[({\color[rgb]{0,0,0}\eta+\beta^{\prime}})/\alpha]}q_{\alpha}(s-t,y-x)|z-y|^{\beta^{\prime}} for all (exploiting again that for the last inequality). We conclude the proof of (A) noticing that for all in the map is -Hölder continuous and choosing in the above estimate.
We now prove (40). Splitting again the thermic part of the Besov norm into two parts (high and low cut-off) we write
[TABLE]
Proceeding as we did before for the high cut-off and using (116), we have for any in :
[TABLE]
To deal with the low cut-off, we proceed as we did for (A) in order to smoothen the singularity induced by the differentiation of the thermic kernel. We are hence led to control the Hölder moduli of \Psi(s,\cdot)\Big{(}\mathscr{D}^{\eta}{p}_{\alpha}(s-t,\cdot-x)-\mathscr{D}^{\eta}{p}_{\alpha}(s-t,\cdot-x^{\prime})\Big{)}. We claim that for any in and all in , we have that for all in :
[TABLE]
Repeating the computations in (A) and using the above estimate, we obtain that:
[TABLE]
provided
[TABLE]
It thus remains to prove (118). It directly follows from (116) that:
[TABLE]
Setting:
[TABLE]
it now remains to control this term. Precisely,
If , we write:
[TABLE]
- -
If , we write symmetrically:
[TABLE]
- -
If and , we get:
[TABLE]
proceeding as in (117) and exploiting (32) for the last identity. Plugging (- ‣ A), (- ‣ A) and (- ‣ A) into (A) eventually yields the control (118).
∎
Acknowledgements
This work has been funded by the Russian Science Foundation project (project N*∘* 20-11-20119).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[ABLM 21] S. Athreya, O. Butkovsky, K. Lê, and L. Mytnik. Well-posedness of stochastic heat equation with distributional drift and skew stochastic heat equation. ar Xi V:2011.13498 , 2021.
- 2[ABM 20] S. Athreya, O. Butkovsky, and L. Mytnik. Strong existence and uniqueness for stable stochastic differential equations with distributional drift. Ann. Probab. , 48–1:178–210, 2020.
- 3[AH 96] D. R. Adams and L. I. Hedberg. Function spaces and Potential Theory . Springer, 1996.
- 4[Bas 11] R. F. Bass. Stochastic processes , volume 33. Cambridge series in statistical and probabilistic mathematics, 2011.
- 5[BC 01] R. F. Bass and Z. Q. Chen. Stochastic differential equations for Dirichlet processes. Probability Theory and Related Fields , 121(3):422–446, November 2001.
- 6[BC 03] R. F. Bass and Z.-Q. Chen. Brownian motion with singular drift. Ann. Probab. , 31–2:791–817, 2003.
- 7[BP 15] V. I. Bogachev and A. Y. Pilipenko. Strong solutions to stochastic equations with Lévy noise and a discontinuous drift coefficient. Doklady Mathematics , 92:471–475, 2015.
- 8[CC 18] G. Cannizzaro and K. Chouk. Multidimensional SD Es with singular drift and universal construction of the polymer measure with white noise potential. Ann. Probab. , 46(3):1710–1763, 05 2018.
