Backward It{\^o}-Ventzell and stochastic interpolation formulae
Pierre del Moral (ASTRAL), Sumeetpal Sidhu Singh

TL;DR
This paper introduces a new backward Itô-Ventzell formula and extends stochastic interpolation formulas to stochastic flows, providing spectral conditions for uniform estimates and applications in diffusion perturbation and approximation theories.
Contribution
The paper presents a novel backward Itô-Ventzell formula and extends stochastic interpolation formulas to stochastic flows, with spectral conditions for uniform flow difference estimates.
Findings
New backward Itô-Ventzell formula introduced
Spectral conditions enable simple proofs of flow estimates
Applications demonstrated in diffusion perturbation and approximation
Abstract
We present a novel backward It{\^o}-Ventzell formula and an extension of the Aleeksev-Gr\"obner interpolating formula to stochastic flows. We also present some natural spectral conditions that yield direct and simple proofs of time uniform estimates of the difference between the two stochastic flows when their drift and diffusion functions are not the same, yielding what seems to be the first results of this type for this class of anticipative models. We illustrate the impact of these results in the context of diffusion perturbation theory, interacting diffusions and discrete time approximations
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Backward Itô-Ventzell and stochastic interpolation formulae
P. Del Moral P. Del Moral was supported in part from the Chair Stress Test, RISK Management and Financial Steering, led by the French Ecole polytechnique and its Foundation and sponsored by BNP Paribas, and by the ANR Quamprocs on quantitative analysis of metastable processes.
Authors declaration of interests: none INRIA, Bordeaux Research Center & CMAP, Polytechnique Palaiseau, France
S. S. Singh
Department of Engineering, University of Cambridge, United Kingdom.
Abstract
We present a novel backward Itô-Ventzell formula and an extension of the Alekseev-Gröbner interpolating formula to stochastic flows. We also present some natural spectral conditions that yield direct and simple proofs of time uniform estimates of the difference between the two stochastic flows when their drift and diffusion functions are not the same, yielding what seems to be the first results of this type for this class of anticipative models. We illustrate the impact of these results in the context of diffusion perturbation theory, interacting diffusions and discrete time approximations.
Keywords : Stochastic flows, variational equations, tangent and Hessian processes, perturbation semigroups, backward Itô-Ventzell formula, Alekseev-Gröbner lemma, Skorohod stochastic integral, two-sided stochastic integration, Malliavin differential, Bismut-Elworthy-Li formulae.
Mathematics Subject Classification : 47D07, 93E15, 60H07.
1 Introduction
Let be a vector-valued function from into and be a matrix-valued function from into , for some parameters . Both functions will be assumed to be differentiable. Let be an -dimensional Brownian motion and denote by the -field generated by the increments of the Brownian motion, with .
For any time horizon we denote by the stochastic flow defined for any and any starting point by the stochastic differential equation
[TABLE]
We assume that and have continuous and uniformly bounded derivatives up to the third order. This condition is clearly met for linear Gaussian models as well as for the geometric Brownian motion. This condition ensures that the stochastic flow is a twice differentiable function of the initialisation . In addition, all absolute moments of the flow and the ones of its first and second order derivatives exists for any time horizon. As it is well known, dynamical systems and hence stochastic models involving drift functions with quadratic growth require additional regularity conditions to ensure non explosion of the solution in finite time. It is also implicitly assumed that all functions are smooth functions w.r.t. the time parameter. The present article develop several constructive and stochastic analysis tools including Bismut-Elworthy-Li formulae, stochastic semigroup perturbation formulae, extended two-sided stochastic integration, Malliavin calculus, gradient and Hessian semigroup processes estimates. We are also looking for useful quantitative and time uniform estimates which are valid under a single set of easily checked conditions that only depend on the parameters of the model. Various techniques presented in the article and many results can be separately and readily extended to more general models with weaker and abstract custom assumptions that depend on the different quantities to handle.
Let be the stochastic flow associated with a stochastic differential equation defined as (1.1) by replacing by some drift and diffusion functions with the same regularity properties. Constant diffusion functions are defined by
[TABLE]
In this context, we will assume that and are uniformly bounded w.r.t. the time horizon.
The Markov transition semigroups associated with the flows and are defined for any measurable function on by the formula
[TABLE]
In this paper we derive equations for the differences and in terms of the difference of their corresponding drifts and diffusion functions,
[TABLE]
where and . In some applications the functions and can be interpreted as a local perturbation of the drift and the diffusion of the stochastic flow .
We also address the problem of finding time-uniform estimates for the difference between the stochastic flows and and their corresponding Markov transition kernels and .
These important questions arise in a variety of domains including stochastic perturbation theory as well as in the stability and the qualitative theory of stochastic systems. Classical analytic estimates on the difference between the stochastic flows driven by different drift and diffusion functions are often much too large for most diffusion processes of practical interest. In some instances none of the diffusion flows are stable. In this context, any local perturbation of the stochastic model propagates so that any global error estimate eventually tends to as the time horizon .
Whenever one of the stochastic flows is stable, classical perturbation bounds combining Lipschitz type inequalities with Gronwall lemma [8, 25] yield exceedingly pessimistic global estimates that grows exponentially fast w.r.t. the time horizon. Notice that an exponential type estimate of the form for some parameter and some time horizon s.t. would induce an error bound larger than the estimated number of elementary particles of matters in the visible universe. As mentioned in [29] in the context of Euler scheme type approximations of deterministic dynamical systems, one may encounter situations where and and the resulting exponential bounds are clearly impractical from a numerical perspective.
The statement of the main results of the article are presented in section 1.1:
- i.
Section 1.1.1 presents a novel generalized backward Itô-Ventzell formula (cf. theorem 1.1). The Itô-Ventzell is a very important formula, arguably as useful as the Itô’s change of variable, but surprisingly the backward Itô-Ventzell presented in this work has never been studied before. Theorem 1.1 can be seen as a new generalized backward version of the generalized Itô-Ventzell formula presented in [41]. 2. ii.
In section 1.1.2 we apply the backward Itô-Ventzell formula to derive a forward-backward stochastic perturbation formula that expresses the difference between the stochastic flows and in terms of first and second order derivatives of the flows, which we call the tangent and Hessian processes respectively, with respect to the space parameter (cf. theorem 1.2). 3. iii.
Section 1.1.2 also provides a novel forward-backward Itô type differential formula for interpolating stochastic diffusion flows (cf. the change of variable formula (1.9)). 4. iv.
In the beginning of section 1.1.2 we present a discrete time approach based on the pivotal interpolating telescoping sum formula (4.2). This interpolating stochastic semigroup technique can be seen as an extension to stochastic flows of the stochastic perturbation analysis developed in [22, 18, 20, 21] and in [3, 5, 11] in the context of discrete time models, matrix and nonlinear interacting processes (see also [4, 5]). For a more thorough discussion on these models, we refer to section 1.2. This approach allows to derive a stochastic interpolation formula (1.10) with a fluctuation term (1.12) defined by an extended two-sided stochastic integral. 5. v.
Section 1.1.3 presents some natural spectral conditions on the gradients of and that allows us to derive in a direct way a series of realistic uniform estimates with respect to the time horizon.
The rest of the article is organized as follows:
Section 3 provides some basic tools associated with the first and second variational equations associated with a diffusion flow. We also present some quantitative estimates of the tangent and the Hessian processes. For a more thorough discussion on stochastic flows and their differentiability properties we refer to [14, 32, 40].
Section 4 is mainly concerned with the forward-backward stochastic interpolation formula (1.10) stated in theorem 1.2. Two approaches are presented: The first one discussed in section 4.1 is based on an extension of the two-sided stochastic calculus introduced by Pardoux and Protter in [43] to stochastic interpolation flows. The second one discussed in section 4.2 is based on the generalized backward Itô-Ventzell formula. This section also discusses a multivariate Skorohod-Alekseev-Gröbner formula. Apart from more complex and sophisticated tensor notation, the quantitative stochastic analysis of these multivariate formulae follows the same arguments as the ones used in the proof of theorem 1.3. Thus, we have chosen to concentrate this introduction on stochastic flows.
Some extensions of the stochastic interpolation formula (1.10) are discussed in section 4.4.
Section 5 is dedicated to the analysis of the Skorohod fluctuation process introduced in (1.12).
Section 6 is dedicated to the analysis of an extended version of two-sided stochastic integrals and a generalized backward Itô-Ventzell formula.
Section 7 presents some illustrations of the forward-backward interpolation formulae discussed in the present article in the context of diffusion perturbation theory, interacting diffusions and discrete time approximations.
The technical proofs of some results are housed in the appendix.
1.1 Statement of some main results
1.1.1 A backward Itô-Ventzell formula
We represent the gradient of a real valued function of several variables as a column vector while the gradient and the Hessian of a (column) vector valued function as tensors of type and , see for instance (2.2) and (2.3); in more layman terms a tensor is a matrix while the tensor can be visualized as a “row of matrices” where the entries are matrices of a common dimension. We also use the tensor product and the transpose operator defined in (2.1), see also (2.4).
We denote by the Malliavin derivative from some dense domain into the space . For multivariate -column vector random variables with entries , we use the same rules as for the gradient and is the -matrix with entries . For -matrices with entries we let be the tensor with entries .
For a more thorough discussion on Malliavin derivatives and Skorohod integration we refer to section 2.3.
Let be some function from into , and let be some given state, for some . Suppose we are given a forward -dimensional continuous semi-martingale and a backward random field from into with a column-vector type canonical representation of the following form:
[TABLE]
for some -adapted functions with appropriate dimensions and satisfying the following conditions:
: The functions , and as well as , and the derivatives and are continuous w.r.t. the state and the time variables for any given .
* The function , and the derivatives have at most polynomial growth w.r.t. the state variable, uniformly with respect to .*
* The processes as well as are continuous and have moments of any order.
In this notation, the first main result of this article is the following theorem.
Theorem 1.1**.**
Assume conditions are satisfied. In this situation, for any we have the generalized backward Itô-Ventzell formula
[TABLE]
The stochastic anticipating integral in the r.h.s. of 1.5 is understood as a Skorohod stochastic integral.
The above theorem can be seen as the backward version of the generalized Itô-Ventzell formula presented in [41, 42]. The proof of the above theorem is provided in section 6.2 (see theorem 6.3).
Conventional forward and backward Itô stochastic integrals are particular instances of the two-sided stochastic integrals introduced by Pardoux and Protter in [43]. The terminology " two-sided " coined by the authors in [43] comes from the fact that the integrand of the Skorohod integral depend on the past as well as on the future of the history generated by the Brownian motion.
The stochastic anticipating integral in the r.h.s. of (1.5) involves a backward random field and a forward semimartingale, thus it is tempting to interpret this integral as a two sided integral. Unfortunately, this class of integrands are not considered in the construction of the two-sided stochastic integrals defined in [43]. In section 4.1 and section 6.1 we shall present an extended version of the two-sided stochastic integrals introduced in [43] that applies to integrands defined as a compositions of backward and forward stochastic flows. This extended version applies to backward stochastic flows but it doesn’t encapsulate more general backward random fields. We believe more general extensions of the two-sided integrals can be developed but it is out of the scope of this article to develop a theory on generalized two-sided stochastic integrals. We finally mention that all two-sided stochastic integrals discussed in this article are particular instances of Skorohod integrals
1.1.2 A stochastic flow interpolation formula
The diffusion flow (1.1) is defined in term of a column vector with twice continuously differentiable entries. For we use the backward approximation:
[TABLE]
In the above display, stands for the composition of the mappings and .
The above approximations are rigorously justified in section 4.1 and lead to the backward stochastic flow evolution equation:
[TABLE]
In the above display, represents the change in w.r.t. the variable .
In the same vein, for any we have the interpolating semigroup decompositions
[TABLE]
as well as the forward approximations
[TABLE]
The above approximations are rigorously justified in section 4.1 and lead to the forward-backward stochastic interpolation equation
[TABLE]
The discrete time version of the forward-backward stochastic formula in the above display reduces to the telescoping sum formula (4.2) and the second order Taylor expansions discussed in section 4.1. We already mention that (4.2) can be interpreted as a discrete time version of the Alekseev-Gröbner lemma [1, 24]. The terminology forward-backward comes from the forward and backward nature of (1.9) and the telescoping sum formula (4.2).
Also notice that (1.7) can also be deduced formally from (1.9) by replacing by the stochastic flow in (1.9), and then letting .
This yields the following interpolation theorem.
Theorem 1.2**.**
We have the forward-backward stochastic interpolation formula
[TABLE]
with the stochastic process
[TABLE]
and the fluctuation term given by the Skorohod stochastic integral
[TABLE]
The fluctuation term in the above display can also be seen as the extended two-sided stochastic integral defined in (4.3) (see also proposition 6.2).
These interpolation formulae combine the backward evolution (1.7) with the conventional forward evolution of the perturbed flow.
The proof of the interpolation formula (1.10) is provided in section 4.
We will present two different approaches: The first one presented in section 4.1 is rather elementary and very intuitive. It combines the conventional Itô-type discrete time approximations of stochastic integrals discussed above with the two-sided stochastic integration calculus introduced in [43]. Using this approximation technique the fluctuation term is defined by the extended two-sided stochastic integral defined in (4.3). In this interpretation, the equation (1.10) can be seen as an extended version of the Itô-type change rule formula stated in theorem 6.1 in the article [43] to the interpolating flow
[TABLE]
Roughly speaking, the increments of the interpolating path are decomposed into two parts:
One comes from the backward increments of the flow given the past values of the stochastic flow . The other one comes from the conventional Itô increments of given the future values of the stochastic flow .
The second approach discussed in section 4.2 is based on the generalized backward Itô-Ventzell formula stated in theorem 1.1. More precisely we also recover (1.10) from (1.5) by choosing
[TABLE]
and letting in (1.5). The regularity conditions on the drift and the diffusion function ensure that conditions with stated in section 1.1.1 are satisfied.
We emphasize that the backward diffusion flow discussed in (1.7) and (4.1) is essential to apply theorem 1.1. Section 4.2 also provides a multivariate version of (1.10).
The interpolation formula (1.10) with a fluctuation term given by the Skorohod stochastic integral (1.12) can be seen as a Alekseev-Gröbner formula of Skorohod type.
In this context, the integrability of the fluctuation term and any quantitative type estimates require a refined analysis of the Malliavin derivatives of the integrand. Under our regularity conditions the stochastic flows and are Holder-continuous w.r.t. the time parameters as well as twice differentiable w.r.t. the space variables, with almost sure uniformly bounded first and second order derivatives. In addition, for any all the -absolute moments of the stochastic flows are finite with at most linear growth w.r.t. the initial values. These properties ensure that the Skorohod stochastic integral (1.12) is well defined and they allow to derive several quantitative estimates. Section 5 provides a refined of the fluctuation term; see for instance theorem 5.2.
When the flow is deterministic so that the Skorohod fluctuation term (1.12) reduces to the traditional Itô stochastic integral. In this context, quantitative estimates of the fluctuation term are obtained combining Burkholder-Davis-Gundy inequalities with the generalized Minkowski inequality. The resulting interpolation formula (1.10) can be seen as a Alekseev-Gröbner formula of Itô-type.
To distinguish these two classes of models, the interpolation formulae (1.10) associated with the case will be called an Itô-Alekseev-Gröbner formula; the one associated with the case will be called a Skorohod-Alekseev-Gröbner formula.
1.1.3 Uniform estimates w.r.t. the time horizon
The final objective of this article is to derive uniform estimates w.r.t. the time parameter. Our methodology is mainly based on two different types of regularity conditions to be defined and discussed in detail in section 2.2:
The first is a technical condition that ensures that the -absolute moments of the flows and are uniformly bounded w.r.t. the time horizon; we call this condition .
The second is a spectral condition on the gradient of the drift and diffusion matrices of the stochastic flows, which we call condition . Without going into details, we state one usual case of interest: for constant diffusion functions (1.2) the spectral condition is met for any as soon as the following log-norm conditions are met
[TABLE]
To motivate the above condition consider a linear drift function of the form and . In this case the tangent process satisfies a time-varying deterministic linear dynamical system
[TABLE]
The asymptotic behavior of this process cannot be characterized by the statistical properties of the spectral abscissa of the matrices . Indeed, unstable semigroups associated with time-varying (deterministic) matrices with negative eigenvalues are exemplified in [15, 49]. Conversely, stable semigroups with having positive eigenvalues are given by Wu in [49]. In contrast, the uniform log-norm condition (1.14) provides a readily verifiable condition.
To describe with some precision the second main result of the article, we need to introduce some additional terminology. When there is no ambiguity, we denote by any (equivalent) norm on some finite dimensional vector space. For some multivariate function , for , let and the uniform norm be . For any we also set
[TABLE]
We denote by and some constants that depend on some parameters and but do not depend on the time horizon, nor on the space variable.
In this notation, the second main result of the article takes basically the following form.
Theorem 1.3**.**
Assume conditions and are satisfied for some parameters and . In this situation, we have the time-uniform estimates
[TABLE]
For constant diffusion functions (1.2), the estimate simplifies to
[TABLE]
The estimates (1.16) come from (7.5) and (5.9). A more detailed proof is provided in the appendix, on page Proof of (1.16). The estimates (1.17) are direct consequences of (2.17) and (5.11).
When the Skorohod term is indeed absent and (1.10) reduces to
[TABLE]
We recover the interpolation formula for nonlinear stochastic flows presented in section 3.1 in the article [3]. In this context the analysis of -errors will proceed via two-step procedure. In section 3.1 we will derive the exponential bound
[TABLE]
Using the Minkowski integral inequality in (1.18) yields
[TABLE]
A further conditioning argument and the above exponential bound on the tangent process yields
[TABLE]
Replacing the term outside the time integral with yields the stated result in (1.16) excluding the terms representing the difference in the diffusions.
We illustrate one use of theorem 1.2 in the context of analyzing the error in discretising the diffusion for some initial time point . Let denote the discretisation interval size and for any let
[TABLE]
for a fixed diffusion matrix . Here is the discretisation of with resolution . Note that that the drift at time is not a function of the instantaneous value of , at time , but rather the value it took at the largest discrete time-point before . In section 4.4 we discuss how the formula in (1.10) also applies in this context and establish that
[TABLE]
This comparison result when combined with the regularity assumptions (1.19) yields the moment bound below.
Proposition 1.4**.**
Assume that
[TABLE]
for some , In this situation, for any we have the uniform estimates
[TABLE]
where .
Proposition 1.4 is proved in section 7.3. To apply proposition 1.4 to a Langevin diffusion with a convex potential , the drift would be and the corresponding assumptions on are typical.
1.2 Comments and comparisons with existing literature
The interpolation formula (1.10) can be interpreted as an extension of Alekseev-Gröbner lemma [1, 24, 30] as well as an extended version of the variation-of-constant and related Gronwall type lemma [8, 25] to diffusion processes. In this connection we underline that the forward-backward formula (1.10) differs from the stochastic Gronwall lemma presented in [45] based on particular classes of stochastic linear inequalities that doesn’t involve Skorohod type integrals.
The forward-backward interpolation formula (1.10) can also be seen as an extension of theorem 6.1 in [43] on two-sided stochastic integrals to diffusion flows. This interpolation formula can also be interpreted as a backward version of the generalized Itô-Ventzell formula presented in [41] (see also theorem 3.2.11 in [37]).
Stochastic interpolation formulae of the form (1.10) and their discrete time version discussed in (4.2) are not really new. To describe their origins, it is worth to mention that the stochastic perturbations may come from auxiliary random sources, uncertainty propagations, as well as time discretization schemes and mean field type particle fluctuations.
The pivotal interpolating telescoping sum formula (4.2) and the second order forward-backward perturbation semigroup methodology discussed in the present article can also be found in chapter 7 in [18] for discrete time models as well as in the series of articles [20, 21, 22] published at the beginning of the 2000s, see also chapter 10 in [19]. In this context, the random perturbations come from the fluctuations of a genetic type particle interpretation of nonlinear Feynman-Kac semigroups.
The more recent articles [9, 10, 11] also provide a series of backward-forward interpolation formulae of the same form as (1.10) for stochastic matrix Riccati diffusion flows arising in data assimilation theory (cf. for instance theorem 1.3 in [11] as well as section 2.2 in [10] and the proof of theorem 2.3 in [9]). In this context, the random perturbations come from the fluctuations of a mean field particle interpretation of a class of nonlinear diffusions equipped with an interacting sample covariance matrix functional.
We underline that the Itô-Alekseev-Gröbner formula (4.6) discussed in [11] is an extension of the interpolation formula (1.10) to stochastic diffusion flows in matrix spaces. In this context the unperturbed model is given by the flow of a deterministic matrix Riccati differential equation and the random perturbations are described by matrix-valued diffusion martingales. The corresponding Itô-Alekseev-Gröbner formulae can be seen as a matrix version of theorem 1.2 in the present article when . These stochastic interpolation formulae were used in [11] to quantify the fluctuation of the stochastic flow around the limiting deterministic Riccati equation, at any order. We will briefly discuss the analog of these Taylor type expansions in section 7.1 in the context of Euclidian diffusions.
The forward-backward perturbation methodology discussed in the present article has also been used in [3, 5] in the context of nonlinear diffusions and their mean field type interacting particle interpretations, see for instance section 2.3 in [5]. In this context, the random perturbations come from the fluctuations of a mean field particle interpretation of a class of nonlinear diffusions. The extended version of the Itô-Alekseev-Gröbner formula (1.18) to nonlinear diffusions is also discussed in section 3.1 in the article [3]. In this situation, the time varying drift and diffusion functions of the stochastic flows depend on some possibly different nonlinear measure valued semigroups which may start from two possibly different initial distributions. For a more thorough discussion on this class of nonlinear diffusions, we refer to the Itô-Alekseev-Gröbner formula (3.2) and corollary 3.2 in the article [3]. These Itô-Alekseev-Gröbner formulae correspond to theorem 1.2 in the present article when .
The interpolating stochastic semigroup techniques discussed in the present article are also applied to mean field particle systems and deterministic nonlinear measure valued semigroups. In this context, the process is given a deterministic measure-valued process and represents the evolution of the particle density profiles associated with an approximating mean field particle interpretation of . For instance, the article [4] is concerned with interacting jumps models on path spaces, the second article [5] discusses the propagation of chaos properties of mean field type interacting diffusions. The stochastic interpolation formulae discussed in [4, 5] correspond to the case (1.10) with and or (see for instance the interpolation formula (3.5), theorem 2.6, theorem 2.7 and the interpolating telescoping sum in section 1.2 in [5])
In the series of articles discussed above, as in (1.9) the central common idea is to analyse the evolution of the interpolating process (1.13) between a given process and some stochastic flow with an extra level of randomness. In discrete time settings, the differential interpolation formula (1.9) can also recasted in terms of a telescoping sum of the same form as (4.2) combined with a second order Taylor expansion reflecting the differences between a stochastic semigroup and its perturbations, see for instance chapter 7 in [18].
In most of the application domains discussed above, this second order stochastic perturbation methodology has been developed to quantify uniformly w.r.t. the time horizon the propagations of some stochastic perturbations entering in some deterministic and stable reference or unperturbed process. In the context of Euclidian diffusions, this corresponds to the situation where the diffusion function (the case can be treated by symmetry arguments). The Itô-Alekseev-Gröbner type formulae discussed in section 3.1 in the article [3] correspond to theorem 1.2 in the present article when .
The present article can be seen as a natural extension of the second order perturbation methodology developed in the above referenced articles to diffusion type perturbed processes when .
To the best of our knowledge, the first article considering the case with and is the independent work of Hudde-Hutzenthaler-Jentzen-Mazzonetto [27]. In this article, the authors discuss an Itô-Alekseev-Gröbner formula for abstract diffusion perturbation models of the form (4.11). Here again, as in the list of referenced articles discussed above, the common central idea is to use discrete time approximations and combine the pivotal interpolating telescoping sum formulae (4.2) with a second order Taylor expansion. Besides this fact and in contrast with our analysis, the fluctuation term (1.12) discussed in [27] cannot be interpreted in terms of the extended two-sided stochastic integral defined in (4.3) (see also proposition 6.2) but only in terms of a Skorohod stochastic integral. The study [27] is also based on a series of particularly chosen and custom regularity conditions. For instance, the authors assume that the abstract diffusion perturbation models are chosen so that the Skorohod fluctuation term exists without providing any quantitative type estimate. This work is also not connected to the two-sided stochastic integration calculus developed by Pardoux and Protter in [43] nor to any type of backward Itô-Ventzell formula.
We feel that our approach is more direct and intuitive as it relies on an extended version of Itô’s change rule formula (1.9) to interpolating stochastic flows. It also allows to interpret the fluctuation term (1.12) as an extended two-sided stochastic integral.
In section 5 in the present article, we will also see that any quantitative analysis requires to estimate the absolute moments of the Malliavin derivatives of the stochastic integrands of the Brownian motion arising in the Skorohod fluctuation term. In our framework, these Malliavin derivatives depend on the gradient of both of the diffusion functions as well as on the tangent process of the perturbed diffusion flow. The quantitative analysis developed in 5 can be extended without difficulties to abstract diffusion perturbation models satisfying appropriate differentiability and integrability conditions.
The article [27] also presents an application to tamed Euler type discrete time approximations of a stochastic van-der-Pol process introduced in [47], simplifying the analysis provided in an earlier work [28]. In this situation, we underline that the Skorohod fluctuation term is null so that the resulting Alekseev-Gröbner type formula resumes to the simple and elementary case discussed in (1.18) and in the article [3]. As expected for this class of "unstable processes", the authors recast a series of -estimates discussed in [28] into a series of estimates that grow exponentially fast with respect to the time horizon.
In contrast with the present work, the above article doesn’t discuss any quantitative uniform estimates w.r.t. the time horizon. The analysis presented in [27] is mainly concerned with the proof of a Skorohod-Alekseev-Gröbner type formula for abstract diffusion perturbation models and it doesn’t apply to derive any type of estimates to general diffusion perturbation models without adding regularity conditions.
Besides its elegance the forward-backward interpolation formula (1.10) is clearly of rather poor mathematical and numerical interest without a better understanding of the variational processes and the Skorohod fluctuation term (1.12). A crucial problem is to avoid exceedingly pessimistic exponential estimates that grow exponentially fast w.r.t. the time horizon.
One advantage of the second order perturbation methodology developed in the present article is that it takes advantage of the stability properties of the tangent and the Hessian flow in the estimation of Skorohod fluctuation term and this sharpen analysis of the difference between stochastic flows. Our main contribution is to develop a refined analysis of these variational processes and the Skorohod fluctuation terms. We also deduce several uniform perturbation propagation estimates with respect to the time horizon, yielding what seems to be the first results of this type for this class of models.
The forward-backward stochastic interpolation formula (1.10) can also be extended to more general classes of stochastic flows on abstract state spaces. For instance the recent article [30] provides a deterministic first order version of (1.10) on abstract Banach spaces. The stochastic perturbation analysis developed in the series of articles [4, 5, 9, 10, 11, 20, 21, 22] and the books [18, 19] is applied to matrix-valued diffusions and measure valued processes, including mean field type interacting diffusions and Feynman-Kac type interacting jumps models.
The stability properties of these abstract models discussed above depend on the problem at hand. To focus on the main ideas without clouding the article with unnecessary technical details and sophisticated mathematical tools based on abstract ad hoc regularity conditions we have chosen to concentrate the article on diffusion flows on Euclidian spaces with simple and easily checked regularity conditions.
2 Preliminary results
2.1 Some basic notation
With a slight abuse of notation, we denote by the identity -matrix, for any . We also denote by any (equivalent) norm on a finite dimensional vector space over . All vectors are column vectors by default.
We introduce some matrix notation needed from the onset.
We denote by , , resp. and the trace, the spectral norm, the Frobenius norm, and the logarithmic norm of some matrix . is the transpose of and the largest eigenvalue. The spectral norm is sub-multiplicative or and compatible with the Euclidean norm for vectors, by that we mean for a vector we have .
Let be the set of multiple indexes over some finite set . We denote by the entries of a -tensor with index set for and for . For the sake of brevity, the index sets will be implicitly defined through the context.
For a given -tensor and a given tensor , and is a -tensor resp. a -tensor with entries given by
[TABLE]
The symmetric part of a -tensor is the -tensor with entries
[TABLE]
We consider the Frobenius inner product given for any -tensors and by
[TABLE]
For any -tensors and we also check the Cauchy-Schwartz inequality
[TABLE]
For any tensors with appropriate dimensions we have the inequality
[TABLE]
Given some tensor valued function we also set
[TABLE]
Given some smooth function from into we denote by
[TABLE]
the gradient -matrix associated with the column vector-valued function . Building on this notation: let and let the mapping . Then . Let
[TABLE]
The Hessian associated with the function is a -tensor where . In this notation we can compactly represent the second order term of the Taylor expansion of the the vector valued function . For a vector
[TABLE]
where we have regarded the matrix as the -tensor with .
In the same vein, in terms of the tensor product (2.1), for any pair of column vector-valued function and and any matrix function from into , for any parameter we also have
[TABLE]
In a more compact form, the above formula takes the form
[TABLE]
For any we let be the convex set of probability measures on with absolute -th moment and equipped with the Wasserstein distance of order denoted by
[TABLE]
In the above display the infimum is taken over all pair or random variables with marginal distributions . The stochastic transition semigroups associated with the flows and are defined for any measurable function on by the formulae
[TABLE]
Given some column vector-valued function , let and denote the column vector-valued functions with entries and . Building on the tensor notation, let and respectively denote the and -tensor valued functions with entries
[TABLE]
We also consider the random and -tensors given by
[TABLE]
Throughout the rest of the article, unless otherwise stated denote constants whose values may vary from line to line but only depend on the parameters in their subscripts, i.e. and , as well as on the parameters of the model; that is, on the drift and diffusion functions. We also use the letters to denote universal constants. Importantly these contants do not depend on the time horizon. We also consider the uniform log-norm parameters
[TABLE]
and the parameters {\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}(b,\sigma) defined by
[TABLE]
2.2 Regularity conditions and some preliminary results
We consider two different types of regularity conditions ()n and , indexed by some parameter , for the diffusion .
There exists some parameter such that for any we have
[TABLE]
There exists some parameter such that
[TABLE]
where denotes the -th column of In addition, the following condition is satisfied
[TABLE]
We now define the corresponding assumptions for the diffusion .
The regularity condition defined as in for the diffusion .
Let be the symmetric matrix defined as in (2.7) when . Assume there exists some such that . Furthermore, assume where is defined as when .
We write when both conditions and are satisfied.
Both conditions and are met, and let
[TABLE]
In practice, the uniform moment condition is often checked using Lyapunov techniques. For example we can use the following polynomial growth condition.
There exists some parameters with such that for any and any we have
[TABLE]
for some norm of the matrix-valued diffusion function. In addition, we have
[TABLE]
Lemma 2.1**.**
For any we have
[TABLE]
The proof of the above assertion follows standard stochastic calculations, thus it is housed in the appendix, on page Proof of (2.10).
For one-dimensional geometric Brownian motions the condition is a sufficient and necessary condition for the existence of uniformly bounded absolute -moments. In this case coincides with by setting
[TABLE]
Whenever condition is met for some , we also check the uniform estimates
[TABLE]
with the same parameter as the one associated with the condition .
Recalling that the functions and have at most linear growth, with the -norms introduced in (1.15) we also have that
[TABLE]
To give more insight where these assumptions will be used, we now briefly state the stability results that stem from them. Condition ensures that the exponential decays of the absolute and uniform -moments of the tangent and the Hessian processes; that is, when is met for some we have that
[TABLE]
A more precise statement is provided in proposition 3.2 and proposition 3.10. These uniform estimates clearly imply, via a conditioning argument, that for any and we have
[TABLE]
with the same parameters as in (2.13).
The case will also serve a useful purpose, for example in analysing the error of a numerical implementation as in proposition 1.4. For instance whenever is met we have the almost sure and uniform gradient estimates
[TABLE]
In addition, we have the almost sure and uniform Hessian estimates
[TABLE]
A proof of the above estimates is provided in the beginning of section 3.1 and section 3.2. In this situation, whenever is met we have
[TABLE]
In the above display, stands for the stochastic process discussed in (1.11), and stands for some finite constant that doesn’t depend on the parameter . For instance, for a Langevin diffusion associated with some convex potential function we have and . Then assuming
[TABLE]
where the almost sure tangent and Hessian bounds follow from (2.15) and (2.16) respectively.
In practice, it is often easier to work with than and we now discuss some ways of estimating in terms of and in the reverse direction. The latter is straightforward:
[TABLE]
To estimate in terms of , assume the following ellipticity condition is satisfied
[TABLE]
We recall the Ando-Hemmen inequality [2] for any symmetric positive definite matrices
[TABLE]
for any unitary invariant matrix norm . In the above display, stands for the minimal eigenvalue. We also have the square root inequality
[TABLE]
See for instance theorem 6.2 on page 135 in [26], as well as proposition 3.2 in [2]. A proof of (2.21) can be found in [7]. In this situation, using (2.20) and (2.21) we check that
[TABLE]
This provides a way to estimate the growth of in terms of the one of . For instance the estimate (1.16) combined with (2.22) implies that
[TABLE]
Assume that is satisfied for some . Also let be some multivariate function such that
[TABLE]
In this situation, we have the estimates
[TABLE]
2.3 Some results on anticipating stochastic calculus
In this section we review some results on Malliavin derivatives and Skorohod integration calculus which will be needed below. We restrict the presentation to unit time intervals. Let be the canonical space equipped with the Wiener measure associated with the -dimensional Brownian motion discussed in the introduction.
The Malliavin derivative is a linear operator from some dense domain into the space of -dimensional processes with square integrable states on the unit time interval. For multivariate -column vector random variables with entries , we use the same rules as for the gradient and we set
[TABLE]
For -matrices with entries we let be the tensor with entries
[TABLE]
It is clearly out of the scope of this article to review the analytical construction of Malliavin differential calculus. For a more thorough discussion we refer the reader to the seminal book by Nualart [37], see also the more synthetic presentation in the articles [38, 41].
Formally, one can think the Malliavin derivatives of some as way to extract from the random variable the integrand of Brownian increment . For instance, when we have
[TABLE]
As conventional differentials, for any smooth function from into , Malliavin derivatives satisfy the chain rule properties
[TABLE]
For instance, for any we have
[TABLE]
In the same vein, we have
[TABLE]
Let be the Hilbert space of -dimensional process with Malliavin differentiable entries equipped with the norm
[TABLE]
The Skorohod integral w.r.t. the Brownian motion on the unit interval is defined a linear and continuous mapping from
[TABLE]
characterized by the two following properties
[TABLE]
The above formula can be seen as an extended version of the Itô isometry to Skorohod integrals, for instance [39], as well as chapters 1.3 to 1.5 in the book by Nualart [37].
As for the Itô integral, the Skorohod integral w.r.t. the -dimensional Brownian motion of a matrix valued process with entries is defined by the column vector with entries
[TABLE]
3 Variational equations
3.1 The tangent process
In terms of the tensor product (2.4), the gradient of the diffusion flow is given by the gradient -matrix
[TABLE]
where is the -th component of the Brownian motion. After some calculations we check that
[TABLE]
with the matrix function defined in (2.7) and the symmetric matrix valued martingale
[TABLE]
These expansions, when combined with condition , yield the following estimates of the difference between and .
Proposition 3.1**.**
Assume is satisfied. Then
[TABLE]
In addition, we have the almost sure estimate
[TABLE]
Proof of Prop. 3.3.
Whenever is met, we have the following uniform estimate from (3.1)
[TABLE]
where the term arises from imposing the initial condition on the resulting differential equation for . In addition, when the martingale is null and as a consequence of (3.1) we have the following almost sure estimate
[TABLE]
The Taylor expansion
[TABLE]
combined with (3.4) and (3.5) completes the proof. ∎
These contraction inequalities quantify the stability of the stochastic flow w.r.t. the initial state . For instance, the estimate (3.2) ensures that the Markov transition semigroup is exponentially stable; that is, we have that
[TABLE]
For the Langevin diffusions discussed in (2.18) the stochastic flow is time homogeneous; that is we have that and . In addition when , the diffusion flow has a single invariant measure on given by the Boltzmann-Gibbs measure
[TABLE]
From (2.18), it follows that
[TABLE]
for all .
Taking the trace in (3.1) we also find that
[TABLE]
with the martingale
[TABLE]
Observe that
[TABLE]
This implies that
[TABLE]
Whenever is met, we have the estimate
[TABLE]
with the uniform log-norm parameter defined in (2.5). This yields the estimate
[TABLE]
More generally, we readily check the following result.
Proposition 3.2**.**
When condition is met we have the following time-uniform bounds,
[TABLE]
3.2 The Hessian process
In terms of the tensor product (2.1), we have the matrix diffusion equation
[TABLE]
with the null matrix initial condition and the matrix-valued martingale
[TABLE]
Consider the tensor functions
[TABLE]
After some computations, we check that
[TABLE]
with the matrix function defined in (2.7) and the tensor-valued martingale
[TABLE]
When the above equation reduces to
[TABLE]
Whenever is met, taking the trace in the above display we check that
[TABLE]
This yields the estimate
[TABLE]
Using (2.15) this implies that
[TABLE]
This ends the proof of the almost sure estimate (2.16).
For more general models, we have that
[TABLE]
with a continuous martingale with angle bracket
[TABLE]
Proposition 3.3**.**
Assume is met. In this situation, for any s.t. we have
[TABLE]
with the parameters {\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}(b,\sigma) and defined in (2.6) and (2.8).
In the above display, is defined in (2.5). The proof of the above estimate is technical and thus housed in the appendix on page Proof of proposition 3.3
3.3 Bismut-Elworthy-Li formulae
We further assume that ellipticity condition (2.19) is met. In this situation, we can extend gradient semigroup formulae to measurable functions using the Bismut-Elworthy-Li formula
[TABLE]
with the stochastic process
[TABLE]
The above formula is valid for any function of the following form
[TABLE]
for some non decreasing differentiable function on with bounded continuous derivatives and such that
[TABLE]
Whenever is met, combining (3.4) with (3.11), for any s.t. we check that
[TABLE]
Let with be some differentiable function on null on and such that and . For instance we can choose
[TABLE]
In this situation, we check that
[TABLE]
from which we find the rather crude uniform estimate
[TABLE]
In the same vein, for any we have the formulae
[TABLE]
with the process
[TABLE]
In the above display stands for the tensor function
[TABLE]
A detailed proof of the formulae (3.14) and (3.15) in the context of nonlinear diffusion flows can be found in the appendix in [5].
Observe that
[TABLE]
Whenever is met, using the estimate (3.3) for any
[TABLE]
In the same vein, using (3.15) for any and any bounded measurable function s.t. we also check the rather crude uniform estimate
[TABLE]
Choosing in the above display we check that for any we obtain the uniform estimate
[TABLE]
The extended versions of the above formulae in the context of diffusions on differentiable manifolds can be found in the series of articles [6, 13, 23, 36, 46].
4 Backward semigroup analysis
4.1 The two-sided stochastic integration
For any given time horizon we have the rather well known backward stochastic flow equation
[TABLE]
The right hand side integral is understood as a conventional backward Itô-integral. In a more synthetic form, the above backward formula reduces to (1.7).
An elementary proof of the above formula based on Taylor expansions is presented in [17], different approaches can also be found in [31] and [33]. Extensions of the backward Itô formula (4.1) to jump type diffusion models as well as nonlinear diffusion flows can also be found in [16] and in the appendix of [3].
Consider the discrete time interval associated with some refining time mesh from to , for some time step . In this notation, combining (1.6) with (1.8) for any we have the Taylor type approximation
[TABLE]
This yields the interpolating forward-backward telescoping sum formula
[TABLE]
We obtain formally (1.10) by summing the above terms and passing to the limit .
To be more precise, we follow the two-sided stochastic integration calculus introduced by Pardoux and Protter in [43]. As mentioned by the authors this methodology can be seen as a variation of Itô original construction of the stochastic integral. In this framework, the Skorohod stochastic integral (1.12) arising in (1.9) is defined by the -convergence
[TABLE]
The proof of the above assertion is based on a slight extension of proposition 3.3 in [43] to Skorohod integrals of the form (1.12). For the convenience of the reader, a detailed proof of the above assertion for one dimensional models is provided in section 6.1.
Using (4.3), the complete proof of (1.9) now follows the same line of arguments as the ones used in the proof of Itô-type change rule formula stated in theorem 6.1 in [43], thus it is skipped.
4.2 A multivariate stochastic interpolation formulae
In terms of the tensor product (2.1), for any and any twice differentiable function from into with at most polynomial growth the function satisfies the backward formula (1.4) with the random fields
[TABLE]
Using the quantitative estimates presented in section 5.2, we checked that the regularity conditions , and stated in section 1.1.1 are satisfied. Rewritten in terms of the stochastic semigroups and we obtain the forward-backward multivariate interpolation formula
[TABLE]
with the stochastic integro-differential operator
[TABLE]
and the two-sided stochastic integral term given by
[TABLE]
Using elementary differential calculus, for twice differentiable (column vector-valued) function from into we readily check the gradient and the Hessian formulae
[TABLE]
This shows that and have the same form as the integrals and defined in (1.10) and (1.11) up to some terms involving the gradient and the Hessian of the function . For instance, we have the two-sided stochastic integral formula
[TABLE]
Also observe that (4.4) coincides with (1.10) for the identity function; that is, we have that
[TABLE]
The above discussion shows that the analysis of the differences of the stochastic semigroups in terms of the tangent and the Hessian processes is essentially the same as the one of the difference of the stochastic flows . For instance using the discussion provided section 5.3, when the gradient and the Hessian of the function are uniformly bounded the estimates stated in theorem 1.3 can be easily extended at the level of the stochastic semigroups.
The -norm of the two-sided stochastic integrals in (1.10) and (4.4) are uniformly estimated as soon as the pair of drift and diffusion functions and satisfy condition . For a more thorough discussion we refer to section 5.1, see for instance the -norm estimates presented in theorem 5.2 applied to the difference function .
4.3 Semigroup perturbation formulae
Besides the fact that the Skorohod integral in the r.h.s. of (4.4) is not a martingale (w.r.t. the Brownian motion filtration) it is centered (see for instance (2.26) and the argument provided in the beginning of section 5.1). Thus, taking the expectation in the univariate version of (4.4) we obtain the following interpolation semigroup decomposition.
Corollary 4.1**.**
For any twice differentiable function from into with bounded derivatives we have the forward-backward semigroup interpolation formula
[TABLE]
In addition, under some appropriate regularity conditions for any differentiable function such that and we have the uniform estimate
[TABLE]
Rewritten in terms of the infinitesimal generators of the stochastic flows we recover the rather well known semigroup perturbation formula
[TABLE]
The above formula can be readily checked using the interpolating formula given for any by the evolution equation
[TABLE]
Now we come to the proof of (4.9). Whenever is met, combining (3.13) with (3.16) for any differentiable function s.t. and and for any we check that
[TABLE]
This ends the proof of (4.9).
After some elementary manipulations the forward-backward interpolation formula (4.8) yields the following corollary.
Corollary 4.2**.**
Let and be some ergodic diffusions associated with some time homogeneous drift and diffusion functions and . The invariant probability measures and of and are connected for any twice differentiable function from into with bounded derivatives by the following interpolation formula
[TABLE]
In the above display stands for a random variable with distribution and stands for the Markov transition semigroup of the process .
The formula (4.10) can be used to estimate the invariant measure of a stochastic flow associated with some perturbations of the drift and the diffusion function.
For instance, for homogeneous Langevin diffusions associated with some convex potential function we have
[TABLE]
In the above display, stands for the Lebesgue measure on . In this situation, using (4.10), for any ergodic diffusion flow with some drift and an unit diffusion matrix we have
[TABLE]
Notice that the above formula is implicit as the r.h.s. term depends on . By symmetry arguments, we also have the following more explicit perturbation formula
[TABLE]
In the above display stands for a random variable with distribution and stands for the Markov transition semigroup of the process .
4.4 Some extensions
Several extensions of the forward-backward stochastic interpolation formula (1.10) to more general stochastic perturbation processes can be developed. For instance, suppose we are given some stochastic processes and adapted to the filtration of the Brownian motion , and let be the stochastic flow defined by the stochastic differential equation
[TABLE]
In this situation, the interpolation formula (1.9) remains valid when is replaced by the stochastic matrices . This yields without further work the forward-backward stochastic interpolation formula (1.10) with the local perturbations
[TABLE]
The corresponding interpolation formula should be used with some caution as the -norm of the two-sided stochastic integral (1.12) depends on the Malliavin differential of the integrand process of the Brownian motion; see for instance the variance formula provided in lemma 5.1.
Assume that and the regularity condition is met. Also suppose is given by a stochastic differential equation of the form (4.11) with and . Arguing as above, in terms of the tensor product (2.1) we have
[TABLE]
Combining (2.15) with the generalized Minkowski inequality, we check the following proposition.
Proposition 4.3**.**
Assume that is met for some . In this situation, for any we have the estimates
[TABLE]
In the same vein, we have
[TABLE]
For instance, for the Langevin diffusion discussed in (2.18) and (3.7) the weak expansion (4.14) implies that
[TABLE]
This yields the -Wasserstein estimate
[TABLE]
Combining (3.13) with (4.15), for any we also have the total variation norm estimate
[TABLE]
5 Skorohod fluctuation processes
5.1 A variance formula
Let be some differentiable -matrix valued function on such that
[TABLE]
Recalling that is independent of the flows and , the discrete time approximation (4.3) shows that Skorohod stochastic integral is centered; that is, we have that .
Following (4.3), the variance can be computed using the following approximation formula
[TABLE]
The proof of the above assertion is provided in section 6.1, see for instance proposition 6.2.
Consider the matrix valued function
[TABLE]
In this notation, the limiting diagonal term in the r.h.s. of (5.2) is clearly equal to
[TABLE]
In addition, whenever condition is met and is bounded, (3.4) readily yields the estimate
[TABLE]
More generally, using (3.8) whenever and are met for some we have the estimate
[TABLE]
This implies that
[TABLE]
The non-diagonal term can be computed in a more direct way using Malliavin derivatives of the functions . For any we have
[TABLE]
As expected, observe that
[TABLE]
In the reverse angle, whenever we have the chain rule formula
[TABLE]
As above, Malliavin differentials and can be computed using the chain rule formulae (2.24).
A more detailed analysis of the chain rules formulae (2.24), (2.25) and (5.7) for one dimensional models is provided in section 6.1 (cf. lemma 6.1).
Observe that
[TABLE]
We consider the inner product
[TABLE]
In this notation, an explicit description of the -norm of the two-sided stochastic integral in terms of Malliavin derivatives is given below.
Lemma 5.1**.**
The -norm of the Skorohod integral introduced in (4.3) is given for any and by the formulae
[TABLE]
with the random matrix function defined in (5.3) and the Malliavin derivative given in formulae (5.6) and (5.7). In addition, we have
[TABLE]
The above lemma can be interpreted as a matrix version of the isometry property (2.26). A proof of the above lemma based on the -approximation of two-sided stochastic integrals is provided in section 6.1 (see for instance proposition 6.2).
5.2 Quantitative estimates
For any and any tensor norms we also quote the rather well known -norm estimates
[TABLE]
for some finite constants whose values only depend on . A proof of these estimates can be found in [38, 48], see also [39] for multiple Skorohod integrals. By the generalized Minkowski inequality, for any we also have the estimate
[TABLE]
Observe that for any we have
[TABLE]
The main objective of this section is to prove the following theorem.
Theorem 5.2**.**
Assume that and are satisfied for some parameter and some . In this situation, we have the uniform estimate
[TABLE]
For uniformly bounded diffusion functions whenever is met for some we have
[TABLE]
In addition, for constant diffusion functions whenever is met, for any we have the uniform estimate
[TABLE]
The proof of the above theorem, including a more detailed description of the parameters and is provided below.
Next, we estimate the -norm of the Malliavin differential in the two cases and .
Case :
Using (5.6) we have
[TABLE]
Using (2.24) and (2.25) this yields the estimate
[TABLE]
with the functions
[TABLE]
In the above display, stands for the interpolating flow defined in (1.13).
- •
Firstly assume that and is satisfied for some parameter . In this situation, applying proposition 3.2 and proposition 3.3, for any we have the uniform estimates
[TABLE]
with the parameter {\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{n,\epsilon}(b,\sigma) given by
[TABLE]
- •
More generally, when the functions and may grow at the most linearly with respect to . Assume that conditions and condition are satisfied for some parameters and . In this situation, applying Hölder inequality we check that
[TABLE]
Applying proposition 3.2 we check that
[TABLE]
In the same vein, combining proposition 3.2 and proposition 3.3 with the uniform moment estimates (2.11) we check that
[TABLE]
We conclude that
[TABLE]
with the parameter
[TABLE]
Case :
We use (5.7) to check that
[TABLE]
On the other hand, using the chain rules (2.24) we have
[TABLE]
This yields the estimate
[TABLE]
- •
Firstly assume that and condition is satisfied for some . In this situation, arguing as above for any we have the uniform estimates
[TABLE]
for some universal constant and the parameter \overline{{\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}}_{n,\epsilon}(b,\sigma) given by
[TABLE]
- •
More generally assume that . Also assume that conditions and are satisfied for some parameters and . In this situation, we have
[TABLE]
with the parameter
[TABLE]
The end of the proof of theorem 5.2 is a direct consequence of the estimates discussed above combined with (5.8) and the diagonal estimates presented in (5.4).
5.3 Some extensions
This section is concerned with the two-sided stochastic integral (4.6). Using the gradient formula in (4.7) the Skorohod stochastic integral in (4.6) takes the form
[TABLE]
with the integrands
[TABLE]
As in (2.25), using the chain rule properties of Malliavin derivatives we check that
[TABLE]
as well as
[TABLE]
This yields the differential formula
[TABLE]
The Malliavin derivatives are computed using formulae (5.6) and (5.7); thus, it remains to compute the Malliavin derivatives of the interpolating path.
When we have
[TABLE]
In this situation, as in (2.24) using the chain rule properties of Malliavin derivatives we check that
[TABLE]
By (2.23) we conclude that
[TABLE]
When we have
[TABLE]
In this situation, arguing as above we check that
[TABLE]
By (2.23) we conclude that
[TABLE]
6 Some anticipative calculus
For clarity and to avoid unnecessary sophisticated multi-index notation, we only consider one dimensional model. The proof of the results presented in this section in the general case can be reproduced word-for-word for multidimensional models.
To simplify the presentation, we write the derivative of order of a smooth function . We also set . We also reduce the analysis to the unit interval. In this context, for any we set
[TABLE]
6.1 Extended two-sided stochastic integrals
The aim of this section is to extend the two-sided stochastic integration introduced in [43] to Skorohod integrals of the form (4.3), for some time homogeneous function satisfying (5.1). For any we set
[TABLE]
In this notation the limiting integral in (4.3) takes formally the following form
[TABLE]
The existence of this two-sided stochastic integral is discussed below in (6.4).
To simplify the presentation, we fix the state variable and we write and instead of and . Next technical lemma provided a more explicit description of the Malliavin derivatives of the processes .
Lemma 6.1**.**
For any we have
[TABLE]
In addition, we have
[TABLE]
Proof.
Using the chain rules properties, for any we have
[TABLE]
The end of the proof of the first assertion comes from the fact that
[TABLE]
In the same vein, we have
[TABLE]
We also have that
[TABLE]
The last assertion comes from the fact that
[TABLE]
The r.h.s. term in the above display can be rewritten as follows
[TABLE]
In the same vein, we have
[TABLE]
This ends the proof of the second assertion. The proof of the lemma is now completed.
From the above lemma, we also check that all the -absolute moments of the Malliavin derivatives are finite with at most quadratic growth w.r.t. the initial values.
Next proposition extends proposition 3.3 in [43] to stochastic processes of the form (6.2).
Proposition 6.2**.**
Let be any refining sequence of partitions of the unit interval. For any we define
[TABLE]
Then is a Cauchy sequence in . In addition, for any decreasing sequence of time steps we have the formula
[TABLE]
Before entering into the details of the proof of the proposition, we give a couple of comments. The hypothesis that is a refining sequence indexed by is not essential but it simplifies the proof of the proposition, see for instance lemma 3.1.1 in [37]. Arguing as in the proof of theorem 3.3 and theorem 7.1 in [43] the above proposition ensures that the two-sided integral defined by the -limit coincides with the two-sided stochastic integral of the process over the unit interval; that is, we have that
[TABLE]
In this context, proposition 6.2 can be interpreted as a version of the isometry property (2.26) for the generalized two-sided integral defined above.
Proof of proposition 6.2:
We fix and we assume that is a refinement of . For any we also set
[TABLE]
With a slight abuse of notation we set
[TABLE]
For any overlapping pair using the decomposition
[TABLE]
we have
[TABLE]
It follows from the continuity properties of the processes that
[TABLE]
When we have
[TABLE]
On the other hand, we have the decomposition
[TABLE]
with the increment functions
[TABLE]
With a slight abuse of notation, we shall denote by some possible random variable with any -absolute moment of order , for some with . In this notation, we have
[TABLE]
Given a smooth function we set
[TABLE]
In this notation, we have the first and second order decompositions
[TABLE]
This implies that
[TABLE]
from which we conclude that
[TABLE]
This yields the first order decomposition
[TABLE]
with the functions
[TABLE]
Notice that none of the functions but the increment functions and depend on , nor on .
In the reverse angle, we have
[TABLE]
with
[TABLE]
Arguing as above, we have
[TABLE]
We conclude that
[TABLE]
In the same vein, we have
[TABLE]
Multiplying these terms, we check that
[TABLE]
with the functions
[TABLE]
None of the functions but the increment depend on , nor on .
Recall that the functions and don’t depend on . In addition, the functions and don’t depend on . This yields the formula
[TABLE]
To take the final step, observe that
[TABLE]
In the same vein, we have
[TABLE]
and
[TABLE]
This shows that
[TABLE]
It follows that
[TABLE]
We end the proof of (6.3) using lemma 6.1 and symmetry arguments. This ends the proof of the proposition.
6.2 Generalized backward Itô-Ventzell formula
This section is mainly concerned with the proof of theorem 1.1. Before entering into the details of the proof we discuss how it applies to the process introduced in (6.1).
Consider the random fields
[TABLE]
In this notation, the backward random field formula (4.1) with takes the form
[TABLE]
We fix some given and we write instead of and set
[TABLE]
In this notation, we have
[TABLE]
Observe that as well as the Malliavin derivatives have moments of any order. Consider the processes
[TABLE]
In this notation, up to a change of sign and replacing by in (1.10) the stochastic interpolation formula stated in theorem 1.2 on the unit interval takes the following form
[TABLE]
More generally, suppose we are given a forward real valued continuous semi-martingale of the form (6.7) for some -adapted functions and , and a backward random field models of the form (6.6) for some -adapted functions .
We consider the following conditions:
: The functions , and as well as the differentials and are continuous w.r.t. for any given . In addition, for any we have
[TABLE]
: The Malliavin derivatives and are continuous w.r.t. and for any given . In addition, for any we have
[TABLE]
*: The random processes as well as are continuous w.r.t. the time parameter and they have moments of any order.
The next theorem is a slight extension of theorem 1.1 applied to the semi-martingale and the random fields models discussed in (6.7) and (6.5).
Theorem 6.3**.**
Consider a backward random field models of the form (6.6) for some functions satisfying and . Also let be a continuous semi-martingale of the form (6.7) functions and satisfying . In this situation, for any we have the generalized backward Itô-Ventzell formula
[TABLE]
The r.h.s. term in the above display is understood as a Skorohod integral.
Proof: We use the same approximation technique as in [12, 41] and [42] (see also the proof of theorem 3.2.11 in [37]). Consider a mollifier type approximation of the identify given for any by the function
[TABLE]
For any , applying the Itô-type change rule formula stated in proposition 8.2 in [38] to the product function
[TABLE]
we check that
[TABLE]
with
[TABLE]
The stochastic integral in the r.h.s. of (6.11) can be interpreted as a two-sided stochastic integral. Recalling that
[TABLE]
we check that
[TABLE]
Condition ensures that the processes and have moments of any order. In addition, under the regularity conditions and we check that
[TABLE]
Applying the Fubini theorem for Skorohod and measure theory integrals (see for instance [34, 37, 44] and the work by Leon [35]) we check that
[TABLE]
with
[TABLE]
Integrating by parts where derivatives of appear we check that
[TABLE]
From the a.s. continuity of in for each , we have
[TABLE]
The functions , and are almost surely continuous w.r.t. and uniformly locally bounded. In addition, the random variables and are integrable at any order. Moreover, under there exists some parameter depending on the support of such that for any we have the estimate
[TABLE]
Thus, by the dominated convergence theorem on equipped with the measure we have
[TABLE]
It remains to check that
[TABLE]
Observe that
[TABLE]
Using the chain rule property we have
[TABLE]
Integrating by parts, we check that
[TABLE]
Observe that
[TABLE]
On the other hand, we have
[TABLE]
Arguing as above, we have the estimate
[TABLE]
In the above display, stands for the sequences
[TABLE]
The last two terms depend on the Malliavin derivatives of and are they are given by
[TABLE]
Arguing as above, by the dominated convergence theorem we conclude that the Skorohod integral
[TABLE]
This ends the proof of (6.12), and the proof of the theorem is now easily completed.
We end this section with some comments.
Remark 6.4**.**
Recalling that the diffusion flow introduced in (6.1) has finite absolute moments of any order, the integrability conditions stated in (6.8) and (6.9) are satisfied as soon as the functions , the differentials , and the Malliavin derivatives have at most polynomial growth w.r.t. the state variable.
It is now readily check that and are met for the random fields introduced in (6.5).
The proof can be also be extended without difficulties to multivariate models. Following the proof of proposition 3.1 in [41], an alternative proof of theorem 6.3 based on Itô formula for Hilbert space valued processes can be developed. This elegant functional approach requires to introduce a custom Hilbert-space valued processes framework but this approach avoids to do explicitly the interchange of integration using the Fubini theorem for Skorohod and measure theory integrals. As the statement of proposition 3.1 in [41], the assumptions of theorem 6.3 can also be weaken when expressed in terms of this generalized stochastic calculus for Hilbert-space valued processes.
7 Illustrations
7.1 Perturbation analysis
Assume that and the drift function is given by a first order expansion
[TABLE]
for some perturbation parameter and some functions with .
In this context, the stochastic flow can be seen as a -perturbation of .
We further assume that the unperturbed diffusion satisfies condition .
To avoid unnecessary technical discussions on the existence of absolute moments of the flows we also assume that are uniformly bounded w.r.t. the parameters . In addition, is differentiable w.r.t. the coordinate and it has uniformly bounded gradients. In this situation, we set
[TABLE]
With some additional work to estimate the absolute moments of the flows, the perturbation analysis presented below allows to handle more general models. The methodology described in this section can also be extended to expand the flow at any order as soon as is sufficiently smooth.
The first order approximation is given by the following theorem.
Theorem 7.1**.**
For any , and we have the first order expansion
[TABLE]
with the first order stochastic flow
[TABLE]
The remainder second order term in the above display is such that for any s.t. we have the uniform estimate
[TABLE]
Proof.
Using (4.12) we readily check that
[TABLE]
By proposition 3.2 for any we have
[TABLE]
This yields the first order Taylor expansion (7.1) with
[TABLE]
and the second order remainder terms
[TABLE]
Arguing as above, for any s.t. we have the uniform estimate
[TABLE]
To estimate we need to consider the second order decompositions
[TABLE]
Combining proposition 3.3 with the estimate (7.2) for any s.t. we check that
[TABLE]
for some universal constant and the parameter {\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}(b,\sigma) introduced in (2.6). This ends the proof of (7.1). The proof of the theorem is completed.
7.2 Interacting diffusions
Consider a system of interacting and -valued diffusion flows , with given by a stochastic differential equation of the form
[TABLE]
for some Lipschitz functions and with appropriate dimensions. In the above display, stands for a collection of independent copies of -dimensional Brownian motion . Assume that linear w.r.t. the first coordinate.
In this situation, up to a change of probability space, the empirical mean of the process
[TABLE]
satisfies the stochastic differential equation
[TABLE]
Formally, the above diffusion converges as to the flow of the dynamical system defined by
[TABLE]
More rigorously and without further work, the forward-backward interpolation formula (1.10) yields directly the bias-variance error decomposition
[TABLE]
This readily implies the a.s. convergence
[TABLE]
After some elementary manipulations we check the bias formula
[TABLE]
We also have the almost sure fluctuation theorem
[TABLE]
7.3 Time discretization schemes
This section is mainly concerned with the proof of proposition 1.4. We fix some parameter and some and for any we set
[TABLE]
for some fluctuation parameter . For any we have
[TABLE]
Using (4.12), in terms of the tensor product (2.1) we readily check that
[TABLE]
Combining (3.5) with the Minkowski integral inequality we check that
[TABLE]
where the second line follows from the exponential estimate of the tangent process from proposition 3.3. The integrand will be bounded as follows: for any and any we have
[TABLE]
which then yields the stated result of the proposition. We now prove the stated bound on the difference of the drift processes. For any we have
[TABLE]
The -norm of the second integral term is bounded by .
The assumption , for some , implies the stochastic flows has uniform absolute moments of any order w.r.t. the time horizon, that is, we have that
[TABLE]
The stochastic flows also obey a similar moment bound: observe that for any we have
[TABLE]
Thus, for any we have
[TABLE]
We can check that the stochastic flows also have uniform moments w.r.t. the time horizon; that is, for any we have that
[TABLE]
Using this bounds, we check that
[TABLE]
The end of the proof now follows elementary manipulations, thus it is skipped. The proof of proposition 1.4 is now completed.
Appendix
In this appendix we prove the estimates (1.16) and (2.10) and proposition 3.3.
Proof of (2.10)
Whenever is satisfied, we have
[TABLE]
with the parameters
[TABLE]
Observe that
[TABLE]
After some elementary computations, for any we check that
[TABLE]
This implies that
[TABLE]
from which we check that for any we have
[TABLE]
This implies that
[TABLE]
from which we check that
[TABLE]
as soon as and . Replacing by and then by we check that
[TABLE]
This ends the proof of (2.10).
Proof of proposition 3.3
The proof of the estimate (3.10) is mainly based on the following technical lemma of its own interest.
Lemma 7.2**.**
Let be a non negative diffusion process satisfying in integral sense an inequality of the following form
[TABLE]
for some parameters and , and some non negative processes . In this situation, for any we have
[TABLE]
with the parameters
[TABLE]
Proof.
Applying Itô’s formula, for any , we have
[TABLE]
On the other hand, for any we have the almost sure inequality
[TABLE]
This implies that
[TABLE]
Applying Hölder inequality we check that
[TABLE]
This yields the estimate
[TABLE]
This ends the proof of the lemma.
We set
[TABLE]
and we also consider the collection of parameters
[TABLE]
with the tensor functions introduced in (3.9). Observe that
[TABLE]
Whenever is met we have
[TABLE]
Also observe that
[TABLE]
and
[TABLE]
In the same vein, we have
[TABLE]
We are now in position to prove proposition 3.3.
Proof of proposition 3.3:
Applying the above lemma to the processes
[TABLE]
and the parameters
[TABLE]
we obtain the estimate (7.4) with the parameters
[TABLE]
Observe that
[TABLE]
for some universal constant and the parameter {\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}(b,\sigma) defined in (2.6). Using (3.8) we check that
[TABLE]
Assume that
[TABLE]
In this case there exists some such that for any we have
[TABLE]
and therefore
[TABLE]
This ends the proof of the proposition.
Proof of (1.16)
Using (2.14), the generalized Minkowski inequality applied to (1.10) whenever is met for some and gives
[TABLE]
The Skorohod integral is estimated using theorem 5.2. Using (7.5) and (5.9) we check that
[TABLE]
as soon as the regularity conditions , and are satisfied for some parameter and some . Choosing and setting we check that
[TABLE]
as soon as and are satisfied for some parameter and some . For instance, and are satisfied as soon as
[TABLE]
This ends the proof of (1.16).
Acknowledgments
P. Del Moral is supported in part from the Chair Stress Test, RISK Management and Financial Steering, led by the French Ecole polytechnique and its Foundation and sponsored by BNP Paribas, and by the ANR Quamprocs on quantitative analysis of metastable processes.
We also thank the anonymous reviewers for their excellent suggestions for improving the paper. Their detailed comments greatly improved the presentation of the article.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] V. Alekseev. An estimate for the perturbations of the solution of ordinary differential equations. Vestn. Mosk.Univ., Ser. I, Math. Meh. vol. 2, (1961).
- 2[2] T. Ando and J. L. van Hemmen. An inequality for trace ideals. Commun. Math. Phys., vol. 76, pp. 143–148 (1980).
- 3[3] M. Arnaudon, P. Del Moral. A variational approach to nonlinear and interacting diffusions. Ar Xiv:1812.04269 (2018). Stochastic Analysis and Applications DOI: 10.1080/07362994.2019.1609985 (2019).
- 4[4] M. Arnaudon, P. Del Moral. A duality formula and a particle Gibbs sampler for continuous time Feynman-Kac measures on path spaces. Ar Xiv 1805.05044 (2018). Electronic Journal of Probability 25 (2020).
- 5[5] M. Arnaudon, P. Del Moral. A second order analysis of Mc Kean-Vlasov semigroups. Ar Xiv:1906.05140 (2019 ), Annals of Applied Probability, vol. 30, no. 6, pp. 2613--2664. (2020).
- 6[6] M. Arnaudon, H. Plank, A. Thalmaier. A Bismut type formula for the Hessian of heat semigroups. C. R. Math. Acad. Sci. Paris, vol. 336, no. 8, pp. 661--666 (2003).
- 7[7] R. Bellman. Some inequalities for the square Root of a Positive Definite Matrix. Linear Algebra and its applications, vol. 1, no. 3, pp. 321--324 (1968).
- 8[8] R. Bellman, Stability Theory of Differential Equations, Mc Graw Hill, New York, (1953).
