Spectral-free estimation of L\'evy densities in high-frequency regime
C\'eline Duval (MAP5), Ester Mariucci

TL;DR
This paper introduces a pathwise, spectral-free method for estimating the Le9vy density of pure jump Le9vy processes from high-frequency data, avoiding spectral techniques and the Le9vy--Khintchine formula.
Contribution
It proposes a novel direct, pathwise estimation procedure for Le9vy densities that works for infinite variation processes, bypassing spectral methods and using wavelet estimators.
Findings
Achieves classical nonparametric rates for finite variation processes
Demonstrates robustness near the critical estimation boundary
Shows effectiveness even with a Brownian component present
Abstract
We construct an estimator of the L\'evy density of a pure jump L\'evy process, possibly of infinite variation, from the discrete observation of one trajectory at high frequency. The novelty of our procedure is that we directly estimate the L\'evy density relying on a pathwise strategy, whereas existing procedures rely on spectral techniques. By taking advantage of a compound Poisson approximation, we circumvent the use of spectral techniques and in particular of the L\'evy--Khintchine formula. A linear wavelet estimator is built and its performance is studied in terms of loss functions, , over Besov balls. We recover classical nonparametric rates for finite variation L\'evy processes and for a large nonparametric class of symmetric infinite variation L\'evy processes. We show that the procedure is robust when the estimation set gets close to the critical value 0 and also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Spectral-free estimation of Lévy densities in high-frequency regime
Céline Duval 111Université Paris Descartes, MAP5, UMR CNRS 8145. E-mail: [email protected] and Ester Mariucci 222Otto von Guericke Universität Magdeburg, Germany. E-mail: [email protected].
Abstract
We construct an estimator of the Lévy density of a pure jump Lévy process, possibly of infinite variation, from the discrete observation of one trajectory at high frequency. The novelty of our procedure is that we directly estimate the Lévy density relying on a pathwise strategy, whereas existing procedures rely on spectral techniques. By taking advantage of a compound Poisson approximation, we circumvent the use of spectral techniques and in particular of the Lévy–Khintchine formula. A linear wavelet estimator is built and its performance is studied in terms of loss functions, , over Besov balls. We recover classical nonparametric rates for finite variation Lévy processes and for a large nonparametric class of symmetric infinite variation Lévy processes. We show that the procedure is robust when the estimation set gets close to the critical value 0 and also discuss its robustness to the presence of a Brownian part.
Keywords. Lévy density estimation, infinite variation, Lévy processes, nonparametric estimation.
AMS Classification. 60E07, 60G51, 62G07, 62M99.
1 Introduction
1.1 Motivations
It is now acknowledged that diffusion processes with jumps are good tools for modeling time varying random phenomena whose evolution exhibits sudden changes in value. One of the simplest way to allow for jumps is by considering a Lévy process, that is a continuous time process of the form , where is a Brownian motion. The peculiarity of a Lévy process is that for any , the law of is infinitely divisible and the paths of may have discontinuities. This explains why Lévy processes are a fundamental building block of many stochastic models; many of them have been suggested and extensively studied, for example, in mathematical finance; in physics, for turbulence, laser cooling and in quantum theory; in engineering for networks, queues and dams; in economics for continuous time-series models, in actuarial science for the calculation of insurance and re-insurance risk (see e.g. [2, 5, 6, 8, 31] for reviews and other applications).
The continuous part of is characterized by two real parameters and it can be handled easily. The behavior of the jump part is instead described by an infinite-dimensional object, the so-called Lévy measure or, equivalently, by the Lévy density whenever the Lévy measure admits a density with respect to the Lebesgue measure. If the Lévy density is continuous, determines how frequent jumps of size close to are to occur per unit of time. Thus, to understand the jump behavior of , it is of crucial importance to estimate .
When dealing with Lévy processes, two approaches are typically used:
- •
A spectral approach based on the Lévy–Khintchine formula which relates the characteristic function of to the Lévy density .
- •
A pathwise approach based on the Lévy–Itô decomposition (see (1.2) below).
Techniques employed to address the estimation problem of Lévy densities systematically rely on spectral approaches. They have proven their efficiency both theoretically and numerically. An exception is represented by [16], where the properties of a projection estimator of the Lévy density are discussed. In the present work we circumvent the use of spectral techniques in favor of a pathwise strategy. Having a spectral-free procedure paves the way to new techniques for studying richer classes of jump processes for which an equivalent of the Lévy–Khintchine formula is not available.
One way to proceed is to translate from a probabilistic to a statistical setting Corollary 8.8 in [33]: “Every infinitely divisible distribution is the limit of a sequence of compound Poisson distributions.” We are also motivated by the fact that a compound Poisson approximation has been successfully applied to approximate general pure jump Lévy processes, both theoretically and for applications. For example, it is a standard way to simulate trajectories of pure jump Lévy processes (see e.g. Chapter 6, Section 3 in [13]). An alternative strategy would consist in taking advantage of the asymptotic equivalence result in [27] to construct an estimator of the Lévy density . Yet, the resulting estimator would have the strong disadvantage of being randomized and, more fundamentally, it would require the knowledge of in a neighborhood of the origin.
In the literature, nonparametric estimation of finite Lévy densities, i.e. Lévy densities of compound Poisson processes, is well understood both from high frequency and low frequency observations (see, among others, [4, 7, 10, 9, 14, 35]). Building an estimator of for a Lévy process with infinite Lévy measure is a more demanding task; for any time interval , the process almost certainly jumps infinitely many times. In particular, is unbounded in any neighborhood of the origin. The techniques used for compound Poisson processes do not generalize immediately. Nevertheless, many results on the estimation of -infinite- Lévy densities from discrete data already exist. Spectral techniques enabled to build estimates of functionals of the Lévy density, such as or —which arise naturally when using the Lévy–Khintchine formula— leading to estimators of on compact sets away from 0. A non-exhaustive list of works estimating for and loss functions includes [12, 11, 17, 18, 20, 21, 24, 28, 34]; a review is also available in the textbook [3].
1.2 Notations and definitions
Before detailing the results, we introduce some necessary notations and definitions.
Lévy–Itô decomposition
It is well known that any Lévy process is a càdlàg process that can be written as
[TABLE]
where , , is a Borel measure on such that and , is a standard Brownian motion, and the processes , and are independent.
In this paper we focus on the class of pure jump Lévy processes of the form
[TABLE]
For all , the Lévy–Itô decomposition allows to write a Lévy process as in (2) as the sum of two independent Lévy processes, the first one (resp. the second one) having jumps smaller (resp. larger) in absolute value than . Namely
[TABLE]
where the drift is defined as
[TABLE]
and are two independent Lévy processes. The process is a centered martingale consisting of the sum of the small jumps i.e. the jumps of size smaller than . The process instead, is a compound Poisson process defined as follows: is a Poisson process of intensity and are i.i.d. random variables independent of such that , for all .
Compound Poisson approximation
Denote by (resp. ) the Lévy density of (resp. ), i.e. (resp. ). Let be the density, with respect to the Lebesgue measure, of the random variables , i.e. . We are interested in estimating in any set of the form for all , where . The latter condition is technical; if is a compound Poisson process we may choose , otherwise we work under the simplifying assumption that is a bounded set. Observe that, for any ,
[TABLE]
Therefore, estimating in from the increments of is equivalent to estimating the Lévy density of the compound Poisson part of , namely , from the increments of . When , we may take and Equation (1.2) reduces to a compound Poisson process with intensity and jump density .
A nonparametric class of Lévy densities
Assume that the Lévy measure is absolutely continuous with respect to the Lebesgue measure and denote by the Lévy density of . We pay special attention to the following nonparametric class of Lévy densities. Consider and a positive constant, define the class of functions
[TABLE]
A Lévy density belongs to the class , , , if . In particular contains any -stable Lévy density such that . Any finite variation Lévy process is in the class for some positive .
In Theorem 2, we provide upper bounds for general Lévy densities. To derive explicit rates of convergence we will examine, in particular, the cases where belongs to relying on the results of [15] (see Appendix B).
Observation setting and loss function
Suppose we observe on at the sampling rate . Without loss of generality, set with , and define
[TABLE]
We consider the high frequency setting where and as . The assumption is necessary to construct a consistent estimator of . The difference with the works listed in Section 1.1 is that we build a spectral-free estimator of , without smoothing treatment at the origin, and study the following risk. Define the class L_{p,\varepsilon}=\big{\{}g:\|g\|_{L_{p},\varepsilon}:=\Big{(}\int_{A(\varepsilon)}|g(x)|^{p}dx\Big{)}^{{1}/{p}}<\infty\big{\}}, where and is the estimation set defined above. Define the loss function
[TABLE]
Finally, denote by the distribution of the random variable and by the law of the random vector defined in (7). Since is a Lévy process, its increments are i.i.d., hence
[TABLE]
In the following, whenever confusion may arise, the reference probability in expectations is explicitly stated, for example, writing .
1.3 Estimation strategy and results
For any fixed (when the choice is allowed), taking advantage of Equation (5), we build an estimator of on the set by constructing estimators for and separately. For that we consider the increments of (7) larger than in absolute value. Define the dataset
[TABLE]
where is the subset of indices such that \mathscr{I}_{\varepsilon}:=\big{\{}i=1,\dots,n:|X_{(i-1)\Delta}-X_{i\Delta}|>\varepsilon\big{\}}. Its random cardinality is denoted by
[TABLE]
Our estimation strategy is the following.
We build an estimator of using the following Lemma:
Lemma 1**.**
Let be a Lévy process with Lévy measure absolutely continuous with respect to the Lebesgue measure. Then, for all
[TABLE]
In particular, Lemma 1 implies and
[TABLE]
Lemma 1 is a modification of Lemma 6 in Rüschendorf and Woerner [32]. Their proof relies on spectral arguments, we provide a spectral free version of the proof in the Appendix. 2. 2.
From the observations in (8) we build a wavelet estimator of using that, for small, the random variables are i.i.d. with a density close to (see Lemma 3 below). 3. 3.
Finally, we estimate on following (5) by
[TABLE]
The presence of the small jumps makes the estimation of and from observed increments larger than delicate. Indeed, if is such that , it is not automatically true that there exists such that (or any other fixed positive number).
In Section 2 we establish upper bounds for the risks of the estimators and (see Theorem 1, Proposition 1 and Corollary 2). The main difficulty in their study lies in the presence of the small jumps that play a role in both cases. We stress that when is fixed, the quantity is bounded. But as we generalize the results to the case (see Section 3.2.3) this is no longer true: whenever , it holds as . Therefore, in all the results of the paper we always keep explicit the dependency in .
Our main results on the estimator (11) of the Lévy density are given in Section 3. Theorem 2 provides a general upper bound that tends to 0, regardless of the rate at which tends to 0. Interestingly, the estimation strategy leads to an upper bound where terms depending on the behavior of the small jumps appear. These terms make it difficult to derive an explicit rate of convergence without additional assumptions on the Lévy density. Therefore, in Theorems 3 and 4 we consider additional assumptions, satisfied in particular by the classes introduced in (6), and derive explicit rates of convergence. Similarly to what happens when using spectral procedures, Theorem 3 ensures that we recover the classical rates for finite variation Lévy densities when (see e.g. [11, 12]). Furthermore, we show that this rate is attained for a large nonparametric class of symmetric infinite variation Lévy processes. Theorem 4 generalizes the rates of Theorem 3, in particular to slower regimes for . Finally, Theorem 5 shows that our procedure is robust to the case . For the sake of clarity, the results of Sections 2 and 3 are stated for purely discontinuous Lévy processes, even though our procedure generalizes to the presence of a Gaussian part as detailed in Section 3.4.
Finally, Section 4 contains the proofs of the main results while Appendix A collects the proofs of the auxiliary results and in Appendix B technical results of [15] are partially reproduced and used to establish Theorems 3, 4 and 5.
2 Preliminary estimators
2.1 Statistical properties of
First, we define an estimator of the intensity of the Poisson process in terms of , the number of jumps that exceed following (10).
Definition 1**.**
Let be the estimator of defined by
[TABLE]
where is defined as in (9).
Observe that as , which is the maximum likelihood estimator of in the experiment . In [29] and [30], estimators of the cumulative distribution function of the Lévy measure, which is closely related to , are build and Donsker theorems are derived. In [30] a direct approach similar to (12) is considered, the performances are investigated in , for a domain bounded away from 0. We establish the following bound for .
Theorem 1**.**
Let be a Lévy process as in (2) and let , , be such that . Let be the estimator of defined in (12). Then, there exists a constant , depending only on , such that
[TABLE]
In general, the quantity is not easy to handle. By Lemma 1, it holds \lim_{\Delta\to 0}\bigg{|}\lambda_{\varepsilon}-\frac{\mathbb{P}(|X_{\Delta}|>\varepsilon)}{\Delta}\bigg{|}=0, but the rate of convergence is not known in general. Nevertheless, in many cases of interest, it holds that as This motivates Assumption () below that leads to Corollary 1.
Assumption : is a Lévy process with a Lévy measure such that
[TABLE]
where is a constant that does not depend on .
Corollary 1**.**
Let be as in (2) and such that () is satisfied for some and . Let , and ; the estimator (12) of satisfies for all
[TABLE]
where is a positive constant depending on and .
Assumption () requires a non-asymptotic control on the cumulative distribution function of the Lévy process for small times. Asymptotic expansions have been established in the literature such as Theorem 3.2 in [18], where a control of is given for bounded away from the origin. However, no indication on how small should be nor on how large should be is given.
It is possible to establish that () holds true on the nonparametric class (see (6)). This is a consequence of [15] whose main results are reproduced in Appendix B. Theorem 7 ensures that for and , () is satisfied for and depending on and . Moreover, if is symmetric and its Lévy density is -Lipschitz on the interval , Theorem 10 ensures that for and , () is satisfied for and depending on and . Note that any Lévy density of the form for all where is a bounded differentiable function with bounded derivative is -Lipschitz on the interval . In particular it is satisfied if is an -stable process. Furthermore, in Theorems 7 and 10, the dependency in of the constant is explicit, which will allow to control the loss between and in the asymptotic .
2.2 Statistical properties of
2.2.1 Construction of
To recover the jump density , we exploit the high frequency setting. For small, it holds h_{\varepsilon}\approx\mathcal{L}(X_{\Delta}\big{|}|X_{\Delta}|>\varepsilon). Focusing on the increments larger than in absolute value, we estimate the density using a linear wavelet density estimator and study its performances uniformly over Besov balls (see Kerkyacharian and Picard [25] or Härdle et al. [22]). We state the result and assumptions in terms of the quantity of interest .
Preliminary on Besov spaces
Let be a pair of scaling function and mother wavelet which are compactly supported, of class and generate a regular wavelet basis adapted to the estimation set (e.g. Daubechie’s wavelet). Moreover suppose that is an orthonormal family of . For all we write for
[TABLE]
where , and the coefficients are
[TABLE]
As we consider compactly supported wavelets, for every , the set incorporates boundary terms that we choose not to distinguish in notation for simplicity. In the sequel we apply this decomposition to . This is justified because implies and the coefficients of its decomposition are and . The latter can be interpreted as the expectations of and where is a random variable with density with respect to the Lebesgue measure.
We define Besov spaces in terms of wavelet coefficients as follows. For , and a function belongs to the Besov space if the norm
[TABLE]
is finite, with the usual modification if . We consider Lévy densities with respect to the Lebesgue measure, whose restriction to the set lies into a Besov ball:
[TABLE]
where , for a fixed constant . Note that the regularity assumption is imposed on viewed as an function. Therefore the dependency in lies in . Also, the parameter measuring the loss of our estimator is the same as the one measuring the Besov regularity of the function, this is discussed in Section 2.2.2. Lemma 2 below follows immediately from the definitions of and the Besov norm (13).
Lemma 2**.**
For all , let be in . Then, belongs to the class \mathscr{F}\big{(}s,p,q,{\mathfrak{M}},A(\varepsilon)\big{)}.
Construction of
To estimate the jump density , we only have access to the indirect observations , where for each , it holds
[TABLE]
The problem is twofold. First, there is a deconvolution problem as the information on is contained in the observations . The distribution of the noise is unknown, but it is small as its variance as . Then, we neglect this noise:
[TABLE]
Second, overlooking that it is possible that for some , and yet the common density of is not but it is given by
[TABLE]
where denotes the convolution product. Again, in the asymptotic , we neglect the possibility that more than one jump of occurred in an interval of length .
Lemma 3**.**
For all , and , it holds \big{\|}{p}_{\Delta,\varepsilon}-h_{\varepsilon}\big{\|}_{L_{p,\varepsilon}}\leq 2\Delta e^{\lambda_{\varepsilon}\Delta}\|f\|_{L_{p,\varepsilon}}.
Define the estimator based on the chain of approximations
[TABLE]
where is an integer to be chosen and
[TABLE]
If , the estimator is 0, which occurs with probability . We work with a linear estimator even if linear estimators are not always minimax for general Besov spaces , (). Indeed, to evaluate the loss caused by neglecting the small jumps (see (15)), we make an approximation at order 1 of our estimator . We thus require our estimator to depend smoothly on the observations, which is not the case for usual thresholding methods. Finally, we recall that on the class this estimator is optimal in the context of density estimation from direct i.i.d. observations (see [25], Theorem 3).
2.2.2 Upper bound results
Adapting the results of [25], we derive a conditional upper bound for the estimation of when the Lévy measure is infinite. Recall that with .
Proposition 1**.**
Suppose , fix and that belongs to the class defined in (14), for some , , and . If suppose that , for some symmetric function . Let and let be the wavelet estimator of on , defined in (17). Let , , and . For any such that , the following inequality holds. For all and for all finite ,
[TABLE]
where denotes the cardinality of , and only depends on , , , , and .
Comments
If , then and we get \mathbb{E}\big{[}\|\widehat{h}_{n,\varepsilon}-h_{\varepsilon}\|_{L_{p,\varepsilon}}^{p}|\mathscr{I}_{\varepsilon}\big{]}\leq\|h_{\varepsilon}\|_{L_{p,\varepsilon}}^{p}. A straightforward adaptation of the proof of Proposition 1 allows to take if is a compound Poisson process. The constraint on for is classical (see e.g. [25]). For instance, it is satisfied if is compactly supported. Assumption is not restrictive, by means of Lemma 1, it holds for all
[TABLE]
Moreover, on the class , and defined in (6), Theorems 6 and 7 permit to derive a bound in when is fixed. A similar result can be obtained if assuming additionally that is symmetric: Theorems 8 and 9 give a bound of the form when is fixed. Finally, if , the same holds by replacing with .
To get unconditional bounds we introduce the following result.
Lemma 4**.**
Let . For all we have
[TABLE]
Using Lemma 4, we remove the conditioning on The terms appearing in the following upper bound are discussed in the more general setting of Section 3.1.1 below.
Corollary 2**.**
Fix , assume that belongs to the class defined in (14), for some , and . If suppose that , for some symmetric function . Let and let be the wavelet estimator of on , defined in (17). For any and such that and , the following inequality holds. For all and :
[TABLE]
for and depending only on , , , , and .
3 Statistical properties of
3.1 Main Theorem: general result
Combining the results in Proposition 1 and Corollary 2 we derive the following upper bound for the estimator of the Lévy density , when .
Theorem 2**.**
Fix , suppose belongs to the class defined in (14), for some , and . If suppose that , for some symmetric function . Let and let be the estimator of on , defined in (11). For any and such that and , the following inequality holds. For all , , there exists such that:
[TABLE]
where , , , , and depends on , , , , and .
3.1.1 Comments on the upper bound
Theorem 2 gives an explicit upper bound for the -risk restricted to the estimation set for the estimation of the Lévy density of a pure jump Lévy process as in (2). Note that Lemma 1 (see also (18)) ensures that tends to 0 when . Therefore the upper bound of Theorem 2 tends to 0 whenever and , under the assumption that the Lévy density has a Besov norm restricted to that does not grow more than a constant times .
We provide below a rough intuition of the different terms appearing in Theorem 2. The estimation strategy relies on different approximations that entail four different sources of errors (points 3-4-5 hereafter are related to the estimation of ).
Controlling the presence of jumps: The term \|f_{\varepsilon}\|^{p}_{L_{p,\varepsilon}}\big{(}1-F_{\Delta}(\varepsilon)\big{)}^{n} provides a control of the risk when no jumps larger than are observed in a dataset. This term is bounded by and tends to 0 exponentially fast under the assumption as . 2. 2.
Estimation of : it leads to the error \Big{(}\frac{F_{\Delta}(\varepsilon)}{n\Delta^{2}}\Big{)}^{\frac{p}{2}}+\Big{|}\lambda_{\varepsilon}-\frac{F_{\Delta}(\varepsilon)}{\Delta}\Big{|}^{p}:=E_{1}. 3. 3.
Neglecting the event : Considering that each time an increment exceeds the threshold the associated Poisson process is nonzero leads to the error
[TABLE]
This error is unavoidable as we do not observe and separately. 4. 4.
Neglecting the presence of : In (15) we ignore the convolution structure of the observations. This produces an error in
[TABLE]
It seems reasonable to neglect : the distribution of is unknown. Even if we did know it, deconvolution methods essentially rely on spectral approaches which we meant to avoid. 5. 5.
Estimation of the compound Poisson : This estimation problem is solved in two steps. First, we neglect the event which generates the error: This could have been improved considering a corrected estimator as in [14], but it would have added even more heaviness in the final result. Second, for , we recover an estimation error that is classical for the density estimation problem from i.i.d. observations in E_{5}:=2^{-Js}+2^{J/2}\ell_{p,\varepsilon}^{1/2}\big{(}nF_{\Delta}(\varepsilon)\big{)}^{-1/2}.
The rate of convergence is not explicit in terms of , it depends on the quantities and and therefore on the Lévy measure . Consequently, we cannot say in general which one of the above error terms , , , or is predominant.
Moreover, the rate of convergence depends on the choice made for . There is a bias term in (see ), where is the regularity of , that decreases with , whereas all the other —variance type— terms (, and the second term of ) increase with . Ideally, should be selected of the order of a minimizer of the upper bound of Theorem 2. In the results below, from the idea that and are –under suitable assumptions– remainder terms and should not intervene in the final rate, we focus only on and select as a minimizer of . An adaptive procedure to select is discussed in Section 3.3.
In Theorems 3 and 4 below, the rates of convergence for the -risk \ell_{p,\varepsilon}\big{(}\widehat{f}_{n,\varepsilon},f\big{)} are given under additional assumptions on and , which are satisfied on the class defined in (6). Finally, Theorem 5 deals with the case , it shows that our estimator is robust in this context.
3.1.2 Example: Compound Poisson process
If is a compound Poisson process, we fix . Then , and . The bound given in Theorem 2 simplifies and choosing such that we get \big{[}\ell_{p,0}\big{(}\widehat{f}_{n,0},f\big{)}\big{]}^{p}\leq C\big{\{}(n\Delta)^{-\frac{sp}{2s+1}}+\Delta^{p}\big{\}}, where the first term is the optimal rate of convergence to estimate from the observations and the second term is a deterministic error due to the omission of the event that more than one jump may occur in an interval of length (see also [14]).
3.2 An explicit rate under additional assumptions
3.2.1 Regimes where
Without any specific assumption on the Lévy density, the rate of Theorem 2 is not explicit. This rate can be simplified under the additional assumptions (), and as well as choosing such that . The dimension is selected such that the estimator of is rate optimal from the observations . The following theorem is derived from Theorem 2 (see Appendix A for the proof).
Theorem 3**.**
Let be a Lévy process as in (2) and let be a Lévy measure admitting a density with respect to the Lebesgue measure. Fix , suppose that belongs to the class where is defined in (14), for some , , and and where , for some and . Let and let be the estimator of on , defined in (11), let and be such that , , and . Then, for all , it holds that \big{[}\ell_{p,\varepsilon}\big{(}\widehat{f}_{n,\varepsilon},f\big{)}\big{]}^{p} is bounded uniformly over by
[TABLE]
where is a constant depending on , , , , , and .
If , the result of Theorem 3 follows from the case and the Hölder inequality, as the set is bounded and .
The quantity appearing in the hypotheses of Theorem 3 can be easily controlled on the class of subordinators satisfying (), , for instance for a Gamma process. Indeed, for any subordinator it holds under Assumption () it becomes v_{\Delta}(\varepsilon)=e^{\lambda_{\varepsilon}\Delta}(F_{\Delta}(\varepsilon)+e^{-\lambda_{\varepsilon}\Delta}-1)=O\big{(}\Delta^{2}\big{)} as . More generally, it is possible to give explicit upper bounds for on a larger class of Lévy processes than the subordinators, namely the class , and , on which it is possible to show that
[TABLE]
see Theorem 6. The same result can be obtained on the class , , under the additional assumption of a symmetric Lévy density which is also -Lipschitz on the interval , see Equation (40). Concerning the generality of Assumption () we emphasize that it is always satisfied for Lévy densities in , (see Theorem 7) and for symmetric Lévy densities in , that are -Lipschitz on the interval , see Theorem 10.
3.2.2 An explicit rate for all regimes
Assumption : is a Lévy process as in (2) such that
[TABLE]
for some and where is a positive constant that does not depend on .
Note that () is satisfied for if belongs to the class , (see (6) and Theorem 6) or if is a subordinator such that () is satisfied. If is a symmetric infinite variation process, () is satisfied for if belongs to the class , (see Theorem 8). Observe that the exponent cannot be improved in general, see Section 2.2 in [15]). However, if, additionally, is -Lipschitz on the interval , the result can be improved with (see (40)). The case being covered by Theorem 3, we concentrate on the cases or
Theorem 4**.**
Let be a Lévy process as in (2) and let be a Lévy measure admitting a density with respect to the Lebesgue measure. Fix , suppose that belongs to the class defined by
[TABLE]
with defined in (14), for some , , and , and for some , and . Let and let be the estimator of on , defined in (11). Suppose that and that is such that .
Then, for all , the -risk \big{[}\ell_{p,\varepsilon}\big{(}\widehat{f}_{n,\varepsilon},f\big{)}\big{]}^{p} is bounded uniformly over by
[TABLE]
where , and , is a constant depending on , , , , , and .
From the proof of Theorem 4, we have the following relations: if and only if . Moreover, if , then if and if . If , and then
3.2.3 Estimation in a neighborhood of the origin
In the following result, we make explicit the dependency in of the bound for This shows that is robust when slowly with respect to .
Theorem 5**.**
Let be a Lévy process as in (2) and let be a Lévy measure admitting a density with respect to the Lebesgue measure. Fix and suppose that for some and for some , , and (see (6) and (14)). If suppose additionally that is symmetric and is -Lipschitz on the interval .
Let and let be the estimator of on , defined in (11). Suppose that , and are such that , , and , where is defined in (39).
Then, for all , it holds
[TABLE]
where is a constant depending on , , , , , , , , and .
Note that for and an -stable processes, , using , we recover for and selected as in Theorem 5 that \ell_{2,\varepsilon}\big{(}\widehat{f}_{n,\varepsilon},f\big{)}\leq C\lambda_{{\varepsilon}}(n\Delta\lambda_{\varepsilon})^{-\frac{s}{2s+1}}.
3.3 Discussion
3.3.1 General comments
Theorem 3 ensures that for any finite variation process of the form (2) –whose Lévy density is automatically in – in regimes such that , our estimator attains the rate uniformly over a Besov class of regularity . This rate is also attained for any symmetric infinite variation process such that its Lévy density is in for some and is -Lipschitz on the interval . As mentioned earlier, the Lipschitz condition is satisfied if for all where is a bounded differentiable function with bounded derivative. This regularity condition on on a neighborhood of is different from requiring that
This result generalizes [11]: there, under the assumptions , and is in a Sobolev class of regularity , the same rate is attained for the estimation of for a loss function. However, in dimension , the hypothesis is in a Sobolev space with regularity implies that and that for (see Theorem 4.2 in [1]). Therefore, we generalize [11] to loss functions and to symmetric infinite variation Lévy processes in , .
Moreover, the results of Theorem 4 generalizes the latter to loss functions and without assumptions on other that . In these cases the rate of our procedure may be slower than .
Theorem 5 shows that our estimator is robust when gets close to the critical value 0. This form of results was not studied in the literature. Unsurprisingly, the rate deteriorates as gets close to the critical value 0 as it gets multiplied with the increasing quantity \lambda_{\varepsilon}^{\frac{p}{2s+1}}\varepsilon^{-\frac{sp}{2s+1}\big{(}1+\alpha-\frac{2}{p}\big{)}}.
3.3.2 Discussion on optimality
The question whether the upper bound of Theorem 2 is optimal remains open. When estimating , for fixed, we in fact estimate a Poisson measure. It is well known that the minimax rate of convergence over Besov classes with regularity for the estimation of the Lévy density of a compound Poisson process from the observation of of its increments sampled at rate is , see e.g. [14].
However, in the present context, the data available to estimate the Poisson measure are not the increments of a compound Poisson process but those of a general Lévy process. This is a more challenging problem and in particular we can deduce that the rate cannot be improved on the class (see also Section 4 of [16]). This rate is obtained in Theorem 3 under the additional assumptions , , () with and . Remark that for Lévy densities belonging to the nonparametric class if or for smooth symmetric Lévy densities in with , the last two conditions are automatically satisfied as soon as , where is as in (19). Theorem 5 shows that this rate is robust to small values of .
Finally, we observe that there exist examples of Lévy processes of infinite variation and with non-smooth Lévy densities that do not satisfy Assumption (), see e.g. [26]. Upper bounds for the quantities and are still known in this situation, see [15]. More precisely, for symmetric Lévy densities in with , it holds
[TABLE]
for some constants and whose dependency in and can be made explicit. Using (20) in Theorem 2 leads to a rate that depends on and which is slower than . Nevertheless it is hard to say whether such a rate of convergence is optimal.
3.3.3 Adaptive selection procedure for
Motivated by the belief that the dominating term in the upper bound of Theorem 2 is , in Theorems 3 and 4 ( is fixed) we selected such that A similar quantity depending on is considered in Theorem 5. However, such depends on the unknown regularity and is not feasible in practice. We propose here a data-driven procedure to select a suitable dimension .
Usually, for wavelet type estimators, adaptation is achieved by thresholding techniques. However, the study of our estimator relies on the property that (X_{i\Delta})_{i}\mapsto\widehat{\alpha}_{J,k}\big{(}(X_{i\Delta})_{i}\big{)} is differentiable, which is no longer true if we threshold the coefficients \big{(}\widehat{\alpha}_{J,k}\big{)}_{k}.
Ideally, should be selected of the order of a minimizer of the upper bound of Theorem 2 or equivalently a term realizing the trade-off between this bias term and all the variance terms depending on (see and the second term of in Section 3.3.1). The following adaptive procedure to select from the observations is inspired from the Goldenshluger and Lepskii’s method see e.g. [19]. Select such that an estimator of the bias is of the order of the variance. Denote by (see (11) and (17)), and consider the data driven choice
[TABLE]
where , is a constant to be calibrated and is an upper bound of the variance term appearing in the bound of Theorem 2. We do not seek for dimensions larger than as then the variance term no longer tends to 0 (see in Section 3.3.1). However, the problem of the definition (21) is that it requires an explicit, sharp, upper bound for which is not available without additional assumptions on the Lévy density , and . Under the Assumptions of Theorem 3, we can set,
[TABLE]
We do not investigate the question whether the resulting estimator satisfies the same upper bound –up to a numerical constant– as . A key element in establishing such a bound would be the control of the deviations of from .
3.4 Extensions
If the Lévy process has a Brownian component, the estimator presented here applies and the results established can be generalized without technical difficulties. Let be a Lévy process of the form:
[TABLE]
where is a standard Wiener process, independent of and . This corresponds to consider with as in (2).
Similarly, consider the increments \big{(}\widetilde{X}_{i\Delta}-\widetilde{X}_{(i-1)\Delta},i\in\widetilde{\mathscr{I}}_{\varepsilon}\big{)} where \widetilde{\mathscr{I}}_{\varepsilon}:=\big{\{}i=1,\ldots,n:|\widetilde{X}_{i\Delta}-\widetilde{X}_{(i-1)\Delta}|>\varepsilon\big{\}}. Equations (12) and (17) applied to these increments give estimators of and . Since , in the asymptotic approximation (15) still makes sense when applied to the increments of .
Theorem 1 holds true in this setting, its proof remains unchanged by the additional Brownian part. However, the quantity needs to be handled differently in examples. Moreover, at the expense of small modifications in its proof, Corollary 2 still holds after replacing in its statement by v_{\Delta,\sigma}(\varepsilon):=\mathbb{P}\big{(}|\Delta b_{\nu}(\varepsilon)+M_{\Delta}(\varepsilon)+\sigma W_{\Delta}|>\varepsilon\big{)} (this quantity agrees with the previous definition when ).
Combining those results, Theorem 2 holds. Applying Lemma 1, we recover that and as , from which we obtain the consistency of the procedure. Moreover, Theorems 3, 4 and 5 can also be obtained adapting () accordingly, replacing with Finally, as discussed in Section 2.5 of [15] the results of Appendix B also hold in presence of a Gaussian component and Assumptions () and () are satisfied on the class , .
Another interesting extension would be to apply this methodology to estimate the Lévy density of an Itô semimartingale from high frequency observations as studied in [23]. Then, for small enough and under suitable assumptions, the contribution of the drift and the diffusive part of the process can be neglected and the above strategy should generalize. However, it would induce more technicalities in the proofs as the additional drift and diffusive part are not necessarily constant nor deterministic.
4 Proofs
In the sequel, is a constant whose value may vary from line to line. Its dependencies may be given in indices. Proofs of auxiliary lemmas are postponed to Appendix A.
4.1 Proof of Theorem 1
Let and . The following holds
[TABLE]
To control the second term in (22), we introduce the i.i.d. centered random variables
[TABLE]
For , an application of the Rosenthal inequality together with \mathbb{E}\big{[}|U_{i}|^{p}\big{]}=O\big{(}\frac{F_{\Delta}(\varepsilon)}{n^{p}}\big{)} ensure the existence of a constant such that
[TABLE]
For , the Jensen inequality and the previous result for lead to
[TABLE]
In the asymptotic using (22), we are only left to show that, for ,
[TABLE]
An application of the Bernstein inequality (using that and the fact that the variance ) allows us to deduce that
[TABLE]
Therefore,
[TABLE]
Observe that, for , the denominator is smaller than while for we have . It follows, after a change of variables, that
[TABLE]
where, is the incomplete Gamma function and is the usual Gamma function. To conclude, we use the classical estimate for the incomplete Gamma function for : \Gamma(s,x)\approx x^{s-1}e^{-x}\big{(}1+\frac{s-1}{x}+O({x^{-2}})\big{)}. When (24) is divided by , it is asymptotically O\big{(}{(n\Delta)^{-p}}e^{-nF_{\Delta}(\varepsilon)}\big{)}, which goes to 0 faster than (23).
4.2 Proof of Proposition 1
Preliminary
As the proof is lengthy, we enlighten here the main difficulties arising from the fact that the estimator uses the observations , i.e. . As it holds .
The cardinality of is that is random. That is why in Proposition 1 we study the risk of this estimator conditionally on 2. 2.
An observation of is not a realization of . Indeed, an increment of does not necessarily correspond to one jump, whose density is . More demanding, the presence of the small jumps needs to be taken into account. To do so we split the sample in two parts according to the presence or absence of jumps in the Poisson part. On the subsample where the Poisson part is nonzero, we make an expansion at order 1 and we neglect the presence of the small jumps.
Expansion of
Consider the increments larger than . Recall that, for each , we have
[TABLE]
We split the sample as follows: and Denote by the cardinality of . To avoid cumbersomeness, in the remainder of the proof we write instead of and instead of . Recall that . Using that is continuously differentiable we can write, ,
[TABLE]
where . It follows that
[TABLE]
where conditional on , is the linear wavelet estimator of defined in (16) from direct measurements. Explicitly, it is defined as follows
[TABLE]
where . This is not an estimator as both and are not observed. However, approximates the quantity
[TABLE]
Decomposition of the loss
Taking the norm and applying the triangle inequality we get
[TABLE]
After taking expectation conditionally on and , we bound each term separately.
Remark 1**.**
If is a compound Poisson process and we take , then (and ) and .
Control of
We have
[TABLE]
The deterministic term is bounded using Lemma 3 by . Taking the expectation conditionally on and of , we recover the linear wavelet estimator of studied by Kerkyacharian and Picard [25] (see their Theorem 2). For the sake of completeness we reproduce the main steps of their proof. The control of the bias is the same as in [25]. Noticing that Lemma 2 implies {p}_{\Delta,\varepsilon}\in\mathscr{F}\big{(}s,p,q,\mathfrak{M},A(\varepsilon)\big{)} (see Lemma 5.1 in [14]), we get
[TABLE]
where and are defined in (25) and (26). First consider the case . We start by observing that
[TABLE]
where denotes the cardinality of the set . To bound the last term we apply the inequality of Bretagnolle and Huber to the i.i.d. centered random variables \big{(}\Phi_{Jk}(Z_{i\Delta}-Z_{(i-1)\Delta})-\mathbb{E}[\Phi_{Jk}(Z_{i\Delta}-Z_{(i-1)\Delta})]\big{)}_{i\in I} bounded by , conditional to . We obtain the bound on the previous term
[TABLE]
Therefore, we get
[TABLE]
where, as developed in [25],
[TABLE]
We can then conclude that
[TABLE]
Plugging this last inequality in we obtain
[TABLE]
where is a constant depending on and . Gathering all terms we get, for ,
[TABLE]
where depends on and . For , together with the additional assumption of , following the lines of the proof of Theorem 2 of Kerkyacharian and Picard [25] we obtain the same bound as above replacing with . Finally, using we have established for that
[TABLE]
where depends on and and .
Note that taking such that we have, uniformly over , an upper bound in for the estimation of , which is the optimal rate of convergence for a density from direct independent observations (see [25]). Moreover, we did not use that is bounded to control this quantity, it was possible to have .
Control of
Using the fact that is compactly supported, we get
[TABLE]
Furthermore, we use the following upper bound for the last term in the expression above:
[TABLE]
From the Rosenthal inequality conditional on and we derive for
[TABLE]
Observe that . There exists a constant , only depending on , such that Set .
For we obtain the same result using the Jensen inequality and the latter inequality with . Next,
[TABLE]
As is compactly supported, and since we estimate on a set bounded by , for every , the set has cardinality bounded by where depends on the support of and . It follows that,
[TABLE]
Control of
Similarly, for the last term we have
[TABLE]
Deconditioning on
Substituting (28), (29) and (30) into (27), and noticing that is negligible compared to , we obtain
[TABLE]
where depends on , , , , and . To remove the conditional expectation on we apply the following lemma, whose proof is postponed in the appendix.
Lemma 5**.**
Let and . If and , then for all , there exists a constant depending on such that
[TABLE]
Finally, using Lemma 5 and that , we complete the proof.
4.3 Proof of Theorem 2
Theorem 2 is a consequence of Theorem 1 and Corollary 2. For all , we decompose as follows:
[TABLE]
The term is controlled by means of Theorem 1 combined with the fact that if then , which implies . Concerning the term , the Cauchy-Schwarz inequality gives
[TABLE]
The term is treated using the triangle inequality and Theorem 1. For , notice that as is bounded, an application of the Jensen inequality yields:
[TABLE]
where depends on . The rate of the right hand side of the inequality has been studied in Corollary 2.
Acknowledgements
The work of E. Mariucci has been partially funded by the Federal Ministry for Education and Research through the Sponsorship provided by the Alexander von Humboldt Foundation, by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – 314838170, GRK 2297 MathCoRe, and by Deutsche Forschungsgemeinschaft (DFG) through grant CRC 1294 ’Data Assimilation’.
Appendix A Additional proofs
Proof of Lemma 1
Using the decompositions (1.2) and (1.2), for any it holds:
[TABLE]
for some drift that might depend on . We shall complete the proof by showing that for .
Control of . By the Markov inequality, for all it holds
[TABLE]
Observe that is a Gaussian random variable with mean and variance independent of . Hence, by the binomial theorem,
[TABLE]
In the sequel, we establish that
[TABLE]
where , . To see that, one needs to control the moments of and . To compute we use that the th cumulant of is and the properties of Bell’s polynomials to derive that for ,
[TABLE]
where and denote the -th complete exponential Bell polynomial and the incomplete exponential Bell polynomials, respectively. From (32), it directly follows that for all and . Secondly, using the formulas for the moments of Gaussian distributions, we derive that for all and . Collecting all pieces together, we deduce that
[TABLE]
We then conclude that by observing that Indeed, let us write
[TABLE]
Using that joined with \big{(}\frac{x}{\varepsilon}\big{)}^{2k}\leq\big{(}\frac{x}{\varepsilon}\big{)}^{2}, and , we get
[TABLE]
Similarly, tends to 0 as . Finally, we have that
[TABLE]
Control of . As it holds .
Control of . The fact that is implied by
[TABLE]
which holds true by the dominated convergence theorem as , and a.s.
Proof of Lemma 3
Using the definition (16) we derive that
[TABLE]
Taking the norm and using the Young inequality together with the fact that is a density with respect to the Lebesgue measure, i.e. , we get
[TABLE]
as desired.
Proof of Lemma 4
We have We introduce the centered i.i.d. random variables , which are bounded by 2 and such that . Applying the Bernstein inequality we have,
[TABLE]
Fix , on the set A_{x}=\big{\{}\big{|}\tfrac{\mathbf{n}(\varepsilon)}{n}-F_{\Delta}(\varepsilon)\big{|}\leq x\big{\}} we have
[TABLE]
Moreover it holds that \mathbb{E}\big{[}\mathbf{n}(\varepsilon)^{-r}\big{]}=\mathbb{E}\big{[}\mathbf{n}(\varepsilon)^{-r}\mathds{1}_{{A_{x}^{c}}}\big{]}+\mathbb{E}\big{[}\mathbf{n}(\varepsilon)^{-r}\mathds{1}_{A_{x}}\big{]}. Since and , using (33) and (34) we get the following upper bound
[TABLE]
and the lower bound \mathbb{E}\big{[}\mathbf{n}(\varepsilon)^{-r}\big{]}\geq\mathbb{E}\big{[}\mathbf{n}(\varepsilon)^{-r}\mathds{1}_{A_{x}}\big{]}\geq\Big{(}\frac{3nF_{\Delta}(\varepsilon)}{2}\Big{)}^{-r}. This completes the proof.
Proof of Lemma 5
For the first inequality, the proof is similar to the proof of Lemma 4. Using the definition of we have For , we set . We have
[TABLE]
using the independence of and . The variables are centered, i.i.d., bounded by 2 and such that the following bound on the variance holds: . Applying the Bernstein inequality we have,
[TABLE]
Fix , on the set A_{x}=\big{\{}\big{|}\tfrac{\widetilde{\bm{n}}(\varepsilon)}{\mathbf{n}(\varepsilon)}-(1-\tfrac{v_{\Delta}(\varepsilon)e^{-\lambda_{\varepsilon}\Delta}}{F_{\Delta}(\varepsilon)})\big{|}\leq\frac{1}{2}\big{\}} we have
[TABLE]
if . It follows from (35), (36) and that for
[TABLE]
Finally, using that for all we have , we derive
[TABLE]
which leads to the first part of the result.
The second part of the result can be obtained by means of the Rosenthal inequality. For , we have, using that ,
[TABLE]
The Rosenthal inequality leads to, for ,
[TABLE]
Thanks to the Jensen inequality we can also treat the case recovering the same inequality. Therefore, it follows that for all
[TABLE]
This completes the proof.
Proof of Theorem 3
Fix , the proof is a consequence of Theorem 2. In the sequel is a constant, possibly depending on , , , , , and , whose value may change from line to line. First, using () and since it holds that , therefore . This together with , leads to whenever using that on .
Replacing and using that and , the upper bound given in Theorem 2 can be rewritten in
[TABLE]
where as is fixed, the quantities , and are included in the constant , together with the terms using the Hölder inequality and that . For , i.e. , the dominating terms in the latter inequalities are the following
[TABLE]
The following computations lead, for and , to Theorem 3 as
[TABLE]
Proof of Theorem 4
Fix , the proof is a consequence of Theorem 2. In the sequel is a constant, possibly depending on , , , , , and , whose value may change from line to line. First, using () and that , it holds for that and . Additionally, using the latter together with (), , and gives . Replacing , the upper bound given in Theorem 2 can be rewritten
[TABLE]
where, being fixed, the quantities and are included in the constant as well as using the Hölder inequality and that . Consider the case , Equation (37) simplifies in
[TABLE]
Set and
[TABLE]
Next, note that and
[TABLE]
It follows that, if and as then and . If , the constraint implies and . The order of the other terms depends on the rate of according to .
For the case , the case is covered by Theorem 3. If , we add the terms and that now intervene in the rate. Theorem 4 follows.
Proof of Theorem 5
Consider for and ; straightforward computations give for ,
[TABLE]
For , set where and appear in Theorems 6 and 7. Theorem 7 and (38) give for that , using that and it holds and Additionally, it follows from Theorem 6 that . Thus, for it follows that .
For , set where and appear in Theorem 10. Theorem 10 and (38) give for that . Similarly, using that and it holds and Additionally, it follows from (40) that . Hence, for we deduce that .
We set
[TABLE]
In the sequel is a constant, possibly depending on , , , , , , , , and , whose value may change from line to line.
The proof is a consequence of Theorem 2. First, we choose the resolution that perform the compromise between the terms and 2^{Jp/2}\ell_{p,\varepsilon}^{p/2}\big{(}nF_{\Delta}(\varepsilon)\big{)}^{-p/2}, where from Equation (38). We choose such that and replace it in the bound of Theorem 2 that can be rewritten for and after simplification in
[TABLE]
Using the assumptions , , , , and , we obtain for and that
[TABLE]
The result follows.
Appendix B Some inequalities on the class
We partially reproduce here the main results of [15] that provide a control of the quantities and on the class , (see (6)). Hereafter, the dependency in of the constants is not given. For explicit values of the constants in the following Theorems, the reader is referred to [15].
Theorem 6**.**
Let be a Lévy measure absolutely continuous with respect to the Lebesgue measure and denote by . Let , , , and . Then, there exists a constant , only depending on , such that
Theorem 7**.**
Let be a finite variation Lévy process of the form with Lévy measure absolutely continuous with respect to the Lebesgue measure and denote by . Suppose that for some and . If , then there exist two constants and only depending on , such that for all it holds:
[TABLE]
Theorem 8**.**
Let be a Lévy process as in (2) and let be a symmetric Lévy measure absolutely continuous with respect to the Lebesgue measure and denote by . Let , , , and . Then, there exists a constant , only depending on , such that
[TABLE]
Theorem 9**.**
Let be a Lévy process as in (2) and let be a symmetric Lévy measure with density with respect to the Lebesgue measure and for some and . Then, for all 0<t<(\varepsilon/2)^{\alpha}\big{(}1\land((2-\alpha)/2M)\big{)}, , it holds:
[TABLE]
where , and are positive constants, only depending on , and .
Theorem 10**.**
Let be a Lévy process as in (2) and let be a symmetric Lévy measure having a density with respect to the Lebesgue measure with for some and . Let and assume that is -Lipschitz on the interval . For all , it holds:
[TABLE]
where , and are universal positive constants, only depending on .
Note that Theorem 10 applied to , satisfying the assumptions of Theorem 10, permits to improve the result of Theorem 8 as follows
[TABLE]
If , Theorem 7 permits to derive that Assumption () is valid for and a constant depending only on , and whose dependency in is explicit. Theorem 6 ensures that Assumption () is fulfilled with , and depending only on , and whose dependency in is explicit.
If and the Lévy measure is symmetric, Theorem 10 (under an addition local Lipschitz condition on ) permits to derive that Assumption () is valid for and a constant depending only on , and whose dependency in is explicit. Theorem 8 ensures that Assumption () is fulfilled with , (if ). This can be improved using (40) in under a local Lipschitz condition on .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. A. Adams and J. J. F. Fournier. Sobolev spaces , volume 140 of Pure and Applied Mathematics (Amsterdam) . Elsevier/Academic Press, Amsterdam, second edition, 2003.
- 2[2] O. E. Barndorff-Nielsen, T. Mikosch, and S. I. Resnick. Lévy processes: theory and applications . Springer Science & Business Media, 2012.
- 3[3] D. Belomestny, F. Comte, V. Genon-Catalot, H. Masuda, and M. Reiß. Lévy Matters IV - Estimation for discretely observed Lévy processes , volume 2128. Springer International Publishing, 2015.
- 4[4] D. Belomestny, S. Gugushvili, M. Schauer, and P. Spreij. Nonparametric bayesian inference for lévy subordinators. ar Xiv preprint ar Xiv:1804.11267 , 2018.
- 5[5] F. Biagini, J. Bregman, and T. Meyer-Brandis. Electricity futures price modeling with lévy term structure models. International Journal of Theoretical and Applied Finance , 18(01):1550003, 2015.
- 6[6] O. Boxma, J. Ivanovs, K. Kosiński, and M. Mandjes. Lévy-driven polling systems and continuous-state branching processes. Stochastic Systems , 1(2):411–436, 2011.
- 7[7] B. Buchmann and R. Grübel. Decompounding: an estimation problem for Poisson random sums. Ann. Statist. , 31(4):1054–1074, 2003.
- 8[8] P. Carr, H. Geman, D. B. Madan, and M. Yor. The fine structure of asset returns: An empirical investigation. The Journal of Business , 75(2):305–333, 2002.
