On Bernstein Type Inequalities for Stochastic Integrals of Multivariate Point Processes
Hanchao Wang, Zhengyan Lin, Zhonggen Su

TL;DR
This paper establishes Bernstein-type concentration inequalities for stochastic integrals of multivariate point processes, providing new bounds and convergence rates for related martingales and estimators.
Contribution
It introduces novel Bernstein inequalities for multivariate point process integrals and applies them to improve convergence rate results for nonparametric MLEs.
Findings
Derived a Bernstein-type concentration inequality using Doléans-Dade exponential formula.
Established a uniform exponential inequality via generic chaining.
Provided an improved convergence rate for nonparametric maximum likelihood estimators.
Abstract
We consider the stochastic integrals of multivariate point processes and study their concentration phenomena. In particular, we obtain a Bernstein type of concentration inequality through Dol\'eans-Dade exponential formula and a uniform exponential inequality using a generic chaining argument. As applications, we obtain a upper bound for a sequence of discrete time martingales indexed by a class of functionals, and so derive the rate of convergence for nonparametric maximum likelihood estimators, which is an improvement of earlier work of van de Geer.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPoint processes and geometric inequalities
On Bernstein Type Inequalities for Stochastic Integrals of Multivariate Point Processes
Hanchao Wang111Corresponding author, email: [email protected].
Zhongtai Security Institute for Financial Studies, Shandong University, Jinan, 250100, PRC
Zhengyan Lin, Zhonggen Su
School of Mathematical Sciences, Zhejiang University, Hangzhou, 310027, PRC
Abstract
We consider the stochastic integrals of multivariate point processes and study their concentration phenomena. In particular, we obtain a Bernstein type of concentration inequality through Doléans-Dade exponential formula and a uniform exponential inequality using a generic chaining argument. As applications, we obtain a upper bound for a sequence of discrete time martingales indexed by a class of functionals, and so derive the rate of convergence for nonparametric maximum likelihood estimators, which is an improvement of earlier work of van de Geer.
keywords:
Bernstein inequality, Doléans-Dade exponential formula, Generic chaining method, Multivariate point process.
1 Introduction
There have been a lot of research activities around phenomena of measure concentration in the past decades. The reader is referred to excellent books like Ledoux and Talagrand [12], Ledoux [11] and nice paper like Talagrand [14] for remarkable results and powerful methods. A primary purpose of the present paper is to establish a Bernstein type exponential concentration inequality for stochastic integrals of multivariate point processes.
For sake of statement, we will begin with a classical Bernstein inequality for sums of independent random variables. Assume that is a probability space so large that we can construct all random objects of interest in it. Let , , be a sequence of centered independent random variables with finite variance, and denote . If there exists a certain constant such that
[TABLE]
then
[TABLE]
for all and for all satisfying .
(1.1) was due to Bernstein [6], and so (1.1) is now referred as Bernstein condition. Since then various extensions and improvement have appeared in literature, among which are Bennett [1, 2], Hoeffding [9], Freedman [8], Bentkus [4, 5], Fan et al. [7]. A very recent nice book is Bercu et al [3] which gives a very clear exposition on concentration inequalities for sums of independent random variables and martingales.
An important extension of Bernstein inequality is to both discrete time martingales and continuous time martingales. In particular, Freedman [8] first obtained the Bernstein inequality of discrete time martingales with bounded jumps, and then Shorack and Wellner [13] extended Freedman's result to continuous time martingales. More precisely, let be a stochastic basis, be a locally square integrable martingale with respect to the filtration with . Denote the jump by and the predictable variation by , . Assume that
[TABLE]
for a positive constant . Then for each ,
[TABLE]
The bounded jump assumption (1.3) can be relaxed. In fact, van de Geer [17] improved the above result under Bernstein condition. For each , consider the process and its predictable compensator . If there exist a constant and a predictable process such that
[TABLE]
then for each ,
[TABLE]
We remark that any locally square integrable martingale can be represented as the sum of continuous local martingale and pure jump local martingale. The nonzero continuous local martingale part indeed played a crucial role in the proof of both (1.4) and (1.6). Now it is natural to ask what happens for a pure jump local martingale. It is an interesting and challenging mathematical problem to establish a concentration inequality for general pure jump local martingales. We shall restrict ourselves to stochastic integrals of multivariate point processes.
Let be a Blackwell space. Assume that be a sequence of strictly increasing positive random variables, a sequence of -valued random variables and is measurable with respect to for each . A multivariate point process is an integer-valued random variables defined by
[TABLE]
We note that Poisson point process and compound Poisson point process are classic and well-studied examples of multivariate point processes. We shall be interested in stochastic integrals of a predictable process with respect to the measure . Let be the predictable compensator of and assume that admits the disintegration
[TABLE]
where is a transition probability from in to , is an increasing càdlág predictable process. Denote . It is easy to see that the process if is a Lévy point process. However, what we are more interested in the case , namely .
Given a predictable function on , , define the stochastic integral
[TABLE]
In addition, put
[TABLE]
and
[TABLE]
An easy computation, see Chapter 2 of Jacod and Shiryaev [10], implies
[TABLE]
Motivated by (1.12), we introduce the following quantities
[TABLE]
The Bernstein inequality for reads as follows
Theorem 1.1
Suppose that for all and some
[TABLE]
Then for each ,
[TABLE]
The proof of Theorem 1.1 will be given in Section 2. A key ingredient is Doléans-Dade exponential formula for semimartingales with given predictable characteristics.
Next let us turn to consider the uniform bound for a family of stochastic integrals of predictable processes with respect to multivariate point process. Let be a metric space, a family of predictable functions on .
Fix a . We denote and define two metrics as follows
[TABLE]
[TABLE]
where stands for norm of .
By Theorem 1.1, one easily can obtain
[TABLE]
As known to us, (1.18) is a certain increment condition. We can further derive a uniform inequality for using a generic chaining method as in Talagrand [15]. To this end, we need to introduce more notations. For a given metric space , an increasing sequence of partitions of is called as admissible sequence if . Denote by the unique of element of containing , and denote by the diameter of under . In addition, let
[TABLE]
where the infimum is taken over all admissible sequences. We can now state a uniform inequality for in terms of and .
Theorem 1.2
Suppose that for all and some
[TABLE]
Then we have
[TABLE]
Moreover, it follows
[TABLE]
The proof of Theorem 1.2 will also be given in Section 2. As applications, we will obtain a Bernstein type exponential inequality for a class of functional index empirical processes and so derive a convergence rate for nonparametric maximum likelihood estimators. This is the content of Section 3.
2 Proofs of Theorems 1 and 2
**Proof of Theorem 1.1 ** Clearly, it follows
[TABLE]
So without loss of generality, we can and do assume . For simplicity of notation, put
[TABLE]
and
[TABLE]
where . Note for any semimartingale , the Doléans-Dade exponential is
[TABLE]
Since
[TABLE]
and
[TABLE]
we can obtain
[TABLE]
for all .
We shall first show the the process \Big{(}e^{\lambda X}/\mathcal{E}(S(\lambda))\Big{)}_{t\geq 0} is a local martingale. For , the jump part of is
[TABLE]
where is the thin set, which is exhausted by .
We denote by the jump measure of . Let be the predictable compensator of , and
[TABLE]
The Itô formula yields
[TABLE]
Furthermore,
[TABLE]
We obtain that
[TABLE]
is a local martingale. Set , , and The Itô formula yields
[TABLE]
Let , and note is also a local martingale. We have
[TABLE]
By the definition of , we have , thus
[TABLE]
Then
[TABLE]
Noting that , , we have
[TABLE]
where is a predictable process, and is a local martingale. By the property of the Stieltjes integral, we have
[TABLE]
Thus \Big{(}e^{\lambda X}/\mathcal{E}(S(\lambda))\Big{)}_{t} is a local martingale.
Since and ,
[TABLE]
Thus,
[TABLE]
Set
[TABLE]
[TABLE]
we have
[TABLE]
On ,
[TABLE]
then
[TABLE]
Take , we obtain
[TABLE]
Proof of Theorem 1.2
By Theorem 1.1, we can obtain
[TABLE]
and
[TABLE]
for . We set .
Consider an admissible sequence such that
[TABLE]
where is the diameter of the set for , and an admissible sequence such that
[TABLE]
where is the diameter of the set for .
We may define partition for as follows: ,
[TABLE]
Consider a set that contains exactly one point in . For , is the element of that belong to . We can easily obtain
[TABLE]
Let be the event defined by
[TABLE]
For , it easily follows
[TABLE]
Letting , we obtain
[TABLE]
On ,
[TABLE]
Hence,
[TABLE]
where
[TABLE]
Thus
[TABLE]
Obviously,
[TABLE]
When , we have , , so that
[TABLE]
Thus
[TABLE]
Proceeding similarly for , we obtain
[TABLE]
in . Thus
[TABLE]
we complete the proof of (1.21).
We can obtain (1.22) through
[TABLE]
Remark 2.1
Let be a metric space, and let \big{(}X^{\psi},\psi\in\Psi\big{)} be a family of stochastic processes defined on a probability space . A primary problem is to study the bounds for , where
[TABLE]
However, this is not easy at all for general processes. The generic chaining method was first invented by Talagrand in a series of articles to deal with . In particular, under the increment condition
[TABLE]
Talagrand [15] proved
[TABLE]
In addition, if the condition (2.31) is replaced by
[TABLE]
then it follows
[TABLE]
Theorem 1.2 implies that if the following increment condition is satisfied
[TABLE]
then (1.21) still holds true.
3 Applications
In this section we shall first apply the previous results to functional index empirical processes. Consider a sequence of adapted stationary time series on the discrete time stochastic basis . Let be the space of all bounded measurable functions in . For a , define
[TABLE]
Obviously, for each , is a discrete time martingale. Note also can be realized through a stochastic integral of with respect to a multivariate point process. In fact, let , , , then
[TABLE]
A simple computation shows
[TABLE]
and
[TABLE]
As a direct consequence of Theorem 1.1, we have
Theorem 3.1
Suppose that, for all and some
[TABLE]
[TABLE]
Then for each ,
[TABLE]
Remark 3.2
If we denote
[TABLE]
and
[TABLE]
The conditions (3.5) and (3.6) imply the conditions (1.14) in Theorem 1.1 for and .
Furthermore, we define the metric for fix
[TABLE]
[TABLE]
where stands for norm of .
Theorem 3.3
Suppose that, for all and some , (3.5) and (3.6) hold. Then
[TABLE]
- Proof. It follows directly from the proof of Theorem 1.2
[TABLE]
Note
[TABLE]
Then (3.12) easily holds. \qed
As a special example of functional index empirical processes, we consider the nonparametric maximum likelihood estimators below.
Let be a family of probability measures, we assume that is dominated by a Lebesgue measure. Denote the density of by , . Fix a such that , and let be a sequence of i.i.d. observations from . Define the empirical distribution
[TABLE]
on the basis of the first observations. The maximum likelihood estimator of is defined by
[TABLE]
We assume throughout that a exists.
It is very important to study the rate of convergence of to in the theory of nonparametric statistical inference. Recall the Hellinger distance is usually used to describe the distance between two probability measures. In particular, for a pair of probability measures the Hellinger distance is defined by
[TABLE]
where is a measure dominating and .
In our setting is a Lebesgue measure, , , and we simply write . It is natural to ask what the rate of convergence for to in terms of . We have the following result in this aspect. Denote , and set
[TABLE]
[TABLE]
Theorem 3.4
Suppose that there is a positive constant such that for all
[TABLE]
Then it follows
[TABLE]
- Proof.
Since for any ,
[TABLE]
Also, according to (3.16),
[TABLE]
Thus we have
[TABLE]
We now can complete the proof by Theorem 3.3. \qed
Remark 3.5
van de Geer [16, 17] discussed the similar problem on maximum likelihood estimators. To our knowledge, Theorem 3.4 is new in this area. We also remark that Theorem 3.4 can be extended by Theorem 1.2 and 3.3 to the stationary sample case with suitable maximum likelihood estimators. It is left to future work .
Acknowledgments
This research work is support by the National Natural Science Foundation of China (No. 11371317, 11171303) and the Fundamental Research Fund of Shandong University (No. 2016GN019).
Reference
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Bennett, G. Probability inequalities for the sum of independent random variables. J. Amer. Statist. Assoc. , 57 , 33-45, (1962).
- 2[2] Bennett, G. On the probability of large deviations from the expectation for sums of bounded independent random variables. Biometrika , 50 , 528-535, (1963).
- 3[3] Bercu, B., Delyon, B. and Rio, E. Concentration inequalities for sums and martingales. Springer Briefs in Mathematics , Springer, (2015)
- 4[4] Bentkus, V. An inequality for tail probabilities of martingales with differences bounded from one side. J. Theoret. Probab. , 16 , 161-173, (2003).
- 5[5] Bentkus, V. On Hoeffding's inequalities. Ann. Probab. , 32 , 1650-1673, (2004).
- 6[6] Bernstein, S.N. Theory of Probability , Moscow. (1927) .
- 7[7] Fan, X., Grama, I. and Liu, Q. Exponential inequalities for martingales with applications. Electron. J. Probab. , 20 , (2015).
- 8[8] Freedman, D. On tail probabilites for martingales. Ann. Probability , 3 , 100-118, (1975).
