Cram\'{e}r type moderate deviations for stationary sequences of bounded random variables
Xiequan Fan

TL;DR
This paper establishes Cramér type moderate deviation results for stationary bounded sequences, providing theoretical tools for probabilistic bounds and applications in various dependent processes.
Contribution
It introduces new Cramér type moderate deviation results for stationary sequences of bounded variables, extending existing probabilistic bounds to dependent data.
Findings
Derives Cramér type moderate deviations for stationary sequences
Implements results to quantile coupling inequalities and mixing sequences
Provides Berry-Esseen bounds for dependent processes
Abstract
We derive Cram\'{e}r type moderate deviations for stationary sequences of bounded random variables. Our results imply the moderate deviation principles and a Berry-Esseen bound. Applications to quantile coupling inequalities, functions of -mixing sequences, and contracting Markov chains are discussed.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and statistical mechanics · Probability and Risk Models · Mathematical Dynamics and Fractals
Cramér type moderate deviations for stationary sequences of bounded random variables
Xiequan Fan
Center for Applied Mathematics, Tianjin University, Tianjin, China
Abstract
We derive Cramér type moderate deviations for stationary sequences of bounded random variables. Our results imply the moderate deviation principles and a Berry-Esseen bound. Applications to quantile coupling inequalities, functions of -mixing sequences, and contracting Markov chains are discussed. To cite this article: A. Name1, A. Name2, C. R. Acad. Sci. Paris, Ser. I 340 (2005).
Résumé Déviations modérés de type Cramér pour les séquences stationnaires. Nous dérivons les déviations modérées de type Cramér pour des séquences stationnaires de variables aléatoires bornées. Nos résultats impliquent les principes de déviation modérée et un théoreme de Berry-Esseen. Les applications aux inégalités de couplage quantile, fonctions des séquences de mélange, et des chaînes de Markov contractantes sont discutées. Pour citer cet article : A. Name1, A. Name2, C. R. Acad. Sci. Paris, Ser. I 340 (2005).
††journal: the Académie des sciences
Received *****; accepted after revision +++++
Presented by
1 Introduction
For the stationary sequence of centered random variables, define the partial sums and the normalized partial sums process by
[TABLE]
respectively. We say that the sequence of random variables satisfies the moderate deviation principle (MDP) with speed and good rate function , if the level set are compact for all and for all Borel sets
[TABLE]
where denotes the interior of the closure of and the infimum of a function over an empty set is interpreted as The MDP is an intermediate behavior between the central limit theorem ( and large deviations
The MDP results have been obtained by several authors. De Acosta [2] applied Laplace approximations to prove the MDP for sums of independent random vectors. Dembo [5] showed that the MDP holds for the trajectory of a locally square integrable martingale with bounded jumps as soon as its quadratic covariation converges in probability at an exponential rate. Gao [9] and Djellout [6] obtained the MDP for martingales with non-bounded differences and -mixing sequences with summable mixing rate. Dedecker et al. [3] derived the MDP for stationary sequences of bounded random variables under martingale-type conditions. It is known that the MDP results for stationary sequences can be applied in a variety of settings. For instance, Dedecker et al. [3] showed that such type of results can be applied to functions of mixing sequences, contracting Markov chains, expanding maps of the interval, and symmetric random walks on the circle.
In this paper we are concerned with Cramér type moderate deviations for stationary sequences. Cramér type moderate deviations usually imply the MDP results; see Fan et al. [7] for instance. Furthermore, Cramér type moderate deviations imply Berry-Esseen bounds; see Corollary 2.2. Following the excellent work of Mason and Zhou [13] and Dedecker et al. [3], we apply our results to quantile coupling inequalities, functions of -mixing sequences, and contracting Markov chains.
Our approach is based on martingale approximation and Cramér type moderate deviations for martingales due to Fan et al. [7]. Cramér type moderate deviations for martingales have been established by Račkauskas [16, 17], Grama [10] and Grama and Haeusler [11, 12]. Such type of results are very useful for study of stationary sequences, for instance, Wu and Zhao [20] applied the results of Grama [10] to establish Cramér type moderate deviations for stationary sequences with physical dependence measure introduced by Wu [19], functionals of linear processes and some nonlinear time series. See also Cuny and Merlevède [1] (cf. Theorem 3.2 therein) for a result similar to Wu and Zhao [20], where Cuny and Merlevède [1] established a Cramér type moderate deviations for an adapted stationary sequence in . For relationship among our results and the last two results, we refer to point 3 of Remark 1.
The paper is organized as follows. Our main results are stated and discussed in Section 2. The applications are given in Section 3. Proofs of theorems are deferred to Section 4.
2 Main results
From now on, assume that the stationary sequence is given by where is a bijective bimeasurable transformation preserving the probability on . For a subfield satisfying , let Our theorems and their corollaries treat the so-called adapted case, that is being -measurable and so the sequence is adapted to the filtration . Moreover, we denote the -norm by , that is the smallest such that
Throughout the paper, let be integers such that For instance, we may take where stands for the largest integer less than Denote
[TABLE]
and
[TABLE]
where . The following theorem gives a Cramér type moderate deviation result for stationary sequences.
Theorem 2.1
Assume that , and that is -measurable. Then there exists an absolute constant such that when and , it holds for all
[TABLE]
where depends only on In particular, the last inequality implies that
[TABLE]
uniformly for as Moreover, the same results hold when replacing by .
Remark 1
Let us comment on the results of Theorem 2.1.
Assume that
[TABLE]
and that there exists such that
[TABLE]
The conditions (6) and (7) were introduced by Dedecker et al. **[3]**. Assume that and as . By Lemma 29 of Dedecker et al. **[3]**, the assumptions of Theorem 2.1 hold with as 2. 2.
If is a martingale difference sequence, then Theorem 2.1 gives a Cramér type moderate deviation result with
[TABLE]
which is similar to the main theorem of Grama and Haeusler **[11]** (see also Fan et al. **[7]**). 3. 3.
The range of equality (5) can be very large. For instance, if \big{\|}\mathbf{E}[S_{n}|\mathcal{F}_{0}]\big{\|}_{\infty}=O(1) and \big{\|}\frac{1}{n}\mathbf{E}[S_{n}^{2}|\mathcal{F}_{0}]-\sigma_{n}^{2}\big{\|}_{\infty}=O\big{(}\frac{1}{n}\big{)} as then, by taking equality (5) holds uniformly for as 4. 4.
For stationary processes, results similar to Theorem 2.1 can be found in Wu and Zhao **[20]** and Cuny and Merlevède **[1]**. Wu and Zhao **[20]** showed that it is possible to prove the relative error of normal approximation tends to [math] for a certain class of stationary processes represented by functions of an i.i.d. sequence as soon as the partial sum process can be well approximated by martingales. Following the work of Wu and Zhao **[20]**, Cuny and Merlevède (see Theorem 3.2 of **[1]**) proved that under certain conditions for -norm, the relative error of normal approximation tends to [math] uniformly for that is (5) holds uniformly for . Now Theorem 2.1 shows that the last range could be as large as for some positive constant (cf. point (iii) of this remark) under the conditions for -norm (instead of -norm). 5. 5.
The absolute constant is very small. However, it can be improved to a larger one, provided that the absolute constant in the inequality of Peligrad et al. **[14]** (cf. inequality (28)) can be improved to a smaller one. 6. 6.
Notice that the quantities and can be estimated via the quantities
[TABLE]
Indeed, it is easy to see that
[TABLE]
and
[TABLE]
where is an absolute constant. Splitting the last sum as follows
[TABLE]
we infer that
[TABLE]
where is an absolute constant. Moreover, if
[TABLE]
for some constant by (8) and (9), then we have and
[TABLE] 7. 7.
Assume that . If for some constant with then equality (5) holds uniformly for as If for some constant with then equality (5) holds uniformly for as
Theorem 2.1 implies the following Berry-Esseen bound.
Corollary 2.2
Assume the conditions of Theorem 2.1. Then
[TABLE]
where is an absolute constant.
Remark 2
Let us comment on Corollary 2.2.
Assume that and for some constant By point (vi) of Remark 1, if then, with , bound (10) reaches its minimum of order If then, with , bound (10) gives its minimum of order 2. 2.
When is a uniformly mixing sequence, we refer to Rio **[18]** for a result similar to Corollary 2.2. In the paper, Rio **[18]** gave a Berry-Esseen bound of order under the condition where is the sequence of uniformly mixing coefficients. 3. 3.
If is a stationary martingale difference sequence, Corollary 2.2 gives the following Berry-Esseen bound
[TABLE]
When is -bounded (instead of -bounded), Dedecker et al. **[4]** have obtained some rather tight Berry-Esseen bounds. Notice that Dedecker et al. **[4]** assumed a martingale coboundary decomposition while we do not. On the other hand Dedecker et al. **[4]** worked in and we work in so the results are of independent interest. It is worth noticing that the best rates (for martingales) provided by Dedecker et al. **[4]** and us are the same.
Theorem 2.1 gives an alternative proof for the following moderate deviation principle (MDP) result which is implied by the functional MDP result of Dedecker et al. [3] under the conditions (6) and (7).
Corollary 2.3
Assume the conditions of Theorem 2.1. Assume that , and that as Let be any sequence of real numbers satisfying and as . Then for each Borel set ,
[TABLE]
where and denote the interior and the closure of , respectively.
The following theorem gives a Bernstein type inequality for the stationary sequences. Although such type of inequalities are less precise than Cramér type moderate deviations, they are available for all positive Moreover, they are very useful for establishing quantile coupling inequalities; see Theorem 2.5.
Theorem 2.4
Assume the conditions of Theorem 2.1. Then for any
[TABLE]
where
Assume that as Then and as Thus the second term in the r.h.s. of (13) is much smaller than the first one for any as So when satisfies and the bound (13) behaves like \exp\big{\{}-\frac{x^{2}}{2(1+\delta_{m}^{2})}\big{\}} for any
Next, we apply Theorems 2.1 and 2.4 to quantile coupling inequalities for stationary sequences. We follow Mason and Zhou [13], where such type of inequalities have been established for arbitrary random variables under some Cramér type moderate deviation assumptions. Using Theorems 2.1, 2.4 and Theorem 1 of Mason and Zhou [13], we obtain the following result.
Theorem 2.5
Assume the conditions of Theorem 2.1, and that as . Let Then, there exist two positive absolute constants and , a standard normal random variable and a random variable can be constructed on a new probability space such that and
[TABLE]
whenever
[TABLE]
and is large enough, where stands for equality in distribution. Furthermore, there exist two positive absolute constants and such that for large enough, we have for all
[TABLE]
Assume that and for some constant By point (i) of Remark 2, if then, with , the term is of order If then, with , the term is of order
3 Applications
In this section, we present some applications of our results. For more interesting applications, such as expanding map and symmetric random walk on the circle, we refer to Corollary 18 and Proposition 20 of Dedecker et al. [3]. Under their corresponding conditions, the conditions of Theorem 2.1 hold.
3.1 -mixing sequences
Let be a random variable with values in a Polish space If is a -field, the -mixing coefficient between and is defined by
[TABLE]
For a sequence of random variables and a positive integer denote
[TABLE]
and let be the usual -mixing coefficient. Under the following conditions
[TABLE]
Dedecker et al. [3] obtained a MDP result for bounded random variables. See also Gao [9] for an earlier MDP result under a condition stronger than (18), that is .
When the random variables are bounded, it holds and as . By point (vii) of Remark 1, we have the following result.
Proposition 3.1
Assume that the random variables are bounded, and
[TABLE]
for some constant
[i]
If , then (5) holds uniformly for as
[ii]
If , then (5) holds uniformly for as
3.2 Functions of -mixing sequences
Let be a stationary sequence of -mixing random variables taking values in a subset of a Polish space Denote by the coefficient
[TABLE]
where is defined by (17). Let be a function from to satisfying the following condition
(A):
\textrm{for any}\ i\geq 0,\ \ \ \ \sup_{x\in A^{\mathbf{N}},\,y\in A^{\mathbf{N}}}\Big{|}H(x)-H(x^{(i)}y)\Big{|}\leq R_{i},\ \ \textrm{where}\ \ R_{i}\ \textrm{decreases to }\ 0,
where the sequence is defined by for and for Define the stationary sequence by
[TABLE]
Dedecker et al. [3] gave a MDP result for , see Propositions 12 therein. From the proof of Propositions 12 of [3], it is easy to see that
[TABLE]
Notice that when it holds By point (vii) of Remark 1, we have the following Cramér type moderate deviations.
Proposition 3.2
Let be defined by (19), for a function satisfying condition (A). Assume
[TABLE]
for some constant and
[i]
If , then (5) holds uniformly for as
[ii]
If , then (5) holds uniformly for as
3.3 Contracting Markov chains
Let be a stationary Markov chain of bounded random variables with invariant measure and transition kernel Denote by the essential norm with respect to Let be the set of -Lipschitz functions. Assume that the chain satisfies the following condition:
(B):
there exist two constants and such that
[TABLE]
and for any
[TABLE]
We shall see in the next proposition that MDP result holds for the sequence
[TABLE]
as soon as the function belongs to the class introduced by Dedecker et al. [3]. Let be the class of functions such that , where is a concave and non-decreasing function and satisfies
[TABLE]
Clearly, (22) holds if for some constants and In particular, contains the class of -Hölder continuous functions from to , where
Dedecker et al. [3] gave a MDP result for , see Propositions 14 therein. From the proof of Propositions 14 of [3], it is easy to see that
[TABLE]
where is given by condition (B).
Proposition 3.3
Assume that the stationary Markov chain satisfies condition (B), and let be defined by (21). Assume
[TABLE]
and
[TABLE]
for some constant
[i]
If , then (5) holds uniformly for as
[ii]
If , then (5) holds uniformly for as
Notice that if for some constants and then (23) is satisfied.
4 Proofs of Theorems and Corollaries
The proofs of our results are mainly based on the following lemmas, which give some exponential deviation inequalities for the partial sums of dependent random variables.
4.1 Preliminary lemmas
Let be a sequence of martingale differences, defined on some probability space , where , are increasing -fields. Set
[TABLE]
Then is a martingale. Denote the quadratic characteristic of the martingale , that is
[TABLE]
Assume the following two conditions:
(C1)
There exists such that
[TABLE]
(C2)
There exists such that
Clearly, condition (C1) is satisfied for bounded martingale differences
In the proof of Theorem 2.1, we need the following Cramér moderate deviation expansions for martingales, which is a simple consequence of Theorems 2.1 and 2.2 of Fan et al. [7].
Lemma 4.1
Assume conditions (C1) and (C2). Then there is an absolute constant such that for all and ,
[TABLE]
where depends only on Moreover, the same equality remains true when is replaced by .
In the proof of Theorem 2.4, we make use of the following Freedman inequality [8].
Lemma 4.2
Assume that for some constant and all Then for all and ,
[TABLE]
We also use the following exponential inequality of Peligrad et al. [14] (cf. Proposition 2 therein), which plays an important role in the proof of Theorem 2.4.
Lemma 4.3
Let be a stationary sequence of random variables adapted to the filtration . Then for all
[TABLE]
4.2 Proof of Theorem 2.1
Let be the integer part of . The initial step of the proof is to divide the random variables into blocks of size and to make the sums in each block
[TABLE]
It is easy to see that Define
[TABLE]
Then is a stationary sequence of bounded martingale differences, that is
[TABLE]
Notice that
[TABLE]
and that, by stationarity, it follows that
[TABLE]
Moreover,
[TABLE]
Consequently, it holds
[TABLE]
and
[TABLE]
Denote and Then it is obvious that
[TABLE]
Assume and , where is given by Lemma 4.1. By Lemma 4.1, we have for all ,
[TABLE]
where depends only on Notice that for all and ,
[TABLE]
and
[TABLE]
where It is obvious that
[TABLE]
Therefore, by (29) and (30), for all
[TABLE]
Similarly, we have for all
[TABLE]
The last two inequalities imply that for all
[TABLE]
By Lemma 4.3, we derive that for all ,
[TABLE]
It is easy to see that for all ,
[TABLE]
By the inequalities (31)-(33), it follows that for all
[TABLE]
Using the following two-sided bound on tail probabilities of the standard normal random variable
[TABLE]
we deduce that for all and
[TABLE]
Notice that for
[TABLE]
By an argument similar to the proof of (LABEL:need01), we deduce that for all
[TABLE]
Combining (LABEL:need01) and (37) together, we obtain the desired equality for all Next, we consider the case where Notice that (31) holds also for Thus, from (31), we have
[TABLE]
For all we deduce that
[TABLE]
where the last line follows by (32). Similarly, we have for all
[TABLE]
The last two inequalities imply that for all
[TABLE]
The last inequality implies the desired equality for all
Since also satisfies the conditions of Theorem 2.1, the same equalities remain true when is replaced by .
4.3 Proof of Corollary 2.2
We only need to consider the case where Otherwise, Corollary 2.2 holds obviously for large enough. Denote
[TABLE]
where is the absolute constant given by Theorem 2.1. It is easy to see that
[TABLE]
By Theorem 2.1 and the inequality we have
[TABLE]
Using the last inequality, we deduce that
[TABLE]
Similarly, it holds that
[TABLE]
It is obvious that
[TABLE]
Combining the inequalities (39)-(43) together, we obtain the desired inequality.
4.4 Proof of Corollary 2.3
Let . Then it holds that as Thus as
First, we prove that
[TABLE]
For any given Borel set let Then, it is obvious that Therefore, by Theorem 2.1,
[TABLE]
Notice that
[TABLE]
as Using (34) and the fact , we deduce that
[TABLE]
which gives (44).
Next, we prove that
[TABLE]
We may assume that otherwise the last inequality holds obviously because the infimum of a function over an empty set is interpreted as For any there exists an such that
[TABLE]
Without loss of generality, we may assume that For there exists small such that Then it is obvious that By Theorem 2.1, we deduce that
[TABLE]
Using Theorem 2.1, (34) and the fact again, it follows that
[TABLE]
Letting we get
[TABLE]
Because can be arbitrarily small, we obtain (45). This completes the proof of Corollary 2.3.
4.5 Proof of Theorem 2.4
Recall the notations in the proof of Theorem 2.1. It is easy to see that
[TABLE]
and
[TABLE]
Applying Lemma 4.2 to we have for all
[TABLE]
By an argument similar to the proof of (32), we obtain for all
[TABLE]
Using (33) again, we obtain the desired inequality.
4.6 Proof of Proposition 2.5
For each integer , let
[TABLE]
be the cumulative distribution function of Then its quantile function is define by
[TABLE]
Let be a standard normal random variable. Denote
[TABLE]
Then see Mason and Zhou [13]. Denote
[TABLE]
By Theorem 2.1, there exist an absolute constants and such that when is large enough, we have for all
[TABLE]
and
[TABLE]
where depends only on By Theorem 1 of Mason and Zhou [13], then whenever and
[TABLE]
we have
[TABLE]
which gives (14) with and . Notice that there exists an integer such that for all
Next we give the proof of (16). Set for brevity
[TABLE]
By (14), we have for all
[TABLE]
Notice that
[TABLE]
When by the inequalities (50) and (51), it holds that
[TABLE]
and that
[TABLE]
Returning to (55), we obtain for all
[TABLE]
where For it is easy to see that
[TABLE]
Clearly, it holds for all
[TABLE]
By Theorem 2.4, there exists a positive constant such that for all
[TABLE]
Returning to (59), we have for all
[TABLE]
where Combining (58) and (60), we get the desired inequality.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Cuny, C., Merlevède, F., 2014. On martingale approximations and the quenched weak inviariance principle. Ann. Probab. 42 (2), 760–793.
- 2[2] De Acosta, A., 1992. Moderate deviations and associated Laplace approximations for sums of independent random vectors. Trans. Amer. Math. Soc. 329 (1): 357-375.
- 3[3] Dedecker, J., Merlevède, F., Peligrad, M., Utev, S., 2009. Moderate deviations for stationary sequences of bounded random variables. Ann. Inst. H. Poincaré Probab. Statist. 45 (2), 453–476.
- 4[4] Dedecker, J., Merlevède, F., Rio, E., 2009. Rates of convergence for minimal distances in the central limit theorem under projective criteria. Electron. J. Probab. , 14 , 978-1011.
- 5[5] Dembo, A., 1996. Moderate deviations for martingales with bounded jumps. Electron. Comm. Proba. 1 , 11–17.
- 6[6] Djellout, H., 2002. Moderate deviations for martingale differences and applications to ϕ − limit-from italic-ϕ \phi- mixing sequences. Stochastic Stochastic Rep. 73 , 37–63.
- 7[7] Fan X, Grama I, Liu Q., 2013. Cramér large deviation expansions for martingales under Bernstein’s condition. Stochastic Process. Appl. 123 (11): 3919–3942.
- 8[8] Freedman, D. A., 1975. On tail probabilities for martingales. Ann. Probab. 3 (1): 100–118.
