On large deviations for combinatorial sums
Andrei N. Frolov

TL;DR
This paper studies the asymptotic probabilities of large deviations in normalized combinatorial sums, identifying conditions under which these probabilities align with the standard normal tail, extending classical results.
Contribution
It introduces new conditions for large deviation asymptotics of combinatorial sums, expanding the understanding of their convergence to normal distribution tails.
Findings
Probabilities of large deviations match the standard normal tail in a specific zone.
Conditions similar to Bernstein's condition are sufficient for normal approximation.
The zone of normal convergence can grow at a power rate.
Abstract
We investigate asymptotic behaviour of probabilities of large deviations for normalized combinatorial sums. We find a zone in which these probabilities are equivalent to the tail of the standard normal law. Our conditions are similar to the classical Bernstein condition. The range of the zone of the normal convergence can be of power order.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On large deviations for combinatorial sums
Andrei N. Frolov 111This investigation was supported by RFBR, research project No. 18–01–00393
Dept. of Mathematics and Mechanics
St. Petersburg State University
St. Petersburg, Russia
E-mail address: [email protected]
Abstract
We investigate asymptotic behaviour of probabilities of large deviations for normalized combinatorial sums. We find a zone in which these probabilities are equivalent to the tail of the standard normal law. Our conditions are similar to the classical Bernstein condition. The range of the zone of the normal convergence can be of power order.
AMS 2000 subject classification: 60F05
Key words: *combinatorial central limit theorem, combinatorial sum, large deviations *
1 Introduction
Let be a sequence of matrices of independent random variables and , be a sequence of random permutations of numbers . Assume that has the uniform distribution on the set of permutations of and it is independent with for all . Define the combinatorial sum by relation
[TABLE]
Under certain conditions, a sequence of distributions of combinatorial sums converges weakly to the standard normal law. Every such result is called a combinatorial central limit theorem (CLT).
Investigations in this direction have a long history. One can find results on combinatorial CLT in Wald and Wolfowitz [1], Noether [2], Hoeffding [3], Motoo [4], Kolchin and Chistyakov [5]. Further, non-asymptotic Esseen type bounds have been derived for accuracy of normal approximation of distributions of combinatorial sums. Such results have been obtained in Bolthausen [6], von Bahr [7], Ho and Chen [8], Goldstein [9], Neammanee and Suntornchost [10], Neammanee and Rattanawong [11], Chen, Goldstein and Shao [12], Chen and Fang [13], Frolov [14, 15], and in Frolov [16] for random combinatorial sums.
Note that if are identically distributed for all and , then the combinatorial sum has the same distribution as that of independent random variables. This case is well investigated, but one has to take it into account for estimation of optimality of derived results.
Besides some partial cases, combinatorial sums have not independent increments. Hence, it is difficult to use classical methods of proofs for Esseen type inequalities those are based on bounds for differences of characteristic functions (c.f.). One usually applies the Stein method. For combinatorial sums, it yields Esseen type inequalities for random variables with finite third moments. Applying of the truncation techniques, Frolov [14, 15] derived generalizations of these results to the case of finite moments of order and for infinite variations as well.
Every bound in CLT similar to the Esseen inequality yields results on asymptotic behaviour for large deviations coinciding with that for tail of the normal law in a logarithmic zone. Such results are usually called moderate deviations. Moderate deviations for combinatorial sums have been investigated in Frolov [17].
In this paper, we derive new results on the asymptotic behaviour for large deviations of combinatorial sums in power zones. Note that ranges of power zones are powers from some characteristic similar to the Lyapunov ratio. Indeed, we deal with non-identically distributed random variables. Even for sums of independent random variables, ranges of zones of the normal convergence depend on the Lyapunov ratios. For identically distributed random variables, this yields that the ranges are powers from the number of summands. But the last case corresponds to the classical theory for sums of independent random variables and it is not new therefore.
In our proofs, we will use the method of conjugate distributions. Note that von Bahr [7] developed a method to bound distances between c.f.’s of normalized combinatorial sums and normal law. Assuming that random variables are bounded or satisfy certain analogue of the classical Bernstein condition, we conclude that moment generating functions (m.g.f.) of normalised combinatorial sums are analytic in a circle of the complex plane. Adopting the Bahr’s method, we will bound the difference between m.g.f.’s in some circle. In view of the analytic property, this will also give bounds for derivatives of m.g.f.’s. Hence, we will arrive at desired asymptotics for m.g.f.’s and their first and second logarithmic derivatives which are means and variations of random variables being conjugate for normalized combinatorial sums. Then we will estimate a closeness of distributions of conjugate random variables and the standard normal law. Using relationship between distributions and conjugate ones, we will derive the asymptotics of large deviations under consideration.
2 Results
Let be a sequence of matrices of independent random variables such that
[TABLE]
for all . Let , be a sequence of random permutations of numbers . Assume that has the uniform distribution on the set of permutation and it is independent with for all . Put
[TABLE]
It is not difficult to check that
[TABLE]
Hence, condition (1) yields that combinatorial sums are centered at zero. Moreover,
[TABLE]
If as , then the main part of the variance is the normalized sum of second moments
[TABLE]
Therefore, in the sequel, we will use as norming sequence for .
Our main result is as follows.
Theorem 1**.**
Let be a non-decreasing sequence of positive numbers such that for , inequalities
[TABLE]
hold for all , and , where is an absolute positive constant. Put
[TABLE]
Then for every sequence of positive numbers with , and as , relation
[TABLE]
holds, where is the standard normal distribution function.
Note that . This follows from the inequality . Indeed, assuming that , we arrive at the incorrect inequality .
Bahr [7] proved the following Esseen type inequality:
[TABLE]
where is an absolute positive constant. Hence the condition as is natural for relation (3), giving exact (non-logarithmic) asymptotics of large deviations. For identically distributed , this condition turns to as .
Note that the conditions and as imply as .
Theorem 1 is stronger than the results in Frolov [14] since the zone of normal convergence may be of power order while it is logarithmic in [14]. Of course, this requires stronger moment assumptions.
Condition (2) is an analogue of the Bernstein condition which is a form of existence for the exponential moment. In classical theory, one mainly deals with centered random variables and the Berstein condition yields that the logarithm of the m.g.f. is asymptotically a quadratic function at zero. For combinatorial CLT, it is principally important that summands could be non-centered and even degenerate sometimes. In this case, the logarithm of m.g.f. may be a linear function in a neighbourhood of zero provided the mean is not zero.
One can rewrite inequalities (2) for as follows:
[TABLE]
Hence, the Lyapunov inequality implies that the next condition is sufficient for (2): the inequalities and
[TABLE]
hold for all , and .
Consider two important examples in which condition (2) is satisfied.
1. Bounded random variables. If there exists a non-decreasing sequence of positive constants such that for all and , then condition (2) holds. For degenerate case with for all and , condition (2) is fulfilled with for every .
2. Exponential random variables. Let and be random variables having the exponential distributions with the parameters and correspondingly. Assume that each random variable in every matrix has one from four distributions of random variables , , and . Since and for all , condition (2) holds with с . One can easily expand this example for a larger number of exponential distributions using for construction of matrices of X’s. Parameters of these distributions may depend on . Moreover, one can easily replace exponential distributions by Gamma ones.
Note that has an order of in the last example. It is also clear that the behaviour of will be similar when every random variable has one from given distributions. In the last case, one says about -sequences of matrix .
3 Proofs
For all , and , put
[TABLE]
where is the set of complex numbers. We have
[TABLE]
Note that the last sum is the permanent of the matrix . To investigate its behaviour we will use the following result.
Lemma 1**.**
Let be a random variable such that for the inequalities
[TABLE]
hold for all , where and are positive constants.
Then is an analytic function in the circle and for every with and , the inequalities
[TABLE]
hold, where constants depends on and do not depend on .
Proof. By inequality (5) and Stirling’s formula, we have
[TABLE]
for all , where the constant depends on , and . Hence, the series
[TABLE]
converges in the circle . Put
[TABLE]
Then a.s. The monotone convergence theorem yields that
[TABLE]
in the circle . In view of , the Lebesgue dominate convergence theorem implies that
[TABLE]
in the circle .
Put . For and , we have
[TABLE]
where
[TABLE]
By inequalities (5), we get
[TABLE]
for all and .
Since , we have
[TABLE]
for all .
Hence,
[TABLE]
The first inequality follows.
Making use of the inequality for , the Lyapunov inequality and , we obtain
[TABLE]
for all .
It follows that
[TABLE]
The second inequality is proved.
Applying the inequality for and the Lyapunov inequality, we have
[TABLE]
Using for , the Lyapunov inequality, inequality (5) and , we further get
[TABLE]
for all .
It yields that
[TABLE]
The lemma is proved.
Proof of Theorem 1. By Lemma 1 with , , and , for all , and , the inequalities
[TABLE]
hold for every in the circle . Relations (7) and (1) imply that
[TABLE]
and
[TABLE]
It follows from relations (8) and (1) that
[TABLE]
From relations (6) and (9)—(11), the definition of and , we have
[TABLE]
Note that the function , , is the c.f. for the normalized combinatorial sum. In Bahr [7], relations (4) and (12)—(15) for and have been used to bound the distance between and the c.f. of the standard normal law. The bounds for from there will coincide with our ones provided we change by . Hence, we borrow one further bound from [7] with a formal replacing by . We use the first formula from p. 137 in [7] with instead of . Then we have
[TABLE]
for all in the circle , where is an absolute positive constant. Hence,
[TABLE]
where . If , then
[TABLE]
It follows that
[TABLE]
for all in the circle .
Let be a sequence of positive numbers that will be chosen later. Assume that . The function is analytic in the circle . Hence,
[TABLE]
where by the Cauchy inequalities, the coefficients satisfy to the relations
[TABLE]
Put
[TABLE]
Then, in the circle , the inequalities
[TABLE]
hold. This and inequality (16) yield that
[TABLE]
for . Hence,
[TABLE]
for .
Further, making use of relations (17)–(19), we get
[TABLE]
for . It follows that
[TABLE]
for .
Let be a sequence of positive numbers. Let be a random variable conjugate to , i.e. has the following distribution function
[TABLE]
Note that and . In the sequel, we take such that relations (19) и (20) will yield and . Hence, we investigate the distance between the standard normal law and the distribution of centered at and normalized by main terms of the mean and the variance. Denote
[TABLE]
and estimate
[TABLE]
Put
[TABLE]
It is clear that is a c.f. of the random variable .
We have
[TABLE]
for . Since
[TABLE]
we obtain for . It follows from relations (16) and (17) that
[TABLE]
for . Putting , we get
[TABLE]
for all and . By the Esseen inequality, we have
[TABLE]
Furthermore,
[TABLE]
We have
[TABLE]
provided . Moreover,
[TABLE]
Put , where enough slowly to satisfy , and . Note that in view of relation (16), we have . Let be a solution of the equation
[TABLE]
The function is strictly increasing, and, by relations (19) and (16), the inequalities hold for all sufficiently large . It follows that the unique solution of equation (25) exists for all sufficiently large . Moreover, relation (19) yields that
[TABLE]
and
[TABLE]
It follows from relations (21)—(24) and (16) that
[TABLE]
Theorem 1 is proved.
Finally, we mention some unsolved problems. In Frolov, Martikainen and Steinebach [18], one can find more exact results on large deviations for sums of independent random variables in the scheme of series. In there, the conditions are imposed on the logarithms of m.g.f.’s of summands. Now we can not adopt the techniques from there to combinatorial sums. We see from relation (4) that the m.g.f. of is the permanent of the matrix . Above, the method of the investigation of the behaviour for this permanent implied bounds with instead of analogues of the Lyaponov ratios. The second problem is that the proof in [18] involves some bounds in CLT which variants for combinatorial sums are unknown. Solutions of these problems could yield more exact results under weaker conditions.
References
- [1]
Wald A., Wolfowitz J.,1944. Statistical tests based on permutations of observations. Ann. Math. Statist. 15, 358–372.
- [2]
Noether G.E., 1949. On a theorem by Wald and Wolfowitz. Ann. Math. Statist. 20, 455–458.
- [3]
Hoeffding W., 1951. A combinatorial central limit theorem. Ann. Math. Statist. 22, 558–566.
- [4]
Motoo M., 1957. On Hoeffding’s combinatorial central limit theorem. Ann. Inst. Statist. Math. 8, 145–154.
- [5]
Kolchin V.F., Chistyakov V.P. (1973) On a combinatorial limit theorem. Theor. Probab. Appl. 18, 728-739.
- [6]
Bolthausen E., 1984. An estimate of the remainder in a combinatorial central limit theorem. Z. Wahrsch. verw. Geb. 66, 379–386.
- [7]
von Bahr B., 1976. Remainder term estimate in a combinatorial central limit theorem. Z. Wahrsch. verw. Geb. 35, 131-139.
- [8]
Ho S.T., Chen L.H.Y., 1978. An bounds for the remainder in a combinatorial central limit theorem. Ann. Probab. 6, 231–249.
- [9]
Goldstein L., 2005. Berry-Esseen bounds for combinatorial central limit theorems and pattern occurrences, using zero and size biasing. J. Appl. Probab. 42, 661–683.
- [10]
Neammanee K., Suntornchost J., 2005. A uniform bound on a combinatorial central limit theorem. Stoch. Anal. Appl. 3, 559-578.
- [11]
Neammanee K., Rattanawong P., 2009. A constant on a uniform bound of a combinatorial central limit theorem. J. Math. Research 1, 91-103.
- [12]
Chen L.H.Y., Goldstein L., Shao Q.M., 2011. Normal approximation by Stein’s method. Springer.
- [13]
Chen L.H.Y., Fang X. (2015) 0n the error bound in a combinatorial central limit theorem. Bernoulli, 21, N.1, 335-359.
- [14]
Frolov A.N., 2014. Esseen type bounds of the remainder in a combinatorial CLT. J. Statist. Planning and Inference, 149, 90–97.
- [15]
Frolov A.N. (2015a) Bounds of the remainder in a combinatorial central limit theorem. Statist. Probab. Letters 105, 37-46.
- [16]
Frolov A.N. (2015b) On the probabilities of moderate deviations for combinatorial sums. Vestnik St. Petersburg University. Mathematics, 48, No. 1, 23-28. Allerton Press, Inc., 2015.
- [17]
Frolov A.N. (2017) On Esseen type inequalities for combinatorial random sums. Communications in Statistics -Theory and Methods. 46 (12), 5932-5940.
- [18]
Frolov A.N., Martikainen A.I., Steinebach J. (1997) Erdös–Rényi–Shepp type laws in non-i.i.d. case. Studia Sci. Math. Hungar. 34, 165–181.
