On large deviations for combinatorial sums

Andrei N. Frolov

arXiv:1901.04244·math.PR·January 15, 2019

TL;DR

This paper studies the asymptotic probabilities of large deviations in normalized combinatorial sums, identifying conditions under which these probabilities align with the standard normal tail, extending classical results.

Contribution

It introduces new conditions for large deviation asymptotics of combinatorial sums, expanding the understanding of their convergence to normal distribution tails.

Findings

1. Probabilities of large deviations match the standard normal tail in a specific zone.

2. Conditions similar to Bernstein's condition are sufficient for normal approximation.

3. The zone of normal convergence can grow at a power rate.

Abstract

We investigate asymptotic behaviour of probabilities of large deviations for normalized combinatorial sums. We find a zone in which these probabilities are equivalent to the tail of the standard normal law. Our conditions are similar to the classical Bernstein condition. The range of the zone of the normal convergence can be of power order.

Equations

$$S_{n}=\sum_{i=1}^{n}X_{ni\pi_{n}(i)}$$

$$\sum_{i=1}^{n}\mathbf{E}X_{nij}=\sum_{j=1}^{n}\mathbf{E}X_{nij}=0 \tag{1}$$

$$\mathbf{E}S_{n}=0,\quad \mathbf{D}S_{n}=\mathbf{E}S_{n}^{2}-(\mathbf{E}S_{n})^{2}=\frac{1}{n-1}\sum_{i,j=1}^{n}(\mathbf{E}X_{nij})^{2}+\frac{1}{n}\sum_{i,j=1}^{n}\mathbf{D}X_{nij}$$

$$\mathbf{D}S_{n}=\frac{1}{n(n-1)}\sum_{i,j=1}^{n}(\mathbf{E}X_{nij})^{2}+\frac{1}{n}\sum_{i,j=1}^{n}\mathbf{E}X_{nij}^{2}$$

$$B_{n}=\frac{1}{n}\sum_{i,j=1}^{n}\mathbf{E}X_{nij}^{2}$$

$$\left|\mathbf{E}X_{nij}^{k}\right|\leqslant Dk!\,M_{n}^{k-s}\,\mathbf{E}|X_{nij}|^{s} \tag{2}$$

$$\gamma_{n}=\max\left\{\max_{i,j}\sqrt{\frac{n}{B_{n}}}\,\mathbf{E}|X_{nij}|,\ \max_{i}\sum_{j=1}^{n}\frac{\mathbf{E}X_{nij}^{2}}{B_{n}},\ \max_{j}\sum_{i=1}^{n}\frac{\mathbf{E}X_{nij}^{2}}{B_{n}},\ \sum_{i,j=1}^{n}\frac{\mathbf{E}|X_{nij}|^{3}}{\sqrt{n}\,B_{n}^{3/2}}\right\}$$

$$\mathbf{P}(S_{n}\geqslant u_{n}\sqrt{B_{n}})\sim 1-\Phi(u_{n})\quad\mbox{as}\quad n\to\infty \tag{3}$$

$$\sup_{x}\left|\mathbf{P}(S_{n}<x\sqrt{B_{n}})-\Phi(x)\right|\leqslant A\frac{\gamma_{n}}{\sqrt{n}}$$

$$\left|\mathbf{E}X_{nij}^{k}\right|\leqslant Dk!\,M_{n}^{k}\min_{1\leqslant s\leqslant 3}\frac{\mathbf{E}|X_{nij}|^{s}}{M_{n}^{s}}$$

$$\left|\mathbf{E}X_{nij}^{k}\right|\leqslant Dk!\,M_{n}^{k}\min\left\{\frac{\mathbf{E}|X_{nij}|}{M_{n}},\left(\frac{\mathbf{E}|X_{nij}|}{M_{n}}\right)^{3}\right\}$$

$$\varphi_{nij}(z)=\mathbf{E}e^{zX_{nij}},\quad \varphi_{n}(z)=\mathbf{E}e^{z\frac{S_{n}}{\sqrt{B_{n}}}},\quad z\in\mathbb{C}$$

$$e^{-\frac{z^{2}}{2}}\varphi_{n}(z)=\frac{1}{n!}\sum_{p_{n}\in P_{n}}\prod_{i=1}^{n}\left\{e^{-\frac{z^{2}}{2n}}\varphi_{nip_{n}(i)}\left(\frac{z}{\sqrt{B_{n}}}\right)\right\}=\frac{1}{n!}\sum_{p_{n}\in P_{n}}\prod_{i=1}^{n}\left\{1+b_{nip_{n}(i)}\right\} \tag{4}$$

$$\left|\mathbf{E}X^{k}\right|\leqslant Dk!\,M^{k-s}\,\mathbf{E}|X|^{s} \tag{5}$$

$$\left|\mathbf{E}e^{uX-\frac{v^{2}}{2}}-1\right|\leqslant C_{1}\left(|u|\mathbf{E}|X|+|v|\right) \tag{6}$$

$$\left|\mathbf{E}e^{uX-\frac{v^{2}}{2}}-1-u\mathbf{E}X\right|\leqslant C_{2}\left(|u|^{2}\mathbf{E}X^{2}+|v|^{2}\right) \tag{7}$$

$$\left|\mathbf{E}e^{uX-\frac{v^{2}}{2}}-1-u\mathbf{E}X+\frac{v^{2}}{2}-\frac{u^{2}}{2}\mathbf{E}X^{2}\right|\leqslant C_{3}\left(|u|^{3}\mathbf{E}|X|^{3}+|v|^{3}\right) \tag{8}$$

$$\mathbf{E}|X|^{k}\leqslant\sqrt{\mathbf{E}X^{2k}}\leqslant\sqrt{D(2k)!\,M^{2k-1}\mathbf{E}|X|}\leqslant D_{1}k!\,(2M)^{k}$$

$$\sum_{k=0}^{\infty}\frac{|u|^{k}}{k!}\mathbf{E}|X|^{k}$$

$$e_{n}(z)=\sum_{k=0}^{n}\frac{z^{k}}{k!}$$

$$\mathbf{E}e^{|uX|}=\lim_{n\to\infty}\mathbf{E}e_{n}(|uX|)=\sum_{k=0}^{\infty}\frac{|u|^{k}}{k!}\mathbf{E}|X|^{k}$$

$$\mathbf{E}e^{uX}=\lim_{n\to\infty}\mathbf{E}e_{n}(uX)=\sum_{k=0}^{\infty}\frac{u^{k}}{k!}\mathbf{E}X^{k}$$

$$\left|\mathbf{E}W^{k}\right|=\left|\mathbf{E}\sum_{j=0}^{k}C_{k}^{j}(uX)^{j}\left(-\frac{v^{2}}{2}\right)^{k-j}\right|\leqslant\sum_{j=0}^{k}C_{k}^{j}|u|^{j}\left|\mathbf{E}X^{j}\right|\left(\frac{|v|^{2}}{2}\right)^{k-j}=T_{ks}^{\prime}+T_{ks}^{\prime\prime}$$

$$T_{ks}^{\prime}=\sum_{j=0}^{s-1}C_{k}^{j}|u|^{j}\left|\mathbf{E}X^{j}\right|\left(\frac{|v|^{2}}{2}\right)^{k-j},\quad T_{ks}^{\prime\prime}=\sum_{j=s}^{k}C_{k}^{j}|u|^{j}\left|\mathbf{E}X^{j}\right|\left(\frac{|v|^{2}}{2}\right)^{k-j}$$

$$\begin{aligned}T_{ks}^{\prime\prime}&\leqslant Dk!\,|u|^{s}\mathbf{E}|X|^{s}\sum_{j=s}^{k}C_{k}^{j}|uM|^{j-s}\left(\frac{|v|^{2}}{2}\right)^{k-j}\\&\leqslant Dk!\,|u|^{s}\mathbf{E}|X|^{s}\sum_{j=0}^{k-s}C_{k}^{j+s}|uM|^{j}\left(\frac{|v|^{2}}{2}\right)^{k-j-s}\\&\leqslant Dk!\,|u|^{s}\mathbf{E}|X|^{s}\sum_{j=0}^{k-s}k^{s}C_{k-s}^{j}|uM|^{j}\left(\frac{|v|^{2}}{2}\right)^{k-j-s}\\&\leqslant Dk!\,|u|^{s}\mathbf{E}|X|^{s}k^{s}\left(|uM|+\frac{|v|^{2}}{2}\right)^{k-s}\leqslant Dk!\,|u|^{s}\mathbf{E}|X|^{s}k^{s}4^{-k+s}\end{aligned}$$

$$T_{k1}^{\prime}=\frac{|v|^{2k}}{2^{k}}\leqslant 2|v|8^{-k}$$

$$\begin{aligned}\left|\mathbf{E}e^{uX-\frac{v^{2}}{2}}-1\right|&=\left|\sum_{k=1}^{\infty}\frac{1}{k!}\mathbf{E}W^{k}\right|\leqslant\sum_{k=1}^{\infty}\frac{1}{k!}\left|\mathbf{E}W^{k}\right|\leqslant\sum_{k=1}^{\infty}\frac{1}{k!}\left(T_{k1}^{\prime}+T_{k1}^{\prime\prime}\right)\\&\leqslant 4D|u|\mathbf{E}|X|\sum_{k=1}^{\infty}k4^{-k}+2|v|\sum_{k=1}^{\infty}\frac{8^{-k}}{k!}\leqslant C_{1}\left(|u|\mathbf{E}|X|+|v|\right)\end{aligned}$$

$$T_{k2}^{\prime}=k|u||\mathbf{E}X|\frac{|v|^{2k-2}}{2^{k-1}}+\frac{|v|^{2k}}{2^{k}}\leqslant\frac{k}{2}|u|^{2}(\mathbf{E}X)^{2}\frac{|v|^{2k-2}}{2^{k-1}}+\frac{k}{2}\frac{|v|^{2k-2}}{2^{k-1}}+\frac{|v|^{2k}}{2^{k}}$$

Full text

Andrei N. Frolov (This investigation was supported by RFBR, research project No. 18–01–00393.)

Dept. of Mathematics and Mechanics

St. Petersburg State University

St. Petersburg, Russia

E-mail address: [email protected]

AMS 2000 subject classification: 60F05

Key words: *combinatorial central limit theorem, combinatorial sum, large deviations*

1 Introduction

Let $\{\left\|X_{nij}\right\|,1\leqslant i,j\leqslant n,n=2,3,\ldots\}$ be a sequence of matrices of independent random variables and let $\{\vec{\pi}_{n}=(\pi_{n}(1),\pi_{n}(2),\ldots,\pi_{n}(n))$, $n=2,3,\ldots\}$ be a sequence of random permutations of the numbers $1,2,\ldots,n$. Assume that $\vec{\pi}_{n}$ is uniformly distributed on the set of permutations of $1,2,\ldots,n$ and is independent of $\left\|X_{nij}\right\|$ for all $n$. Define the combinatorial sum $S_{n}$ by the relation

$$S_{n}=\sum_{i=1}^{n}X_{ni\pi_{n}(i)}.$$

Under certain conditions, a sequence of distributions of combinatorial sums converges weakly to the standard normal law. Every such result is called a combinatorial central limit theorem (CLT).

Investigations in this direction have a long history. One can find results on the combinatorial CLT in Wald and Wolfowitz [1], Noether [2], Hoeffding [3], Motoo [4], Kolchin and Chistyakov [5]. Further, non-asymptotic Esseen type bounds have been derived for the accuracy of the normal approximation of distributions of combinatorial sums. Such results have been obtained in Bolthausen [6], von Bahr [7], Ho and Chen [8], Goldstein [9], Neammanee and Suntornchost [10], Neammanee and Rattanawong [11], Chen, Goldstein and Shao [12], Chen and Fang [13], Frolov [14, 15], and in Frolov [17] for random combinatorial sums.

Note that if $X_{nij}$ are identically distributed for all $1\leqslant j\leqslant n$ and $n$, then the combinatorial sum has the same distribution as a sum of independent random variables. This case is well studied, but it has to be kept in mind when assessing the optimality of the derived results.

Apart from some special cases, combinatorial sums do not have independent increments. Hence, it is difficult to use the classical proofs of Esseen type inequalities, which are based on bounds for differences of characteristic functions (c.f.). One usually applies the Stein method instead. For combinatorial sums, it yields Esseen type inequalities for random variables with finite third moments. Applying truncation techniques, Frolov [14, 15] generalized these results to the case of finite moments of order $2+\delta$ and to infinite variances as well.

Every bound in the CLT similar to the Esseen inequality implies that, in a logarithmic zone, the probabilities of large deviations behave asymptotically like the tail of the normal law. Such results are usually called moderate deviations. Moderate deviations for combinatorial sums have been investigated in Frolov [16].

In this paper, we derive new results on the asymptotic behaviour of large deviations of combinatorial sums in power zones. Note that the ranges of the power zones are powers of a characteristic similar to the Lyapunov ratio. Indeed, we deal with non-identically distributed random variables, and even for sums of independent random variables the ranges of the zones of normal convergence depend on the Lyapunov ratios. For identically distributed random variables, the ranges are therefore powers of the number of summands. But this last case corresponds to the classical theory for sums of independent random variables and is therefore not new.

In our proofs, we use the method of conjugate distributions. Note that von Bahr [7] developed a method to bound distances between the c.f.'s of normalized combinatorial sums and that of the normal law. Assuming that the random variables are bounded or satisfy a certain analogue of the classical Bernstein condition, we conclude that the moment generating functions (m.g.f.) of normalized combinatorial sums are analytic in a circle of the complex plane. Adapting von Bahr's method, we bound the difference between the m.g.f.'s in some circle. By analyticity, this also gives bounds for the derivatives of the m.g.f.'s. Hence, we arrive at the desired asymptotics for the m.g.f.'s and for their first and second logarithmic derivatives, which are the means and variances of the random variables conjugate to the normalized combinatorial sums. Then we estimate the closeness of the distributions of the conjugate random variables to the standard normal law. Using the relationship between distributions and their conjugates, we derive the asymptotics of the large deviations under consideration.

2 Results

Let $\{\left\|X_{nij}\right\|,1\leqslant i,j\leqslant n,n=2,3,\ldots\}$ be a sequence of matrices of independent random variables such that

$$\sum_{i=1}^{n}\mathbf{E}X_{nij}=\sum_{j=1}^{n}\mathbf{E}X_{nij}=0 \tag{1}$$

for all $n$. Let $\{\vec{\pi}_{n}=(\pi_{n}(1),\pi_{n}(2),\ldots,\pi_{n}(n))$, $n=2,3,\ldots\}$ be a sequence of random permutations of the numbers $1,2,\ldots,n$. Assume that $\vec{\pi}_{n}$ is uniformly distributed on the set $P_{n}$ of permutations and is independent of $\left\|X_{nij}\right\|$ for all $n$. Put

$$S_{n}=\sum_{i=1}^{n}X_{ni\pi_{n}(i)}.$$

It is not difficult to check that

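$$\mathbf{E}S_{n}=0,\quad \mathbf{D}S_{n}=\mathbf{E}S_{n}^{2}-(\mathbf{E}S_{n})^{2}=\frac{1}{n-1}\sum_{i,j=1}^{n}(\mathbf{E}X_{nij})^{2}+\frac{1}{n}\sum_{i,j=1}^{n}\mathbf{D}X_{nij}.$$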

Hence, condition (1) yields that combinatorial sums are centered at zero. Moreover,

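$$\mathbf{D}S_{n}=\frac{1}{n(n-1)}\sum_{i,j=1}^{n}(\mathbf{E}X_{nij})^{2}+\frac{1}{n}\sum_{i,j=1}^{n}\mathbf{E}X_{nij}^{2}.$$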

If $\mathbf{D}S_{n}\to\infty$ as $n\to\infty$ , then the main part of the variance is the normalized sum of second moments

$$B_{n}=\frac{1}{n}\sum_{i,j=1}^{n}\mathbf{E}X_{nij}^{2}.$$

Therefore, in the sequel, we will use $\{B_{n}\}$ as norming sequence for $S_{n}$ .
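The centering and the variance identity can be checked exactly for small $n$ in the degenerate case $X_{nij}=c_{nij}$, where $\mathbf{D}X_{nij}=0$ and the variance of $S_{n}$ reduces to $\frac{1}{n-1}\sum_{i,j}c_{nij}^{2}$. The following Python sketch enumerates all permutations with exact rational arithmetic. The helper names and the sample matrix are illustrative choices, not part of the paper.

```python
from fractions import Fraction
from itertools import permutations


def doubly_center(a):
    """Subtract row and column means so that every row and column sums to zero."""
    a = [[Fraction(x) for x in row] for row in a]
    n = len(a)
    row = [sum(r) / n for r in a]
    col = [sum(a[i][j] for i in range(n)) / n for j in range(n)]
    tot = sum(row) / n
    return [[a[i][j] - row[i] - col[j] + tot for j in range(n)] for i in range(n)]


def exact_moments(c):
    """Mean and variance of S_n = sum_i c[i][pi(i)] over all n! permutations."""
    n = len(c)
    sums = [sum(c[i][p[i]] for i in range(n)) for p in permutations(range(n))]
    mean = sum(sums) / len(sums)
    var = sum((s - mean) ** 2 for s in sums) / len(sums)
    return mean, var


def formula_variance(c):
    """Variance from the identity above with D X_nij = 0 (constant entries)."""
    n = len(c)
    return sum(x * x for row in c for x in row) / (n - 1)


if __name__ == "__main__":
    c = doubly_center([[1, 4, 2, 0], [3, 3, 5, 1], [0, 2, 6, 2], [7, 1, 1, 4]])
    mean, var = exact_moments(c)
    print("mean:", mean, "variance:", var, "formula:", formula_variance(c))
```

Enumeration costs $n!$ evaluations, so this is only a sanity check for very small matrices.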

Our main result is as follows.

Theorem 1.

Let $\{M_{n}\}$ be a non-decreasing sequence of positive numbers such that for $s=1,2,3$ , inequalities

$$\left|\mathbf{E}X_{nij}^{k}\right|\leqslant Dk!\,M_{n}^{k-s}\,\mathbf{E}|X_{nij}|^{s} \tag{2}$$

hold for all $k\geqslant s$ , $1\leqslant i,j\leqslant n$ and $n\geqslant 2$ , where $D$ is an absolute positive constant. Put

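$$\gamma_{n}=\max\left\{\max_{i,j}\sqrt{\frac{n}{B_{n}}}\,\mathbf{E}|X_{nij}|,\ \max_{i}\sum_{j=1}^{n}\frac{\mathbf{E}X_{nij}^{2}}{B_{n}},\ \max_{j}\sum_{i=1}^{n}\frac{\mathbf{E}X_{nij}^{2}}{B_{n}},\ \sum_{i,j=1}^{n}\frac{\mathbf{E}|X_{nij}|^{3}}{\sqrt{n}\,B_{n}^{3/2}}\right\}.$$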

Then for every sequence of positive numbers $\{u_{n}\}$ with $u_{n}\to\infty$ , $u_{n}^{3}=o(\sqrt{n}/\gamma_{n})$ and $u_{n}=o(\sqrt{B_{n}}/M_{n})$ as $n\to\infty$ , relation

$$\mathbf{P}(S_{n}\geqslant u_{n}\sqrt{B_{n}})\sim 1-\Phi(u_{n})\quad\mbox{as}\quad n\to\infty, \tag{3}$$

holds, where $\Phi(x)$ is the standard normal distribution function.
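As a rough numerical illustration of the statement, one can compare a simulated tail of a combinatorial sum with the normal tail at a fixed finite $n$. The sketch below uses the degenerate matrix $c_{ij}=a_{i}a_{j}$ with centered scores $a_{i}$, so that all row and column sums vanish and $B_{n}=\frac{1}{n}\bigl(\sum_{i}a_{i}^{2}\bigr)^{2}$. The matrix, the sample sizes and the function names are assumptions made for the illustration. The theorem itself is asymptotic, with $u_{n}\to\infty$, so a single finite $n$ can only suggest the behaviour.

```python
import math
import random


def normal_tail(u):
    """1 - Phi(u) for the standard normal law."""
    return 0.5 * math.erfc(u / math.sqrt(2.0))


def simulate_tail(n, u, trials, seed=0):
    """Monte Carlo estimate of P(S_n >= u * sqrt(B_n)) for c[i][j] = a[i] * a[j]."""
    rng = random.Random(seed)
    a = [i - (n - 1) / 2 for i in range(n)]   # centered scores, sum(a) == 0
    b_n = sum(x * x for x in a) ** 2 / n      # (1/n) * sum_{i,j} c_ij^2
    threshold = u * math.sqrt(b_n)
    perm = list(range(n))
    hits = 0
    for _ in range(trials):
        rng.shuffle(perm)                     # uniform random permutation
        s = sum(a[i] * a[perm[i]] for i in range(n))
        if s >= threshold:
            hits += 1
    return hits / trials


if __name__ == "__main__":
    n, u = 50, 1.5
    print("simulated tail:", simulate_tail(n, u, 20000))
    print("normal tail:   ", normal_tail(u))
```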

Note that $\gamma_{n}\geqslant 1$. This follows from the inequality $\max\limits_{i}\sum\limits_{j=1}^{n}\mathbf{E}X_{nij}^{2}\geqslant B_{n}$. Indeed, if $\max\limits_{i}\sum\limits_{j=1}^{n}\mathbf{E}X_{nij}^{2}<B_{n}$, then summing over $i$ gives the contradiction $\sum\limits_{i,j=1}^{n}\mathbf{E}X_{nij}^{2}<nB_{n}=\sum\limits_{i,j=1}^{n}\mathbf{E}X_{nij}^{2}$.

von Bahr [7] proved the following Esseen type inequality:

$$\sup_{x}\left|\mathbf{P}(S_{n}<x\sqrt{B_{n}})-\Phi(x)\right|\leqslant A\frac{\gamma_{n}}{\sqrt{n}},$$

where $A$ is an absolute positive constant. Hence the condition $u_{n}^{3}=o(\sqrt{n}/\gamma_{n})$ as $n\to\infty$ is natural for relation (3), which gives exact (non-logarithmic) asymptotics of large deviations. For identically distributed $X_{nij}$, this condition becomes $u_{n}=o(n^{1/6})$ as $n\to\infty$.

Note that the conditions $u_{n}^{3}=o(\sqrt{n}/\gamma_{n})$ and $u_{n}\to\infty$ as $n\to\infty$ imply $\gamma_{n}/\sqrt{n}\to 0$ as $n\to\infty$ .

Theorem 1 is stronger than the results in Frolov [14] since the zone of normal convergence may be of power order while it is logarithmic in [14]. Of course, this requires stronger moment assumptions.

Condition (2) is an analogue of the Bernstein condition, which is a form of existence of an exponential moment. In the classical theory, one mainly deals with centered random variables, and the Bernstein condition implies that the logarithm of the m.g.f. is asymptotically a quadratic function at zero. For the combinatorial CLT, it is essential that summands may be non-centered and sometimes even degenerate. In this case, the logarithm of the m.g.f. may be a linear function in a neighbourhood of zero provided the mean is not zero.

One can rewrite inequalities (2) for $k\geqslant 3$ as follows:

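$$\left|\mathbf{E}X_{nij}^{k}\right|\leqslant Dk!\,M_{n}^{k}\min_{1\leqslant s\leqslant 3}\frac{\mathbf{E}|X_{nij}|^{s}}{M_{n}^{s}}.$$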

Hence, the Lyapunov inequality implies that the next condition is sufficient for (2): the inequalities $\mathbf{E}X_{nij}^{2}\leqslant 2DM_{n}\mathbf{E}|X_{nij}|$ and

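$$\left|\mathbf{E}X_{nij}^{k}\right|\leqslant Dk!\,M_{n}^{k}\min\left\{\frac{\mathbf{E}|X_{nij}|}{M_{n}},\left(\frac{\mathbf{E}|X_{nij}|}{M_{n}}\right)^{3}\right\}$$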

hold for all $k\geqslant 3$ , $1\leqslant i,j\leqslant n$ and $n\geqslant 2$ .

Consider two important examples in which condition (2) is satisfied.

1. Bounded random variables. If there exists a non-decreasing sequence of positive constants $\{M_{n}\}$ such that $\mathbf{P}(|X_{nij}|\leqslant M_{n})=1$ for all $1\leqslant i,j\leqslant n$ and $n\geqslant 2$, then condition (2) holds. In the degenerate case with $\mathbf{P}(X_{nij}=c_{nij})=1$ for all $1\leqslant i,j\leqslant n$ and $n\geqslant 2$, condition (2) is fulfilled with $M_{n}=\max_{i,j}|c_{nij}|$ for every $n$.

2. Exponential random variables. Let $\xi$ and $\eta$ be random variables having exponential distributions with parameters $\alpha$ and $\beta$, respectively. Assume that each random variable in every matrix $\|X_{nij}\|$ has the distribution of one of the four random variables $\xi$, $-\xi$, $\eta$ and $-\eta$. Since $\mathbf{E}\xi^{k}=k!\,\alpha^{-k}$ and $\mathbf{E}\eta^{k}=k!\,\beta^{-k}$ for all $k$, condition (2) holds with $M_{n}=1/\min(\alpha,\beta)$. One can easily extend this example to a larger number of exponential distributions used to construct the matrices of $X$'s. The parameters of these distributions may depend on $n$. Moreover, one can easily replace the exponential distributions by Gamma ones.
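To see why $M_{n}=1/\min(\alpha,\beta)$ works, here is a short check of condition (2) for $X=\pm\xi$, using the standard moment formula $\mathbf{E}\xi^{k}=k!\,\alpha^{-k}$ of the exponential law with rate $\alpha$. That $D=1$ suffices is an observation made here, not a constant taken from the paper.

```latex
\begin{align*}
\left|\mathbf{E}(\pm\xi)^{k}\right| = \mathbf{E}\xi^{k}
  &= k!\,\alpha^{-(k-s)}\,\alpha^{-s} \\
  &\leqslant k!\,M_{n}^{k-s}\,\alpha^{-s}
     && \text{since } \alpha^{-1}\leqslant M_{n} \\
  &\leqslant k!\,M_{n}^{k-s}\,s!\,\alpha^{-s}
   = k!\,M_{n}^{k-s}\,\mathbf{E}|\xi|^{s},
     && k\geqslant s,\ s=1,2,3.
\end{align*}
```

The same computation with $\beta$ in place of $\alpha$ covers $\pm\eta$.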

Note that $\gamma_{n}$ has an order of $\max\{\sqrt{n}/B_{n},(\sqrt{n}/B_{n})^{3}\}$ in the last example. It is also clear that the behaviour of $\gamma_{n}$ is similar when every random variable $X_{nij}$ has one of $k$ given distributions. In this case, one speaks of $k$-sequences of matrices $\{\|X_{nij}\|\}$.

3 Proofs

For all $i$ , $j$ and $n$ , put

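$$\varphi_{nij}(z)=\mathbf{E}e^{zX_{nij}},\quad \varphi_{n}(z)=\mathbf{E}e^{z\frac{S_{n}}{\sqrt{B_{n}}}},\quad z\in\mathbb{C},$$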

where $\mathbb{C}$ is the set of complex numbers. We have

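$$e^{-\frac{z^{2}}{2}}\varphi_{n}(z)=\frac{1}{n!}\sum_{p_{n}\in P_{n}}\prod_{i=1}^{n}\left\{e^{-\frac{z^{2}}{2n}}\varphi_{nip_{n}(i)}\left(\frac{z}{\sqrt{B_{n}}}\right)\right\}=\frac{1}{n!}\sum_{p_{n}\in P_{n}}\prod_{i=1}^{n}\left\{1+b_{nip_{n}(i)}\right\}. \tag{4}$$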

Note that the last sum is the permanent of the matrix $\|1+b_{nij}\|$ . To investigate its behaviour we will use the following result.

Lemma 1.

Let $X$ be a random variable such that for $s=1,2,3$ the inequalities

$$\left|\mathbf{E}X^{k}\right|\leqslant Dk!\,M^{k-s}\,\mathbf{E}|X|^{s} \tag{5}$$

hold for all $k\geqslant s$ , where $D$ and $M$ are positive constants.

Then $\mathbf{E}e^{uX}$ is an analytic function in the circle $|u|\leqslant 1/(4M)$ and for every $u,v\in\mathbb{C}$ with $|v|\leqslant 1/2$ and $|u|\leqslant 1/(8M)$ , the inequalities

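$$\left|\mathbf{E}e^{uX-\frac{v^{2}}{2}}-1\right|\leqslant C_{1}\left(|u|\mathbf{E}|X|+|v|\right), \tag{6}$$

$$\left|\mathbf{E}e^{uX-\frac{v^{2}}{2}}-1-u\mathbf{E}X\right|\leqslant C_{2}\left(|u|^{2}\mathbf{E}X^{2}+|v|^{2}\right), \tag{7}$$

$$\left|\mathbf{E}e^{uX-\frac{v^{2}}{2}}-1-u\mathbf{E}X+\frac{v^{2}}{2}-\frac{u^{2}}{2}\mathbf{E}X^{2}\right|\leqslant C_{3}\left(|u|^{3}\mathbf{E}|X|^{3}+|v|^{3}\right) \tag{8}$$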

hold, where the constants $C_{i}$ depend on $D$ and do not depend on $M$.

Proof. By inequality (5) and Stirling’s formula, we have

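$$\mathbf{E}|X|^{k}\leqslant\sqrt{\mathbf{E}X^{2k}}\leqslant\sqrt{D(2k)!\,M^{2k-1}\mathbf{E}|X|}\leqslant D_{1}k!\,(2M)^{k}$$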

for all $k\geqslant 1$ , where the constant $D_{1}$ depends on $D$ , $M$ and $\mathbf{E}|X|$ . Hence, the series

$$\sum_{k=0}^{\infty}\frac{|u|^{k}}{k!}\mathbf{E}|X|^{k}$$

converges in the circle $|u|\leqslant 1/(4M)$ . Put

$$e_{n}(z)=\sum_{k=0}^{n}\frac{z^{k}}{k!}.$$

Then $e_{n}(|uX|)\uparrow e^{|uX|}$ a.s. The monotone convergence theorem yields that

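$$\mathbf{E}e^{|uX|}=\lim_{n\to\infty}\mathbf{E}e_{n}(|uX|)=\sum_{k=0}^{\infty}\frac{|u|^{k}}{k!}\mathbf{E}|X|^{k}$$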

in the circle $|u|\leqslant 1/(4M)$. In view of $|e^{uX}|\leqslant e^{|uX|}$, the Lebesgue dominated convergence theorem implies that

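$$\mathbf{E}e^{uX}=\lim_{n\to\infty}\mathbf{E}e_{n}(uX)=\sum_{k=0}^{\infty}\frac{u^{k}}{k!}\mathbf{E}X^{k}$$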

in the circle $|u|\leqslant 1/(4M)$ .

Put $W=uX-v^{2}/2$ . For $s=1,2,3$ and $k\geqslant s$ , we have

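$$\left|\mathbf{E}W^{k}\right|=\left|\mathbf{E}\sum_{j=0}^{k}C_{k}^{j}(uX)^{j}\left(-\frac{v^{2}}{2}\right)^{k-j}\right|\leqslant\sum_{j=0}^{k}C_{k}^{j}|u|^{j}\left|\mathbf{E}X^{j}\right|\left(\frac{|v|^{2}}{2}\right)^{k-j}=T_{ks}^{\prime}+T_{ks}^{\prime\prime},$$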

where

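$$T_{ks}^{\prime}=\sum_{j=0}^{s-1}C_{k}^{j}|u|^{j}\left|\mathbf{E}X^{j}\right|\left(\frac{|v|^{2}}{2}\right)^{k-j},\quad T_{ks}^{\prime\prime}=\sum_{j=s}^{k}C_{k}^{j}|u|^{j}\left|\mathbf{E}X^{j}\right|\left(\frac{|v|^{2}}{2}\right)^{k-j}.$$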

By inequalities (5), we get

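$$\begin{aligned}T_{ks}^{\prime\prime}&\leqslant Dk!\,|u|^{s}\mathbf{E}|X|^{s}\sum_{j=s}^{k}C_{k}^{j}|uM|^{j-s}\left(\frac{|v|^{2}}{2}\right)^{k-j}\\&\leqslant Dk!\,|u|^{s}\mathbf{E}|X|^{s}\sum_{j=0}^{k-s}C_{k}^{j+s}|uM|^{j}\left(\frac{|v|^{2}}{2}\right)^{k-j-s}\\&\leqslant Dk!\,|u|^{s}\mathbf{E}|X|^{s}\sum_{j=0}^{k-s}k^{s}C_{k-s}^{j}|uM|^{j}\left(\frac{|v|^{2}}{2}\right)^{k-j-s}\\&\leqslant Dk!\,|u|^{s}\mathbf{E}|X|^{s}k^{s}\left(|uM|+\frac{|v|^{2}}{2}\right)^{k-s}\leqslant Dk!\,|u|^{s}\mathbf{E}|X|^{s}k^{s}4^{-k+s}\end{aligned}$$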

for all $k\geqslant s$ and $s=1,2,3$ .

Since $|v|\leqslant 1/2$ , we have

$$T_{k1}^{\prime}=\frac{|v|^{2k}}{2^{k}}\leqslant 2|v|8^{-k}$$

for all $k\geqslant 1$ .

Hence,

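$$\begin{aligned}\left|\mathbf{E}e^{uX-\frac{v^{2}}{2}}-1\right|&=\left|\sum_{k=1}^{\infty}\frac{1}{k!}\mathbf{E}W^{k}\right|\leqslant\sum_{k=1}^{\infty}\frac{1}{k!}\left|\mathbf{E}W^{k}\right|\leqslant\sum_{k=1}^{\infty}\frac{1}{k!}\left(T_{k1}^{\prime}+T_{k1}^{\prime\prime}\right)\\&\leqslant 4D|u|\mathbf{E}|X|\sum_{k=1}^{\infty}k4^{-k}+2|v|\sum_{k=1}^{\infty}\frac{8^{-k}}{k!}\leqslant C_{1}\left(|u|\mathbf{E}|X|+|v|\right).\end{aligned}$$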

The first inequality follows.

Making use of the inequality $2a\leqslant a^{2}+1$ for $a=|u||\mathbf{E}X|$ , the Lyapunov inequality and $|v|\leqslant 1/2$ , we obtain

[TABLE]

for all $k\geqslant 2$ .

It follows that

[TABLE]

The second inequality is proved.

Applying the inequality $2a\leqslant a^{2}+1$ for $a=|u||\mathbf{E}X|$ and the Lyapunov inequality, we have

[TABLE]

Using $2a\leqslant a^{2}+1$ for $a=|u|^{2}\mathbf{E}X^{2}$ , the Lyapunov inequality, inequality (5) and $|v|\leqslant 1/2$ , we further get

[TABLE]

for all $k\geqslant 3$ .

It yields that

[TABLE]

The lemma is proved. $\Box$

Proof of Theorem 1. By Lemma 1 with $X=X_{nij}$ , $u=z/\sqrt{B_{n}}$ , $v=z/\sqrt{n}$ and $M=M_{n}$ , for all $n$ , $i$ and $j$ , the inequalities

[TABLE]

hold for every $z$ in the circle $|z|\leqslant\min\{\sqrt{n},\sqrt{B_{n}}/M_{n}\}/8$ . Relations (7) and (1) imply that

[TABLE]

and

[TABLE]

It follows from relations (8) and (1) that

[TABLE]

From relations (6) and (9)—(11), the definition of $\gamma_{n}$ and $\gamma_{n}\geqslant 1$ , we have

[TABLE]

Note that the function $\varphi_{n}(it)$, $t\in\mathbb{R}$, is the c.f. of the normalized combinatorial sum. In von Bahr [7], relations (4) and (12)–(15) for $z=it$ and $t\geqslant 0$ were used to bound the distance between $\varphi_{n}(it)$ and the c.f. of the standard normal law. The bounds for $b_{nij}$ obtained there coincide with ours provided $t$ is replaced by $|z|$. Hence, we borrow one further bound from [7], formally replacing $t$ by $|z|$. We use the first formula on p. 137 in [7] with $2C_{3}\gamma_{n}$ instead of $\delta$. Then we have

[TABLE]

for all $z$ in the circle $|z|\leqslant\min\{\sqrt{n},\sqrt{B_{n}}/M_{n}\}/8$ , where $C_{4}$ is an absolute positive constant. Hence,

[TABLE]

where $C_{5}=\max\{8eC_{3},4e^{2}C_{4}\}$ . If $|z|\leqslant\sqrt{n}/(2C_{5}\gamma_{n})$ , then

[TABLE]

It follows that

[TABLE]

for all $z$ in the circle $|z|\leqslant C_{7}\min\{\sqrt{n}/\gamma_{n},\sqrt{B_{n}}/M_{n}\}=y_{n}$ .

Let $\{x_{n}\}$ be a sequence of positive numbers that will be chosen later. Assume that $x_{n}\leqslant y_{n}/16$ . The function $f_{n}(z)=e^{-\frac{z^{2}}{2}}\varphi_{n}(z)-1$ is analytic in the circle $|z|\leqslant 16x_{n}$ . Hence,

[TABLE]

where, by the Cauchy inequalities, the coefficients $a_{nk}$ satisfy the relations

[TABLE]

Put

[TABLE]

Then, in the circle $|z|\leqslant 4x_{n}$ , the inequalities

[TABLE]

hold. This and inequality (16) yield that

[TABLE]

for $|z|\leqslant 4x_{n}$ . Hence,

[TABLE]

for $|z|\leqslant 4x_{n}$ .

Further, making use of relations (17)–(19), we get

[TABLE]

for $|z|\leqslant 4x_{n}$ . It follows that

[TABLE]

for $|z|\leqslant 4x_{n}$ .

Let $\{h_{n}\}$ be a sequence of positive numbers. Let $\overline{S}_{n}$ be a random variable conjugate to $S_{n}/\sqrt{B_{n}}$ , i.e. $\overline{S}_{n}$ has the following distribution function

[TABLE]

Note that $\mathbf{E}\overline{S}_{n}=m_{n}(h_{n})$ and $\mathbf{D}\overline{S}_{n}=\sigma^{2}_{n}(h_{n})$. In the sequel, we take $h_{n}$ such that relations (19) and (20) yield $m_{n}(h_{n})=h_{n}+o(1)$ and $\sigma^{2}_{n}(h_{n})=1+o(1)$. Hence, we investigate the distance between the standard normal law and the distribution of $\overline{S}_{n}$ centered and normalized by the main terms of the mean and the variance. Denote

[TABLE]

and estimate

[TABLE]

Put

[TABLE]

It is clear that $\psi_{n}(it)$ is a c.f. of the random variable $\overline{S}_{n}-m_{n}(h_{n})$ .
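The conjugate law and its first two moments can be made concrete for a small degenerate matrix. The sketch below assumes the standard Cramér (exponential tilting) form of the conjugate law, with weights proportional to $e^{hx}$ times the original ones. That form, all function names and the sample matrix are assumptions of this illustration. For simplicity it tilts the unnormalized sum $S_{n}$ rather than $S_{n}/\sqrt{B_{n}}$. It then checks numerically that the tilted mean and variance agree with the first and second derivatives of the logarithm of the m.g.f.

```python
import math
from collections import Counter
from itertools import permutations


def law_of_sum(c):
    """Exact law of S_n = sum_i c[i][pi(i)] for a small integer matrix c."""
    n = len(c)
    counts = Counter(sum(c[i][p[i]] for i in range(n)) for p in permutations(range(n)))
    total = math.factorial(n)
    return {s: k / total for s, k in counts.items()}


def tilt(law, h):
    """Conjugate law: probabilities proportional to exp(h * x) * P(S = x)."""
    weights = {x: p * math.exp(h * x) for x, p in law.items()}
    mgf = sum(weights.values())
    return {x: w / mgf for x, w in weights.items()}


def mean_var(law):
    """Mean and variance of a finite law given as {value: probability}."""
    m = sum(x * p for x, p in law.items())
    v = sum((x - m) ** 2 * p for x, p in law.items())
    return m, v


def log_mgf(law, h):
    """Logarithm of the moment generating function at a real point h."""
    return math.log(sum(p * math.exp(h * x) for x, p in law.items()))


if __name__ == "__main__":
    law = law_of_sum([[0, 1, 3], [2, 0, 1], [1, 4, 0]])
    h = 0.2
    m, v = mean_var(tilt(law, h))
    d1 = (log_mgf(law, h + 1e-4) - log_mgf(law, h - 1e-4)) / 2e-4
    print("tilted mean:", m, "numerical (log mgf)':", d1, "tilted variance:", v)
```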

We have

[TABLE]

for $|z|+h_{n}\leqslant 4x_{n}$ . Since

[TABLE]

we obtain $|(z+h_{n})^{k}-h_{n}^{k}|\leqslant k|z|(4x_{n})^{k-1}$ for $|z|+h_{n}\leqslant 4x_{n}$ . It follows from relations (16) and (17) that

[TABLE]

for $|z|+h_{n}\leqslant 4x_{n}$ . Putting $z=it$ , we get

[TABLE]

for all $|t|\leqslant 2x_{n}$ and $|h_{n}|\leqslant 2x_{n}$ . By the Esseen inequality, we have

[TABLE]

Furthermore,

[TABLE]

We have

[TABLE]

provided $m_{n}(h_{n})\to\infty$ . Moreover,

[TABLE]

Put $x_{n}=u_{n}\varrho_{n}$, where $\varrho_{n}\to\infty$ slowly enough to satisfy $x_{n}\to\infty$, $x_{n}^{3}=o(\sqrt{n}/\gamma_{n})$ and $x_{n}=o(\sqrt{B_{n}}/M_{n})$. Note that in view of relation (16), we have $g_{n}(8x_{n})=o(1)$. Let $h_{n}$ be a solution of the equation

[TABLE]

The function $m_{n}(h)$ is strictly increasing, $m_{n}(0)=0$ and, by relations (19) and (16), the inequalities $m_{n}(4x_{n})=4x_{n}+o(x_{n}^{-1})\geqslant 2x_{n}>u_{n}$ hold for all sufficiently large $n$ . It follows that the unique solution of equation (25) exists for all sufficiently large $n$ . Moreover, relation (19) yields that

[TABLE]

and

[TABLE]

It follows from relations (21)—(24) and (16) that

[TABLE]

Theorem 1 is proved. $\Box$
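The choice of $h_{n}$ in the proof relies on the fact that the mean $m_{n}(h)$ of the conjugate law is strictly increasing in $h$. Assuming the equation for $h_{n}$ has the form $m_{n}(h)=u_{n}$, as the existence argument suggests, its root can be found by bisection. The sketch below does this for a generic finite law. The function names and the three-point example are hypothetical and serve only to illustrate the monotonicity argument.

```python
import math


def tilted_mean(values, probs, h):
    """m(h) = E[X e^{hX}] / E[e^{hX}] for a finite law."""
    w = [p * math.exp(h * x) for x, p in zip(values, probs)]
    return sum(x * wi for x, wi in zip(values, w)) / sum(w)


def solve_tilt(values, probs, u, tol=1e-12):
    """Find h >= 0 with m(h) = u by bisection.

    Requires m(0) <= u < max(values); m is increasing in h, so the root is unique.
    """
    lo, hi = 0.0, 1.0
    while tilted_mean(values, probs, hi) < u:   # bracket the root
        hi *= 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if tilted_mean(values, probs, mid) < u:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    # Symmetric three-point law: here m(h) = tanh(h / 2).
    h = solve_tilt([-1, 0, 1], [0.25, 0.5, 0.25], 0.5)
    print("h =", h, "m(h) =", tilted_mean([-1, 0, 1], [0.25, 0.5, 0.25], h))
```

For the three-point law in the example, $m(h)=\tanh(h/2)$, so the root for $u=1/2$ is $h=\ln 3$.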

Finally, we mention some unsolved problems. In Frolov, Martikainen and Steinebach [18], one can find more exact results on large deviations for sums of independent random variables in the scheme of series. There, the conditions are imposed on the logarithms of the m.g.f.'s of the summands. At present, we cannot adapt those techniques to combinatorial sums. We see from relation (4) that the m.g.f. of $S_{n}/\sqrt{B_{n}}$ is, up to the factor $1/n!$, the permanent of the matrix $\|\mathbf{E}e^{zX_{nij}/\sqrt{B_{n}}}\|$. Above, the method used to investigate the behaviour of this permanent led to bounds with $\gamma_{n}/\sqrt{n}$ instead of analogues of the Lyapunov ratios. The second problem is that the proof in [18] involves some bounds in the CLT whose analogues for combinatorial sums are unknown. Solutions of these problems could yield more exact results under weaker conditions.
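The permanent representation of the m.g.f. can be checked numerically in the degenerate case $X_{nij}=c_{ij}$, where $\mathbf{E}e^{zS_{n}}=\frac{1}{n!}\,\mathrm{per}\,\|e^{zc_{ij}}\|$. The sketch below compares a direct average over all permutations with the permanent computed by Ryser's inclusion-exclusion formula. The function names and the sample matrix are illustrative, and the normalization by $\sqrt{B_{n}}$ is absorbed into $z$.

```python
import cmath
from itertools import combinations, permutations
from math import factorial


def permanent_ryser(a):
    """Permanent via Ryser's formula: sum over column subsets with alternating signs."""
    n = len(a)
    total = 0
    for size in range(1, n + 1):
        for cols in combinations(range(n), size):
            prod = 1
            for row in a:
                prod *= sum(row[j] for j in cols)
            total += (-1) ** (n - size) * prod
    return total


def mgf_by_enumeration(c, z):
    """E exp(z * S_n) for constant entries, averaging over all n! permutations."""
    n = len(c)
    vals = [cmath.exp(z * sum(c[i][p[i]] for i in range(n)))
            for p in permutations(range(n))]
    return sum(vals) / factorial(n)


def mgf_by_permanent(c, z):
    """The same quantity as per(exp(z * c_ij)) / n!."""
    n = len(c)
    m = [[cmath.exp(z * c[i][j]) for j in range(n)] for i in range(n)]
    return permanent_ryser(m) / factorial(n)


if __name__ == "__main__":
    c = [[0.0, 1.0, -1.0], [2.0, 0.5, 0.0], [-1.0, 0.0, 1.5]]
    z = 0.3 + 0.2j
    print(mgf_by_enumeration(c, z), mgf_by_permanent(c, z))
```

Ryser's formula needs on the order of $2^{n}n^{2}$ operations in this simple form, compared with $n!\,n$ for direct enumeration, which is one reason exact evaluation of the permanent is impractical for large $n$ and bounds are used instead.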

References

[1]

Wald A., Wolfowitz J., 1944. Statistical tests based on permutations of observations. Ann. Math. Statist. 15, 358–372.

[2]

Noether G.E., 1949. On a theorem by Wald and Wolfowitz. Ann. Math. Statist. 20, 455–458.

[3]

Hoeffding W., 1951. A combinatorial central limit theorem. Ann. Math. Statist. 22, 558–566.

[4]

Motoo M., 1957. On Hoeffding’s combinatorial central limit theorem. Ann. Inst. Statist. Math. 8, 145–154.

[5]

Kolchin V.F., Chistyakov V.P., 1973. On a combinatorial limit theorem. Theor. Probab. Appl. 18, 728–739.

[6]

Bolthausen E., 1984. An estimate of the remainder in a combinatorial central limit theorem. Z. Wahrsch. verw. Geb. 66, 379–386.

[7]

von Bahr B., 1976. Remainder term estimate in a combinatorial central limit theorem. Z. Wahrsch. verw. Geb. 35, 131-139.

[8]

Ho S.T., Chen L.H.Y., 1978. An $L_{p}$ bound for the remainder in a combinatorial central limit theorem. Ann. Probab. 6, 231–249.

[9]

Goldstein L., 2005. Berry-Esseen bounds for combinatorial central limit theorems and pattern occurrences, using zero and size biasing. J. Appl. Probab. 42, 661–683.

[10]

Neammanee K., Suntornchost J., 2005. A uniform bound on a combinatorial central limit theorem. Stoch. Anal. Appl. 3, 559-578.

[11]

Neammanee K., Rattanawong P., 2009. A constant on a uniform bound of a combinatorial central limit theorem. J. Math. Research 1, 91-103.

[12]

Chen L.H.Y., Goldstein L., Shao Q.M., 2011. Normal approximation by Stein’s method. Springer.

[13]

Chen L.H.Y., Fang X., 2015. On the error bound in a combinatorial central limit theorem. Bernoulli 21, No. 1, 335–359.

[14]

Frolov A.N., 2014. Esseen type bounds of the remainder in a combinatorial CLT. J. Statist. Planning and Inference, 149, 90–97.

[15]

Frolov A.N., 2015a. Bounds of the remainder in a combinatorial central limit theorem. Statist. Probab. Letters 105, 37–46.

[16]

Frolov A.N., 2015b. On the probabilities of moderate deviations for combinatorial sums. Vestnik St. Petersburg University. Mathematics 48, No. 1, 23–28. Allerton Press, Inc.

[17]

Frolov A.N., 2017. On Esseen type inequalities for combinatorial random sums. Communications in Statistics - Theory and Methods 46 (12), 5932–5940.

[18]

Frolov A.N., Martikainen A.I., Steinebach J., 1997. Erdős–Rényi–Shepp type laws in non-i.i.d. case. Studia Sci. Math. Hungar. 34, 165–181.
