Estimates for the closeness of convolutions of probability distributions on convex polyhedra
Friedrich G\"otze, Andrei Yu. Zaitsev

TL;DR
This paper extends previous results on approximating sums of independent random variables and their convolutions to the setting of probability distributions on convex polyhedra, providing new estimates for their closeness.
Contribution
It transfers existing approximation and convolution proximity results to probability distributions on convex polyhedra, broadening their applicability.
Findings
Established bounds for convolution closeness on convex polyhedra
Extended compound Poisson approximation techniques to convex polyhedral distributions
Provided theoretical estimates for multidimensional distribution convolutions
Abstract
The aim of the present work is to show that the results obtained earlier on the approximation of distributions of sums of independent summands by the accompanying compound Poisson laws and the estimates of the proximity of sequential convolutions of multidimensional distributions may be transferred to the estimation of the closeness of convolutions of probability distributions on convex polyhedra.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Estimates for the closeness of convolutions of probability distributions on convex polyhedra
Friedrich Götze
and
Andrei Yu. Zaitsev
Fakultät für Mathematik,
Universität Bielefeld, Postfach 100131,
D-33501 Bielefeld, Germany
St. Petersburg Department of Steklov Mathematical Institute
Fontanka 27, St. Petersburg 191023, Russia
and St.Petersburg State University, 7/9 Universitetskaya nab., St. Petersburg, 199034 Russia
Abstract.
The aim of the present work is to show that the results obtained earlier on the approximation of distributions of sums of independent summands by the accompanying compound Poisson laws and the estimates of the proximity of sequential convolutions of multidimensional distributions may be transferred to the estimation of the closeness of convolutions of probability distributions on convex polyhedra.
Key words and phrases:
sums of independent random variables, closeness of successive convolutions, convex polyhedra, approximation, inequalities
1991 Mathematics Subject Classification:
Primary 60F05; secondary 60E15, 60G50
The authors were supported by the SPbGU-DFG grant 6.65.37.2017. The second author was supported by grant RFBR 16-01-00367.
Let us first introduce some notation. Let denote the set of probability distributions defined on the Borel -field of subsets of the Euclidean space and let be the distribution of a -dimensional random vector . Let be the set of symmetric distributions. For , we denote the corresponding characteristic functions by , , and distribution functions by , . The uniform Kolmogorov distance is defined by
[TABLE]
By the symbols and we denote (generally speaking various) positive absolute constants and quantities depending only on the arguments in brackets. For , we denote
[TABLE]
Products and powers of measures are understood in the convolution sense: , , , where is the distribution concentrated at a point . A natural approximating infinitely divisible distribution for is the accompanying compound Poisson distribution , where
[TABLE]
and, more generally,
[TABLE]
It is well-known that the distribution is infinitely divisible.
Arak [1] showed that, if is a symmetric one-dimensional distribution with a nonnegative characteristic function for all , then
[TABLE]
He introduced and used the so-called method of triangular functions (see [2, Chapter 3, Sections 2–4]).
Zaitsev [8] applied the methods which were used by Arak while proving inequality (2) (see [2, Chapter 5, Sections 2, 5–7]). Later, he managed to modify these methods, adapting them to the multidimensional case (see [9]–[13]). In particular, in [12], a multidimensional analogue of inequality (2) was obtained.
Using the method of triangular functions and its generalizations, several bounds of the type
[TABLE]
were obtained, where is small, , and the inequalities
[TABLE]
are valid (see a discussion for in [2, Chapter 3, Section 3]). Note that, in the general case, (4) does not imply (3).
Inequality (3) is equivalent to the validity of the inequality
[TABLE]
for all sets of the form
[TABLE]
where are the vectors of the standard Euclidean basis, , .
For we denote by the collection of convex polyhedra representable in the form
[TABLE]
where , , , and, for , ,
[TABLE]
where is the Lebesgue measure and {Q(F,\,b)=\sup_{x}F\bigl{\{}\kern 1.0pt[x,\,x+b]\kern 1.0pt\bigr{\}}} is the concentration function of . Define a distance over all polyhedra as
[TABLE]
In [13], Zaitsev has proved several non-uniform inequalities of the form containing a factor , depending on the compared distributions and on the set , which satisfies the inequality and may turn out to be small if the polyhedron is sufficiently small in a certain sense. Theorems 1 and 2 have been obtained in [13] as consequences of the corresponding results of [10]–[12] which were proved for the case when is a parallelepiped (6) with faces parallel to the coordinate axes. The aim of the present paper is to formulate and to discuss similar bounds for the quantities and \bigl{|}G\{X\}-H\{X\}\bigr{|}, , where are certain convolutions of probability distributions which have not been studied in the literature before.
Theorem 1**.**
Let , , , , , . Then
[TABLE]
and, moreover,
[TABLE]
Hence,
[TABLE]
Theorem 2**.**
Assume that the distributions are represented as
[TABLE]
where are arbitrary distributions, ,
[TABLE]
where are the -variate infinitely divisible distribution with characteristic function
[TABLE]
Then
[TABLE]
and, hence,
[TABLE]
It is easy to see that, under the conditions of Theorems 1 and 2, we have , , and, moreover, the quantities may be small. For example, for a fixed bounded set , the quantity decreases for not slower than . Thus, inequalities (7), (8) and (11) significantly strengthen inequalities (9) and (12). At the same time, there is no reason to expect that inequalities (7), (8) and (11) are optimal with respect to the dependence of the right-hand sides on the parameters and . In particular, already from Arak’s results (see [2, Theorem 7.1, Chap. V]) it follows that, for , , inequality (7) may be replaced by
[TABLE]
In inequalities (7) and (8) we have and
[TABLE]
[TABLE]
respectively.
The proofs of Theorems 1 and 2 are based on applications of -variate versions of bounds for the closeness of distributions on the sets of the form (6) with . It is important that the -variate vectors with coordinates , , , , satisfy the same -dimensional conditions as the random vectors with compared -dimensional distributions. For example, if , for some satisfying , then \mathcal{L}\big{(}\langle\xi,t_{1}\rangle,\ldots,\langle\xi,t_{m}\rangle\big{)}\in\mathfrak{F}_{m}^{(\alpha)}. Similarly, if , then \mathcal{L}\big{(}\langle\xi,t_{1}\rangle,\ldots,\langle\xi,t_{m}\rangle\big{)}\in\mathfrak{F}_{m}^{s}. Analogous statements hold for and -fold convolutions of such distributions and about other distributions involved in the assertions of Theorems 1 and 2. Thus, roughly speaking, from the known estimates of the distance in space we derive estimates of the distance in space .
The situation considered in Theorem 2 can be interpreted as a comparison of the sample containing independent observations of rare events with a Poisson point process which is obtained after a Poissonization of the initial sample (see [3], [14]).
Indeed, let be independent not identically distributed elements of a measurable space and be a Borel mapping. Assume that the set is represented as the union of two disjoint measurable sets: , with , . We say that the -th rare event occurs if . Respectively, it does not occur if .
Assume that , for all , and
[TABLE]
Then \mathcal{L}\big{(}f(X_{i})\big{)}=(1-p_{i})\,E+p_{i}\,V_{i}, where , and is conditional distributions of given . In [3], it was shown that
[TABLE]
where are the points of the corresponding Poisson point process. In particular, in the case where , , and f(x)=\big{(}\langle x,t_{1}\rangle,\ldots,\langle x,t_{m}\rangle\big{)}, for , inequality (14) turns into inequality (12).
In the rest of the paper we will study how small is the difference between and . A particular case of this problem is considered in inequality (7) of Theorem 1.
In the papers of Zaitsev [6], [7], [10] it was shown that one can obtain sharp bounds for the closeness of and without any moment conditions. Moreover, if the distribution is centered so that all its marginal distributions have zero medians, then
[TABLE]
where is an absolute constant (which was estimated for in [5] and [4]: ). The proof of this inequality is relatively simple and is based on classical bounds for concentration functions of convolutions. Much more complicated methods are needed to investigate the case of symmetric distributions . In this case inequality (15) is valid and it is optimal with respect to order in . But it may be essentially improved in the case when the characteristic function is uniformly separated from . In particular,
[TABLE]
if for all . Notice that inequality (8) is much more general compared to (16). Using this fact for the distribution with symmetric , we obtain the paradoxical statement that for all natural numbers and for any symmetric distribution the inequalities
[TABLE]
are valid and they are both optimal with respect to the order in . Inequalities (17) imply the following Theorem 3.
Theorem 3**.**
Let , . Then
[TABLE]
[TABLE]
In particular,
[TABLE]
It is evident that knowledge about the closeness of and is useful for studying distributions of the form
[TABLE]
A result in this direction is given in our Theorem 7. In particular, using (1) and the bounds for the closeness of and , Zaitsev [10] proved the following Theorem 4.
Theorem 4**.**
Let , . Then
[TABLE]
A one-dimensional version of Theorem 4 was proved somewhat earlier in [7].
It is evident that if a distribution is concentrated on a hyperplane which does not contain zero and is orthogonal to one of coordinate axes then for any . In particular, this is true in the case where , , . On the other hand, if all distributions , , of coordinates of the vector with are either non-degenerate or equal to , then, as is shown in Zaitsev [6], as and, moreover,
[TABLE]
Let be a one-dimensional lattice symmetric distribution concentrated on the set of odd numbers. Then the distributions , , are concentrated either on the set of odd numbers or on the set of even ones according to the parity of the number . Therefore, . For many distributions, e.g. for , the concentration function behaves as as . This indicates that the rate of decrease with respect to of the right-hand side of (22) cannot be increased without additional assumptions.
It is easy to show that the distribution is concentrated on the set of odd numbers if and only if its characteristic function is equal to at the points for all . For example, let , for . Inequality (9) of Theorem 1 says that the separation from of the characteristic function of a distribution leads to more quick decay of than the inequality (22) is able to provide.
Similarly to the proof of Theorems 1 and 2, we can use inequalities (18)–(22) in order to obtain the corresponding analogues of (18)–(22) for the closeness of convolutions of -dimensional distributions on the convex polyhedra . The following Theorems 5–7 are the main results of the present paper.
Theorem 5**.**
Let , . Then
[TABLE]
[TABLE]
[TABLE]
In particular,
[TABLE]
For , , we denote by the collection of convex polyhedra representable in the form
[TABLE]
Clearly,
[TABLE]
The following Theorem 6 is a consequence of inequality (22).
Theorem 6**.**
Let , , , and all distributions of the random variables , , are either non-degenerate or equal to . Then, for all ,
[TABLE]
Thus, we have the alternative: the left-hand side of (27) is equal to one or decreases at least as .
The quantity can be larger than any absolute constant. For example, if depends on and F_{n}\big{\{}[n,n+1)\big{\}}=1, then .
Note, however, that there is a difference between Theorems 1–2 and Theorems 5–6. In Theorems 1–2, the bounds are non-uniform. They include factors which depend on , , and may be small for small sets . The bounds of Theorems 5–6 cannot be improved even if we compare the probabilities to hit the set containing the unique point 0 only, where and . An exception is inequality (24). Applying inequality (8) to the distribution , it is easy to show that
[TABLE]
for any , where and is the maximal integer which is less or equal to .
In conclusion, we formulate a result on the closeness of the distributions of sums of random number of independent identically distributed random vectors, which follows from Theorem 5. For the distance this result is contained in [10, Theorem 1.3].
Let be independent identically distributed random vectors with common distribution and let be a two-dimensional random vector with integer non-negative coordinates, independent of the sequence . Denote
[TABLE]
Then it is well known that then
[TABLE]
Theorem 7**.**
If , then
[TABLE]
and if , then
[TABLE]
Here, the infimum is taken over all possible two-dimensional distributions such that , .
An interesting problem is to expand our inequalities to arbitrary convex sets . This does not follow from Theorems 1, 2, and 5–7, since the constants depend on .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] T. V. Arak, On the approximation of n 𝑛 n -fold convolutions of distributions, having a non-negative characteristic function with accompanying laws . — Theory Probab. Appl. 25 , No. 2 (1980), 221–243.
- 2[2] T. V. Arak, A. Yu. Zaitsev, Uniform limit theorems for sums of independent random variables . — Proc. Steklov Inst. Math., 174 (1988), 222 p.
- 3[3] F. Götze, A. Yu. Zaitsev, Rare events and Poisson point processes . — Zap. Nauchn. Semin. POMI 466 (2017), 109–119 (in Russian). English transl. in J. Math. Sci. (N. Y.), (2019), to appear.
- 4[4] Ia. S. Golikova, On improvement of the estimate of the distance between sequential sums of independent random variables . — Zap. Nauchn. Semin. POMI 474 (2018), 118–123 (in Russian). English transl. in J. Math. Sci. (N. Y.), (2019), to appear.
- 5[5] E. L. Maistrenko, Estimate for the absolute constant in an inequality for the uniform distance between distributions of sequential sums of independent random variables . — Zap. Nauchn. Semin. POMI 454 (2016), 216–219 (in Russian). English transl. in J. Math. Sci. (N. Y.), 229 , 6 (2018), 741–743.
- 6[6] A. Yu. Zaitsev, The estimation of proximity of distribution of sequential sums of independent identically distributed random vectors . — Zap. Nauchn. Semin. POMI 97 (1980), 83–87 (in Russian). English transl. in J. Soviet Math., 24 , 5 (1984), 536–539.
- 7[7] A. Yu. Zaitsev, Some properties of n 𝑛 n -fold convolutions of distributions . — Theory Probab. Appl. 26 , 1 (1981), 148–152.
- 8[8] A. Yu. Zaitsev, On the accuracy of approximation of distributions of sums of independent random variables—which are nonzero with a small probability—by means of accompanying laws . — Theory Probab. Appl. 28 , 3 (1984), 657–669.
