Normal Approximation for Weighted Sums under a Second Order Correlation Condition
S. G. Bobkov, G.P. Chistyakov, F. G\"otze

TL;DR
This paper establishes a bound on how closely weighted sums of dependent variables approximate a normal distribution under certain correlation conditions, using advanced concentration inequalities, with applications to log-concave measures.
Contribution
It introduces a new upper bound for the normal approximation of dependent weighted sums under second order correlation conditions, improving existing results.
Findings
Bound of order (log n)/n for Kolmogorov distance
Enhanced concentration inequalities on high-dimensional spheres
Applications to log-concave probability measures
Abstract
Under correlation-type conditions, we derive an upper bound of order for the average Kolmogorov distance between the distributions of weighted sums of dependent summands and the normal law. The result is based on improved concentration inequalities on high-dimensional Euclidean spheres. Applications are illustrated on the example of log-concave probability measures.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPoint processes and geometric inequalities · Markov Chains and Monte Carlo Methods · Statistical Methods and Inference
NORMAL APPROXIMATION FOR WEIGHTED SUMS
UNDER A SECOND ORDER CORRELATION CONDITION
S. G. Bobkovlabel=e1][email protected] [ School of Mathematics
University of Minnesota
Vincent Hall 228
206 Church St. S.E.
Minneapolis, MN 55455
USA
G. P. Chistyakovlabel=e3] [email protected] [ Department of Mathematics
University of Bielefeld
Postbox 100131
33501 Bielefeld
Germany
E-mail: e3
F. Götze label=e2][email protected] [ University of Minnesota\thanksmarkm1 and Bielefeld University\thanksmarkm2
Abstract
Under correlation-type conditions, we derive an upper bound of order for the average Kolmogorov distance between the distributions of weighted sums of dependent summands and the normal law. The result is based on improved concentration inequalities on high-dimensional Euclidean spheres. Applications are illustrated on the example of log-concave probability measures.
60E,
60F,
Sudakov’s typical distributions,
normal approximation,
keywords:
[class=MSC]
keywords:
,
&
t1Supported by NSF grant DMS-1855575, and CRC 1283 at Bielefeld University.
1 Introduction
Let be an isotropic random vector in (), that is, with uncorrelated components having mean zero and variance one. We consider the distribution functions of the weighted sums
[TABLE]
with coefficients taken from the unit sphere in . Thus, and for all .
The central limit problem is to determine natural conditions on and which ensure that the random variables are nearly standard normal. In this case, one would also like to explore the rate of normal approximation in the Kolmogorov distance
[TABLE]
where
[TABLE]
is the standard normal distribution function. Let us briefly recall several well-known results in the case of independent components . Here, one of general variants of the central limit theorem asserts that will be small, as long as are identically distributed (the i.i.d. case), while is small. Moreover, under the 3rd moment condition this property may be quantified by virtue of the Berry-Esseen bound
[TABLE]
Here and below, we denote by , or by with an integer index absolute positive constants which may vary from place to place. The inequality (1.1) extends to the non-i.i.d. case as well ([P1], [P2]).
It easy to see that the sum in (1.1) is greater than or equal to for all , and that (1.1) leads to this standard -rate in the i.i.d. case, once the coefficients are equal to each other. For general distributions of this standard rate cannot be improved by assuming stronger moment-type conditions. Nevertheless, one may look at the problem from an ensemble point of view in asking whether or not will be essentially smaller than for most of on the sphere measured with the uniform probability measure on . A striking result in this direction was obtained by Klartag and Sodin [K-S], showing in particular that
[TABLE]
where we use to denote the average over the measure . Large deviation bounds for the set on the sphere where exceeds a multiple of are derived in [K-S] as well. Thus, when is bounded like in the i.i.d. case, the distances turn out to be typically of order in contrast to the classical case of equal coefficients.
The aim of these notes is to extend this interesting phenomenon under a suitable correlation-type condition (and thus for some class of dependent ) to isotropic random vectors with a similar -rate modulo a logarithmic factor. The scheme of the weighted sums under dependence has already a long history, going back to the seminal work of Sudakov [Su]. We will give a short overview of this line of research in Section 10 (partly in Section 7), and now turn to the main result.
We will say that the random vector satisfies a second order correlation condition with constant , if for any collection ,
[TABLE]
An optimal value is finite as long as has a finite -th moment, and then it represents the maximal eigenvalue of the covariance matrix associated with the -dimensional random vector \big{(}X_{i}X_{j}-{\mathbb{E}}X_{i}X_{j}\big{)}_{i,j=1}^{n}.
Theorem 1.1. Let be an isotropic random vector in with a symmetric distribution and a finite constant . Then
[TABLE]
The characteristic may be bounded, for example, via the relation in terms of a positive spectral gap, that is in terms of the optimal value in the Poincaré-type inequality
[TABLE]
(with ), where is an arbitrary smooth function on (cf. Proposition 3.4 below). In one important particular case, the well-known Kannan-Lovász-Simonovits conjecture asserts that is bounded away from zero for the whole class of isotropic log-concave probability distributions on the Euclidean space of any dimension (for short, K-L-S). Conditional on K-L-S, Theorem 1.1 would hence guarantee the -rate.
Corollary 1.2. Let be an isotropic random vector in with a symmetric log-concave distribution. Assuming the K-L-S hypothesis, we have
[TABLE]
In fact, modulo a logarithmic factor, the conclusion may be reversed in the sense that (1.6) implies , cf. Section 8.
An unconditional statement in the isotropic log-concave case with a standard rate of normal approximation can be obtained by combining the results of [A-B-P] and [B1] on the concentration of around the average distribution function with respect to the variable with a recent bound in the thin-shell problem due to Lee and Vempala [L-V] on the concentration of the Euclidean length about its average value (which is in essense equivalent to the closeness of to the standard normal distribution function ). More details are given in Section 7; one then gets
[TABLE]
As for the general (not necessarily log-concave) case, the functional turns out to be responsible for both, formally different concentration problems. The proof of Theorem 1.1 is based on results for spherical concentration, which have been recently developed in [B-C-G1]. They provide improved rates of concentration for smooth functions on the sphere based on the additional information about the Hessian of . This naturally leads to the definition of as introduced above. The “2nd-order” concentration inequalities on may also be used to derive large deviation bounds for considered as random variables on the probability space . Moreover, one may remove the symmetry assumption as well, by adding to the right-hand side of (1.4) an additional term responsible for 3rd order correlations between . We refrain from including these somewhat more technical results here and refer the interested reader to [B-C-G4] for a full account.
As we will see, there exist several natural classes of probability distributions for which a bound on the parameter can be obtained. Some of them are considered in Section 3, after a brief discussion of general properties of and related functionals in Section 2. Some results about the second order concentration on the sphere are described in Sections 4, which we apply in Section 5 to explore the concentration of characteristic functions of with respect to the variable . In Section 6, relying upon a general Berry-Esseen-type inequality, we finalize the proof of Theorem 1.1. The relationship of Theorem 1.1 with the K-L-S conjecture and a closely related thin-shell problem in the log-concave case are discussed separately in Sections 7-8.
2 Second Order Correlation Condition and Related Functionals
As usual, the Euclidean space is endowed with the canonical norm and the inner product . We start with preliminary remarks on the second order correlation condition and related functionals.
Let be a random vector in . With the Hilbert-Schmidt norm of a matrix given by , the definition (1.3) becomes
[TABLE]
where we may restrict ourselves to symmetric matrices only. This description shows that the functional is invariant under linear orthogonal transformations of the space (just as the Hilbert-Schmidt norm).
Related moment and variance-type functionals are
[TABLE]
We are mostly interested in the moments with and . For example, in the isotropic case, and , if is constant a.s. These functionals can be controlled in terms of , as the following statement shows.
Proposition 2.1. We have
[TABLE]
Proof. Choosing in (1.3) , , we get . Since , it follows that , that is, . Putting , we also obtain . ∎
In turn, the -moments may be related to the moments of . It is easy to see that
[TABLE]
while in the isotropic case, there is an opposite inequality .
The functionals , , and are useful for the estimation of “small” ball probabilities. For example, if , using an independent copy of , we have
[TABLE]
This bound was applied in the proof of Lemma 5.1 below (for details we refer to [B-C-G3]). Here, by Proposition 2.1 in the isotropic case, , which is also due to the fact that the functional is bounded away from zero for (in contrast to ).
Proposition 2.2. If is isotropic, then .
Proof. Applying the inequality (1.3) to the matrix with only one non-zero entry on the -place, we get
[TABLE]
Summing these bounds over all leads to . But . ∎
All the above definitions extend to complex-valued random variables using complex numbers in the definition (1.3) (of course, should be replaced with ). Note that, if is a complex-valued random variable, its variance is defined by
[TABLE]
3 **Classes of Distributions Satisfying
Second Order Correlation Condition**
Here we provide a few examples where functionals defined above may be easily evaluated or properly estimated. Bounds are attained for the second order correlation parameter for the following classes of distributions: i.i.d., coordinate-wise symmetric, log-concave and coordinate-wise symmetric, and probability measures with a pectral gap.
As before, let , . The case of independent components may be dealt with by simple calculation.
Proposition 3.1. If the random variables are independent and have mean zero, then
[TABLE]
[TABLE]
Note that equality (3.1) obviously extends to pairwise independent random variables with mean zero. The proof of the bound of in (3.2) is similar to the one in Proposition 3.2 below, so we omit it.
Another class of illustrative examples is given by distributions of random vectors which are equal to for arbitrary choices of signs . We call such distributions coordinate-wise symmetric, although in the literature they are also called distributions with unconditional basis. This class includes all symmetric product measures on and corresponds to the case where the components are i.i.d. random variables with symmetric distributions on the line. It is therefore not surprising that many formulas like those in Proposition 3.1 extend to the coordinate-wise symmetric distributions. In particular, the first equality in (3.2) is still valid. As for , it may be essentially reduced to the moment-type functional
[TABLE]
representing the maximal eigenvalue of the matrix .
Proposition 3.2. Given a random vector in with a coordinate-wise symmetric distribution, we have
[TABLE]
If additionally the distribution of is invariant under permutations of coordinates, then
[TABLE]
where the last term may be removed when .
The proof of this proposition is rather elementary, but technical. So, we postpone it to Section 9.
The following subfamily of coordinate-symmetric distributions admits a uniform bound on . Let us recall that a (Borel) probability measure on is called log-concave, if it satisfies the Brunn-Minkowski-type inequality
[TABLE]
for all non-empty compact sets and in , where denotes the Minkowski weighted sum. An equivalent description was given by Borell [Bor]: the measure should be supported on a closed convex set and have a log-concave density with respect to the Lebesgue measure on of the same dimension as (that is, is concave). Note that, if is isotropic and log-concave, then necessarily has dimension , so that is the (full) Lebesgue measure.
Proposition 3.3. Assume that the random vector in is isotropic and has a coordinate-wise symmetric, log-concave distribution. Then
[TABLE]
Proof. The distribution of the random vector has a log-concave, coordinate-wise non-increasing density. By a theorem due to Klartag [K3], the following weighted Poincaré-type inequality holds
[TABLE]
for any smooth even function on . Choosing with , we get
[TABLE]
In view of Proposition 3.2, we get
[TABLE]
It remains to recall that -norms of random variables with log-concave distributions are equivalent to each other. In particular, for isotropic log-concave ’s, we have . ∎
The above subclass may be potentially enlarged by considering the usual Poincaré-type inequality
[TABLE]
Proposition 3.4. Assume that a mean zero random vector in satisfies a Poincaré-type inequality with constant . Then . Moreover,
[TABLE]
and if isotropic, then
[TABLE]
Proof. Applying (3.5) to the linear functions , , we obtain
[TABLE]
If has mean zero, the latter means that . In particular, . Taking the quadratic function with , we get, by Cauchy’s inequality,
[TABLE]
Hence, the right-hand side does not exceed subject to , and thus , while in the isotropic case. ∎
4 Second Order Concentration on the Sphere
Concentration of measure on the sphere means that the range of deviations of any Lipschitz function on the unit sphere is essentially of order at most , which may be strengthened as the subgaussian stochastic dominance where denotes a standard normal random variable (cf. [M-S], [L]). More precisely, there is a subgaussian deviation inequality
[TABLE]
valid whenever the smooth function has -mean zero and Lipschitz seminorm . This may be partly seen from the Poincaré inequality
[TABLE]
in the class of all smooth complex-valued with -mean zero. Although here there is equality for all linear functions, the spherical concentration phenomenon may be strengthened with respect to the dimension for a wide subclass of smooth functions. In order to facilitate applications, we shall not use sphere intrinsic gradients but use Euclidean notions induced by the standard embedding of the sphere. Here functions are defined in an open subset of and their partial derivatives are understood in the usual sense. We denote by the Hessian, that is, the matrix of second order partial derivative , and by the identity matrix. The next proposition summarizes several recent results from [B-C-G1].
Proposition 4.1. Suppose that a real-valued function is defined and -smooth in some neighbourhood of . If is orthogonal to all affine functions in , then
[TABLE]
for any . Moreover, if uniformly on for the operator norm, and the second integral in is bounded by , then
[TABLE]
By Markov’s inequality, (4.4) yields a corresponding large deviation bound, which may be stated informally as a subexponential stochastic dominance . In particular, this means that the deviations of are of order at most .
The second order Poincaré-type inequality (4.3) obviously extends to all complex-valued that are orthogonal to all affine functions on the sphere. In this case, (4.4) may be applied separately to the real and imaginary part of , which results in
[TABLE]
assuming that on for some .
5 Concentration of Characteristic Functions
Given an isotropic random vector in , introduce the following smooth functions
[TABLE]
where serves as a parameter. Note that, for any fixed , represents the characteristic function of the weighted sum with distribution function , while the -mean of ,
[TABLE]
is the characteristic function of the avarage distribution function
[TABLE]
Let us recall that we use to denote integrals over the unit sphere with respect to the uniform measure .
In order to study deviations of the functions from their -means on , one may start from the Poincare inequality (4.2). Indeed, differentiating the equality (5.1), we get that, for any ,
[TABLE]
which, by Cauchy’s inequality, implies
[TABLE]
Taking the supremum over all , it follows that , which means that has a Lipschitz semi-norm (on the whole space ). Therefore, by (4.2),
[TABLE]
Thus, the deviations of from with respect to are of order at most – a property which may potentially be transfered to the analogous statement about the deviations of the distribution functions from in the sense of certain weak metrics.
In order to obtain better rates, we employ Proposition 4.1, assuming additionally that the random vector is symmetric and satisfies a second order correlatation condition (1.3) with parameter . To apply the bounds (4.3) and (4.5), we need to choose a suitable value and estimate the operator norm and the Hilbert-Schmidt norm . First note that, by further differentiation of (5.1), the Hessian of is given by
[TABLE]
for any fixed . Hence, a good choice could be in order to balance the diagonal elements in the matrix of second derivatives of . For any vector with complex components, using the canonical inner product in the complex -space, we have
[TABLE]
Hence, with this choice of , by the isotropy assumption,
[TABLE]
This bound insures that
[TABLE]
In addition, putting , we have
[TABLE]
where the supremum is running over all complex numbers such that . But, under this constraint (with complex coefficients), due to the second order correlation condition, the last expectation is bounded by , so that
[TABLE]
for all . On the other hand, by (5.3),
[TABLE]
since . The two bounds give
[TABLE]
Note that (5.6) is worse in comparison with (5.5) in the variable . Nevertheless, applying the second order Poincaré-type inequality, it is possible to improve the resulting inequality (5.7) for reasonably long -intervals. Since the distribution of is symmetric about the origin, the characteristic functions are even with respect to , i.e., . Hence, they are orthogonal in the Hilbert space to all linear functions on the sphere. Thus, the conditions of Proposition 4.1 are fulfilled for the function , and using (5.7), the inequality (4.3) gives
[TABLE]
This bound allows us to improve (5.6) to the form
[TABLE]
where in the last inequality we assume that . Combining this with (5.5), we therefore obtain that
[TABLE]
In view of (4.3), this already gives the inequality (5.9) below.
To get a stronger deviation inequality, let us recall (5.4), so that to conclude that the conditions of Proposition 4.1 (in its second part) are fulfilled for the function
[TABLE]
with parameter (which bounded away from zero). Applying (4.5), we arrive at:
Corollary 5.1. Let be an isotropic random vector in with a symmetric distribution and finite constant . Then the characteristic functions satisfy
[TABLE]
whenever . Moreover,
[TABLE]
As we have seen, removing the constraint , (5.9) may be replaced with a weaker inequality (5.8). When applying the latter to the estimation of via Lemma 6.1 below, we would gain an additional factor in Theorem 1.1.
6 Proof of Theorem 1.1
Based on the deviation inequalities (5.9)-(5.10), Fourier analytic tools yield bounds for the closeness of the distribution functions to the -mean distribution function defined in (5.2). The following Berry-Esseen-type bound can be found in [B-C-G3], cf. Lemma 6.2, which we state in the case .
Lemma 6.1. Suppose that a random vector in has a finite moment of order , with . Then, for all ,
[TABLE]
Proof of Theorem 1.1. Applying Propositions 2.1-2.2 and using the isotropy assumption, we have . Hence, (6.1) yields
[TABLE]
Here, the integrand may be estimated by virtue of (5.9), and then we get
[TABLE]
provided that . As a natural choice, take , (assuming that is large enough), which leads to the bound
[TABLE]
We finally refer to [B-C-G2], Theorem 1.1, cf. also [B-C-G3], Corollary 4.2, where the estimate
[TABLE]
was derived. Using and combining (6.2) with the triangle inequality for , we arrive at the desired inequality (1.4). ∎
Remark 6.2. Under proper moment assumptions and using the spherical deviation inequality (5.10), one may derive large deviation bounds for as well. In particular, suppose that
[TABLE]
for all with some . Then, in the setting of Theorem 1.1,
[TABLE]
In other words, with high -probability,
[TABLE]
For details we refer the interested reader to [B-C-G4].
7 The log-concave case
Specializing to the class of isotropic log-concave distributions on , first let us comment on the unconditional statement with a standard rate of normal approximation as indicated in the inequality (1.7). If the isotropic random vector has a uniform distribution over a symmetric convex body in , it was shown by Anttila, Ball, and Perissinaki that
[TABLE]
(actually with , cf. [A-B-P]). With a different argument, this inequality has been extended to arbitrary isotropic log-concave distributions in [B1]. In both papers, as a main step, it was observed that, for every point , the function has a bounded Lipschitz semi-norm on the unit sphere, so that one may apply the spherical concentration inequality (4.1), leading to
[TABLE]
Since , (7.1) readily yields an upper bound
[TABLE]
Combining it with (6.3) and applying the triangle inequality for the metric , we therefore obtain the normal approximation on average in the form of the relation
[TABLE]
It remains to involve the bound , which was recently derived by Lee and Vempala [L-V], and then we arrive at (1.7).
A thin-shell conjecture, raised in [B-K], asserts that the functional , or equivalently , is actually bounded by a dimension-free (and thus universal) constant over the whole class of isotropic log-concave random vectors in . Specializing to the convex body case, a similar concentration hypothesis was also suggested in [A-B-P]. It states that the deviation inequality
[TABLE]
holds true with . The boundedness of allows one to take a slightly thinner shell with . Anyhow, the bound (7.2) subject to the thin-shell conjecture still leads to the standard normal approximation as in (1.7).
Note that, by the Poincaré-type inequality (1.5) applied with , one gets , so that the thin-shell conjecture is formally weaker than the K-L-S (which is further precised in Proposition 3.4). On the other hand, recently Eldan [E] has developed a new localization technique, in essense reducing the stronger hypothesis to the weaker one modulo a logarithmic factor. It is is therefore possible to state Corollary 1.2 alternatively as follows.
Corollary 7.1. Let be an isotropic random vector in with a symmetric log-concave distribution. Assuming that the thin-shell conjecture is true, we have
[TABLE]
Proof. Combining Theorem 1.1 with Proposition 3.4, we get
[TABLE]
where is the smallest spectral gap in the Poincaré-type inequality over the class of all isotropic log-concave probability measures on . Assuming the K-L-S conjecture, is bounded away from zero, which thus leads to the inequality (1.6) of Corollary 1.2. Within the same class, this quantity may be related to the largest value . Namely, as shown by Eldan [E],
[TABLE]
In particular, the bound of the form implies that
[TABLE]
with in case and . It remains to apply (7.5) in (7.3) with . ∎
8 From the normal approximation to the shin shell
To refine the relationship between the central limit theorem and the thin-shell problem, let us complement Corollary 7.1 by the following general statement involving the maximal -norm of linear functionals of .
Proposition 8.1. Let be a random vector in with , satisfying the moment condition with some . Then
[TABLE]
In the isotropic log-concave case, the condition (6.4) is fulfilled with some absolute constant (by the well-known Borell’s Lemma 3.1 in [Bor]), and this simplifies (8.1) to
[TABLE]
Hence, the potential property
[TABLE]
as in Corollary 1.2 would imply that
[TABLE]
assuming additionally that the distribution of is symmetric about zero. But, the symmetry condition may easily be dropped. Indeed, define , where is an independent copy of a random vector with an isotropic log-concave distribution on . Then, the distribution of is isotropic, log-concave, and symmetric about zero. Moreover,
[TABLE]
Hence, once (8.3) is true for the random vector , it continues to hold for as well (with other constant).
Note also that, applying Eldan’s inequality (7.4) together with (8.3), from the normal approximation (8.2) we get
[TABLE]
Proof of Proposition 8.1. In view of the triangle inequality , it is sufficient to derive (8.1) for in place of . This means that we need in essense to reverse the inequality (6.3) by using (6.4). To this aim, let us rewrite the definition (5.2) as
[TABLE]
where we assume that and (as a random vector uniformly distributed on the sphere) are independent. This description yields
[TABLE]
or equivalently
[TABLE]
where . On the other hand, it follows from (6.4) that
[TABLE]
Using () together with the property , we have , which can be used to derive the bounds
[TABLE]
In addition, by Markov’s inequality, , so that
[TABLE]
for all . Hence, integrating by parts, we see that, for any , the left-hand side of (8.4) does not exceed in absolute value
[TABLE]
Choosing and recalling (8.4), we get
[TABLE]
∎
9 Proof of Proposition 3.2
The lower bound on in (3.3) immediately follows from (1.3) by choosing the coefficients to be of the form . For the upper bound, put and define
[TABLE]
The covariances of these mean zero random variables are given by
[TABLE]
Case 1: . By the symmetry with respect to the coordinate axes, the right-hand side of (9.1) is vanishing unless or . In both cases, it is equal to
[TABLE]
Case 2: . The right-hand side in (9.1) is non-zero only when .
Case 2a): , , . The right-hand side in (9.1) is equal to
[TABLE]
Case 2b): . The right-hand side is equal to
[TABLE]
In both subcases, . Therefore, for any collection of real numbers such that and ,
[TABLE]
Here, the first sum on the right-hand side does not exceed
[TABLE]
(by applying Cauchy’s inequality). As for the second sum, it does not exceed , and we obtain
[TABLE]
from which (3.3) follows immediately.
As for (3.4), recall that the first inequality always holds, cf. Proposition 2.1. For the second one, let us note that
[TABLE]
and that, for any ,
[TABLE]
Here, in the case , the right-hand side is maximized for equal coefficients, and recalling (9.3), we then get
[TABLE]
Hence, (9.2) implies (3.4). In the case , we similarly conclude that
[TABLE]
which means that . Thus, by (9.2),
[TABLE]
Hence, (3.4) follows in this case as well even without the -functional. ∎
10 Historical Remarks
Finally, let us give a short overview on results related to Theorem 1.1 (some account can also be found in the book [B-G-V-V]). It is natural to distinguish between two types of results.
10.1. Deviations of from the mean distribution in different metrics. The paper by Sudakov [Su] starts with the hypothesis
[TABLE]
which may be called a first order correlation condition. Here, an optimal value is the same functional we considered in Section 2; equivalently, represents the maximal eigenvalue of the correlation operator for the random vector . As was shown in [Su], if is bounded, and is large, then most of are close to the average distribution in the sense of the Kantorovich or -distance
[TABLE]
A closely related observation was also made by Diaconis and Freedman [D-F]. A somewhat different scheme, in which the coefficient vectors are drawn from the Gaussian measure on with mean zero and covariance matrix , was also considered by Nagaev [N] and von Weizsäcker [W]. In particular, assuming that , [N] contains a quantitative bound
[TABLE]
for the -distance between the distribution functions. When the coefficients have a special structure, similar phenomena were considered in [B2], [B-G].
Returning to the spherical measure , the rate as in (10.1) is achieved for the Lévy distance as well. More precisely, there is a general bound
[TABLE]
where the constant depends on only, cf. [B4]. Large deviation bounds on were given in [B1] in the isotropic case. As was already discussed in Section 7, the rate and deviation bounds may be essentially improved and be stated for the stronger Kolmogorov distance, when the random vector has an isotropic log-concave distribution.
Quantitative variants of Sudakov’s theorem for were studied in [B3], where it was shown that, for any ,
[TABLE]
The rate is thus approaching for growing . Under a stronger assumption (6.4), the above inequality easily implies
[TABLE]
Here, the logarithmic term may be removed, if has an isotropic log-concave distribution (by virtue of Proposition 3.1 in [B1]). Note that, in all these results, the rates are not better than a multiple of .
10.2. Deviations of from the standard normal distribution function . To study the approximation of by the standard normal distribution function for most of ’s, one is led to determine rates for the distance , which may be reduced to the estimation of (via relation (6.3)). In fact, the control of the two functionals, and , is sufficient to guarantee a standard rate of normal approximation for on average. As was shown in [B-C-G3], we have
[TABLE]
Note that Theorem 1.1 essentially improves this bound as long as is of the same order as and . However, whether or not and even is bounded might be a difficult problem for some classes of distributions on such as the class of isotropic log-concave probability measures. For this class, the property that is small for most of (when is large) was first established by Klartag [K1]. In particular, as uniformly over the class. For further refinements in this direction, see [K2], [E-K], [G-M].
There is also a number of results, where the coefficients are fixed, and are bounded by a quantity, which depends on as well, cf. e.g. [M], [M-M]. One striking result due to Klartag [K3] should be mentioned: If the random vector in is isotropic and has a coordinate-wise symmetric, isotropic log-concave distribution, then
[TABLE]
Moreover, a similar bound holds true for the stronger total variation distance. This is of course more precise in comparison with the average estimate .
Acknowledgment
The authors would like to thank the two referees for careful reading of the manuscript and valuable comments.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[A-B-P] M. Anttila, K. Ball, and I. Perissinaki. The central limit problem for convex bodies. Trans. Amer. Math. Soc. 355 (2003), no. 12, 4723–4735.
- 2[B 1] S. G. Bobkov. On concentration of distributions of random weighted sums. Ann. Probab. 31 (2003), no. 1, 195–215.
- 3[B 2] S. G. Bobkov. Concentration of distributions of the weighted sums with Bernoullian coefficients. Geometric aspects of functional analysis, 27–36, Lecture Notes in Math., 1807, Springer, Berlin, 2003.
- 4[B 3] S. G. Bobkov. Concentration of normalized sums and a central limit theorem for noncorrelated random variables. Ann. Probab. 32 (2004), no. 4, 2884–2907.
- 5[B 4] S. G. Bobkov. On a theorem of V. N. Sudakov on typical distributions. (Russian) J. Math. Sciences (New York), 167 (2010), no. 4, 464–473. Translated from: Zap. Nauchn. Semin. POMI, vol. 368 (2009), 59–74.
- 6[B-C-G 1] S. G. Bobkov, G. P. Chistyakov, and F. Götze. Second-order concentration on the sphere. Commun. Contemp. Math. 19 (2017), no. 5, 1650058, 20 pp.
- 7[B-C-G 2] S. G. Bobkov, G. P. Chistyakov, and F. Götze. Gaussian mixtures and normal approximation for V. N. Sudakov’s typical distributions. Zap. Nauchn. Sem. S.-Peterburg. Otdel. Mat. Inst. Steklov. (POMI) 457 (2017), Veroyatnosti Statistika. 25, 37–52.
- 8[B-C-G 3] S. G. Bobkov, G. P. Chistyakov, and F. Götze. Berry-Esseen bounds for typical weighted sums. J. Electron. Probab. 23 (2018), no. 92, 1–22.
