Two-sided estimates for order statistics of log-concave random vectors
Rafa{\l} Lata{\l}a, Marta Strzelecka

TL;DR
This paper derives precise two-sided bounds for the expectations of order statistics of log-concave random vectors, advancing understanding of their coordinate maxima and sums with universal constants.
Contribution
It provides the first sharp two-sided bounds for order statistics of log-concave vectors, applicable in unconditional and isotropic cases, with explicit bounds for sums of largest coordinates.
Findings
Exact bounds up to universal constants for order statistics in unconditional case.
Bounds valid for all k in unconditional case and for k ≤ n - c n^{5/6} in isotropic case.
Estimates for sums of the largest moduli of coordinates for specific classes of vectors.
Abstract
We establish two-sided bounds for expectations of order statistics (-th maxima) of moduli of coordinates of centered log-concave random vectors with uncorrelated coordinates. Our bounds are exact up to multiplicative universal constants in the unconditional case for all and in the isotropic case for . We also derive two-sided estimates for expectations of sums of largest moduli of coordinates for some classes of random vectors.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Two-sided estimates for order statistics
of log-concave random vectors
Rafał Latała and Marta Strzelecka
Institute of Mathematics, University of Warsaw, Banacha 2, 02–097 Warsaw, Poland.
[email protected], [email protected]
(Date: January 6, 2019)
Abstract.
We establish two-sided bounds for expectations of order statistics (-th maxima) of moduli of coordinates of centered log-concave random vectors with uncorrelated coordinates. Our bounds are exact up to multiplicative universal constants in the unconditional case for all and in the isotropic case for . We also derive two-sided estimates for expectations of sums of largest moduli of coordinates for some classes of random vectors.
The research of RL was supported by the National Science Centre, Poland grant 2015/18/A/ST1/00553 and of MS by the National Science Centre, Poland grant 2015/19/N/ST1/02661
1. Introduction and main results
For a vector let (or ) denote its -th maximum (respectively its * -th minimum*), i.e. its -th maximal (respectively -th minimal) coordinate. For a random vector , is also called the -th order statistic of .
Let be a random vector with finite first moment. In this note we try to estimate and
[TABLE]
Order statistics play an important role in various statistical applications and there is an extensive literature on this subject (cf. [2, 5] and references therein).
We put special emphasis on the case of log-concave vectors, i.e. random vectors satisfying the property for any and any nonempty compact sets and . By the result of Borell [3] a vector with full dimensional support is log-concave if and only if it has a log-concave density, i.e. the density of a form where is convex with values in . A typical example of a log-concave vector is a vector uniformly distributed over a convex body. In recent years the study of log-concave vectors attracted attention of many researchers, cf. monographs [1, 4].
To bound the sum of largest coordinates of we define
[TABLE]
and start with an easy upper bound.
Proposition 1**.**
For any random vector with finite first moment we have
[TABLE]
Proof.
For any we have
[TABLE]
It turns out that this bound may be reversed for vectors with independent coordinates or, more generally, vectors satisfying the following condition
[TABLE]
If this means that moduli of coordinates of are negatively correlated.
Theorem 2**.**
Suppose that a random vector satisfies condition (3) with some . Then there exists a constant which depends only on such that for any ,
[TABLE]
We may take .
In the case of i.i.d. coordinates two-sided bounds for in terms of an Orlicz norm (related to the distribution of ) of a vector where known before, see [7].
Log-concave vectors with diagonal covariance matrices behave in many aspects like vectors with independent coordinates. This is true also in our case.
Theorem 3**.**
Let be a log-concave random vector with uncorrelated coordinates (i.e. for ). Then for any ,
[TABLE]
In the above statement and in the sequel and denote positive universal constants.
The next two examples show that the lower bound cannot hold if and only marginal distributions of are log-concave or the coordinates of are highly correlated.
Example 1. Let , where are independent, and has the normal distribution. Then and it is not hard to check that and if .
Example 2. Let , where . Then, as in the previous example, and .
Question 1. Let be a decoupled version of , i.e. are independent and has the same distribution as . Due to Theorem 2 (applied to ), the assertion of Theorem 3 may be stated equivalently as
[TABLE]
Is the more general fact true that for any symmetric norm and any log-concave vector with uncorrelated coordinates
[TABLE]
Maybe such an estimate holds at least in the case of unconditional log-concave vectors?
We turn our attention to bounding -maxima of . This was investigated in [8] (under some strong assumptions on the function ) and in the weighted i.i.d. setting in [7, 9, 15]. We will give different bounds valid for log-concave vectors, in which we do not have to assume independence, nor any special conditions on the growth of the distribution function of the coordinates of . To this end we need to define another quantity:
[TABLE]
Theorem 4**.**
Let be a mean zero log-concave -dimensional random vector with uncorrelated coordinates and . Then
[TABLE]
Moreover, if is additionally unconditional then
[TABLE]
The next theorem provides an upper bound in the general log-concave case.
Theorem 5**.**
Let be a mean zero log-concave -dimensional random vector with uncorrelated coordinates and . Then
[TABLE]
and
[TABLE]
In the isotropic case (i.e. ) one may show that for and for (see Lemma 24 below). In particular for . This together with the two previous theorems implies the following corollary.
Corollary 6**.**
Let be an isotropic log-concave -dimensional random vector and . Then
[TABLE]
and
[TABLE]
If is additionally unconditional then
[TABLE]
Question 2. Does the second part of Theorem 4 hold without the unconditionality assumptions? In particular, is it true that for ?
Notation. Throughout this paper by letters we denote universal positive constants and by constants depending only on the parameter . The values of constants may differ at each occurrence. If we need to fix a value of constant, we use letters or . We write if . For a random variable we denote . Recall that a random vector is called isotropic, if and .
This note is organised as follows. In Section 2 we provide a lower bound for the sum of largest coordinates, which involves the Poincaré constant of a vector. In Section 3 we use this result to obtain Theorem 3. In Section 4 we prove Theorem 2 and provide its application to comparison of weak and strong moments. In Section 5 we prove the first part of Theorem 4 and in Section 6 we prove the second part of Theorem 4, Theorem 5, and Lemma 24.
2. Exponential concentration
A probability measure on satisfies exponential concentration with constant if for any Borel set with ,
[TABLE]
We say that a random -dimensional vector satisfies exponential concentration if its distribution has such a property.
It is well known that exponential concentration is implied by the Poincaré inequality
[TABLE]
and (cf. [12, Corollary 3.2]).
Obviously, the constant in the exponential concentration is not linearly invariant. Typically one assumes that the vector is isotropic. For our purposes a more natural normalization will be that all coordinates have -norm equal to .
The next proposition states that bound (2) may be reversed under the assumption that satisfies the exponential concentration.
Proposition 7**.**
Assume that satisfies the exponential concentration with constant and for all . Then for any sequence of real numbers and we have
[TABLE]
where is given by (1).
We begin the proof with a few simple observations.
Lemma 8**.**
For any real numbers and we have
[TABLE]
Proof.
Without loss of generality we may assume that . Then
[TABLE]
Fix a sequence and define for ,
[TABLE]
Corollary 9**.**
For any ,
[TABLE]
and for any ,
[TABLE]
In particular
[TABLE]
Proof.
We have
[TABLE]
where the last equality follows by Lemma 8.
Moreover,
[TABLE]
The last part of the assertion easily follows, since
[TABLE]
Proof of Proposition 7.
To shorten the notation put . Without loss of generality we may assume that and . Observe first that
[TABLE]
so we may assume that .
Let be the law of and
[TABLE]
We have
[TABLE]
so we may assume that .
Observe that if and for some then
[TABLE]
Thus we have
[TABLE]
Therefore
[TABLE]
and
[TABLE]
where to get the next-to-last inequality we used the fact that .
Hence Corollary 9 and the definition of yields
[TABLE]
so . ∎
We finish this section with a simple fact that will be used in the sequel.
Lemma 10**.**
Suppose that a measure satisfies exponential concentration with constant . Then for any and any Borel set with we have
[TABLE]
Proof.
Let . Observe that has an empty intersection with so if then
[TABLE]
and . Hence , therefore for ,
[TABLE]
and the assertion easily follows. ∎
3. Sums of largest coordinates of log-concave vectors
We will usethe regular growth of moments of norms of log-concave vectors multiple times. By [4, Theorem 2.4.6], if is a seminorm, and is log-concave, then
[TABLE]
where is a universal constant.
We will also apply a few times the functional version of the Grünbaum inequality (see [14, Lemma 5.4]) which states that
[TABLE]
Let us start with a few technical lemmas. The first one will be used to reduce the proof of Theorem 3 to the symmetric case.
Lemma 11**.**
Let be a log-concave -dimensional vector and be an independent copy of . Then for any ,
[TABLE]
and
[TABLE]
Proof.
The first estimate follows by the easy bound
[TABLE]
To get the second bound we may and will assume that . Let us define , and . Obviously
[TABLE]
We have , thus by (8). Hence
[TABLE]
for . In the same way we show that
[TABLE]
Therefore
[TABLE]
We have
[TABLE]
Together with (10) we get
[TABLE]
and (9) easily follows. ∎
Lemma 12**.**
Suppose that is a real symmetric log-concave random variable. Then for any and ,
[TABLE]
Moreover, if , then
Proof.
Without loss of generality we may assume that (otherwise the first estimate is trivial).
Observe that where is convex and . In particular
[TABLE]
and
[TABLE]
We have
[TABLE]
This implies the second part of the lemma.
To conclude the proof of the first bound it is enough to observe that
[TABLE]
Proof of Theorem 3.
By Proposition 1 it is enough to show the lower bound. By Lemma 11 we may assume that is symmetric. We may also obviously assume that for all .
Let , where . Then is log-concave, isotropic and, by (7), for all . Set . Then and . Moreover, by the result of Lee and Vempala [13], we know that any -dimensional projection of is a log-concave, isotropic -dimensional vector thus it satisfies the exponential concentration with a constants . (In fact an easy modification of the proof below shows that for our purposes it would be enough to have exponential concentration with a constant for some , so one may also use Eldan’s result [6] which gives such estimates for any ). So any -dimensional projection of satisfies exponential concentration with constant .
Let us fix and set , then (since has no atoms)
[TABLE]
For define
[TABLE]
where . By (11) there exists such that
[TABLE]
Let us consider three cases.
(i) and . Then
[TABLE]
(ii) and . Choose of cardinality . Then
[TABLE]
(iii) . By Lemma 12 (applied with ) we have
[TABLE]
Moreover for , , so the second part of Lemma 12 yields
[TABLE]
and .
Set . If then, using (12), we estimate
[TABLE]
Otherwise set and . By (11) we have
[TABLE]
so and satisfies exponential concentration with constant . Estimate (12) yields
[TABLE]
so . Moreover, by Proposition 7 we have (since )
[TABLE]
To conclude observe that
[TABLE]
and since ,
[TABLE]
4. Vectors satisfying condition (3)
Proof of Theorem 2.
By Proposition 1 we need to show only the lower bound. Assume first that variables have no atoms and .
Let . Then . Note, that (3) implies that for all we have
[TABLE]
We may assume that , because otherwise the lower bound holds trivially.
Let us define
[TABLE]
Since
[TABLE]
it suffices to bound below the probability that by a constant depending only on .
We have
[TABLE]
Therefore and for any we have
[TABLE]
By Corollary 9 we have (recall definition(6))
[TABLE]
Assumption (3) implies that
[TABLE]
Moreover for we have
[TABLE]
so
[TABLE]
Thus
[TABLE]
and
[TABLE]
This together with (15) and the assumption that implies
[TABLE]
and
[TABLE]
Therefore
[TABLE]
This applied to (14) with gives us and in consequence
[TABLE]
Since is non-decreasing, in the case we have
[TABLE]
The last step is to loose the assumption that has no atoms. Note that both assumption (3) and the lower bound depend only on , so we may assume that are nonnegative almost surely. Consider , where are i.i.d. nonnegative r.v’s with and a density , independent of . Then for every we have (observe that (3) holds also for or ).
[TABLE]
Thus satisfies assumption (3) and has the density function for every . Therefore for all natural we have
[TABLE]
Clearly, as , so the lower bound holds in the case of arbitrary satisfying (3). ∎
We may use Theorem 2 to obtain a comparison of weak and strong moments for the supremum norm:
Corollary 13**.**
Let be an -dimensional centered random vector satisfying condition (3). Assume that
[TABLE]
Then the following comparison of weak and strong moments for the supremum norm holds: for all and all ,
[TABLE]
where is a constant depending only on and .
Proof.
Let be a decoupled version of . For any a random vector satisfies condition (3), so by Theorem 2
[TABLE]
for all , up to a constant depending only on . The coordinates of are independent and satisfy condition (16), so due to [11, Theorem 1.1] the comparison of weak and strong moments of holds, i.e. for ,
[TABLE]
where depends only on . These two observations yield the assertion. ∎
5. Lower estimates for order statistics
The next lemma shows the relation between and for log-concave vectors .
Lemma 14**.**
Let be a symmetric log-concave random vector in . For any we have
[TABLE]
Proof.
Let and . We may assume that any is not identically equal to [math]. Then and .
Obviously . Also for any we have
[TABLE]
To prove the upper bound set
[TABLE]
We have
[TABLE]
so . Hence
[TABLE]
Moreover by the second part of Lemma 12 we get
[TABLE]
so
[TABLE]
Hence if then
[TABLE]
that is . ∎
To derive bounds for order statistics we will also need a few facts about log-concave vectors.
Lemma 15**.**
Assume that is an isotropic one- or two-dimensional log-concave random vector with a density . Then for all . If is one-dimensional, then also for all , where is an absolute constant.
Proof.
We will use a classical result (see [4, Theorem 2.2.2, Proposition 3.3.1 and Proposition 2.5.9]): (note that here we use the assumption that is isotropic, in particular that , and that the dimension of is or ). This implies the upper bound on .
In order to get the lower bound in the one-dimensional case, it suffices to prove that for , where is fixed and its value will be chosen later (then by the log-concavity we get for all ). Since is again isotropic we may assume that .
If , then we are done. Otherwise by log-concavity of we get
[TABLE]
On the other hand, has mean zero, so and by the Paley–Zygmund inequality and (7) we have
[TABLE]
For we get a contradiction. ∎
Lemma 16**.**
Let be a mean zero log-concave random variable and let for some . Then
[TABLE]
Proof.
By the Grünbaum inequality (8) we have , hence
[TABLE]
Since satisfies the same assumptions as we also have
[TABLE]
Lemma 17**.**
Let be a mean zero log-concave random variable and let for some . Then there exists a universal constant such that
[TABLE]
Proof.
Without loss of generality we may assume that . Then by Chebyshev’s inequality . Let be the density of . By Lemma 15 we know that and on , where and are universal constants. Thus
[TABLE]
and
[TABLE]
Now we are ready to give a proof of the lower bound in Theorem 4. The next proposition is a key part of it.
Proposition 18**.**
Let be a mean zero log-concave -dimensional random vector with uncorrelated coordinates and let . Suppose that
[TABLE]
Then
[TABLE]
Proof.
Let , and . We will choose in such a way that is large, in particular we may assume that . Observe also that , thus if . Hence
[TABLE]
Lemma 16 and the definition of yield
[TABLE]
This yields and by Theorem 3 we have
[TABLE]
Since for any norm for (see [10, Corollary 1]) we have
[TABLE]
By the Paley-Zygmund inequality and (7), if , so . Moreover it is easy to verify that for , thus . Hence Proposition 1 and Lemma 14 yield
[TABLE]
and therefore
[TABLE]
[TABLE]
so it is enough to choose in such a way that . ∎
Proof of the first part of Theorem 4.
Let and be as in Proposition 18. It is enough to consider the case when , then for all and . Define
[TABLE]
[TABLE]
If then , , and the assertion immediately follows by Proposition 18 since .
Otherwise define
[TABLE]
We have by Lemma 17 applied with
[TABLE]
Thus
[TABLE]
Therefore
[TABLE]
If then and the assertion easily follows. Otherwise Proposition 18 yields
[TABLE]
Observe that for we have , so
[TABLE]
Remark 19**.**
A modification of the proof above shows that under the assumptions of Theorem 4 for any there exists such that
[TABLE]
6. Upper estimates for order statistics
We will need a few more facts concerning log-concave vectors.
Lemma 20**.**
Suppose that is a mean zero log-concave random vector with uncorrelated coordinates. Then for any and ,
[TABLE]
Proof.
Let and be the constants from Lemma 15. If then, by Lemma 15, and the assertion is obvious (with any ). Thus we will assume that .
Let and let be the density of . By Lemma 15 we know that , so
[TABLE]
On the other hand the second part of Lemma 15 yields
[TABLE]
Lemma 21**.**
Let be a log-concave random variable. Then
[TABLE]
Proof.
We may assume that is non-degenerate (otherwise the statement is obvious), in particular has no atoms. Log-concavity of yields
[TABLE]
Hence
[TABLE]
Since satisfies the same assumptions as , we also have
[TABLE]
Adding both estimates we get
[TABLE]
Lemma 22**.**
Suppose that is a log-concave random variable and . Then .
Proof.
Let then by Lemma 21
[TABLE]
Let us now prove (4) and see how it implies the second part of Theorem 4. Then we give a proof of (5).
Proof of (4).
Fix and set . Then . Define
[TABLE]
Observe that for and we have by Lemma 21
[TABLE]
Consider two cases.
Case 1. . Then , so and
[TABLE]
Therefore by (22)
[TABLE]
Case 2. . Observe that for any disjoint sets , and integers such that , we have
[TABLE]
Since
[TABLE]
we have and, by (23),
[TABLE]
Estimate (22) yields
[TABLE]
To estimate observe that by Lemma 22, the definition of and assumptions on ,
[TABLE]
Set and
[TABLE]
Note that we know already that . Thus the Paley-Zygmund inequality implies
[TABLE]
However Lemma 20 yields
[TABLE]
Therefore
[TABLE]
for sufficiently large . ∎
The unconditionality assumption plays a crucial role in the proof of the next lemma, which allows to derive the second part of Theorem 4 from estimate (4).
Lemma 23**.**
Let be an unconditional log-concave -dimensional random vector. Then for any ,
[TABLE]
Proof.
Let be the law of . Then is log-concave on . Define for ,
[TABLE]
It is easy to check that , hence
[TABLE]
Proof of the second part of Theorem 4.
Estimate (4) together with Lemma 23 yields
[TABLE]
and the assertion follows by integration by parts. ∎
Proof of (5).
Define , , and by (20) and (21), where this time . Estimate (22) is still valid so integration by parts yields
[TABLE]
Set
[TABLE]
Observe that
[TABLE]
Hence .
If , then , so
[TABLE]
Therefore it suffices to consider case only.
Since and , we have by (23),
[TABLE]
Since and is increasing for we have
[TABLE]
Therefore, considering instead of and instead of it is enough to show the following claim:
Let , and let be an -dimensional log-concave vector. Suppose that
[TABLE]
then
[TABLE]
We will show the claim by induction on . For the statement is obvious (since the assumptions are contradictory). Suppose now that and the assertion holds for .
Case 1. for some . Then
[TABLE]
where to get the last inequality we used that is concave on , so for . Therefore by the induction assumption applied to ,
[TABLE]
Case 2. for all . Applying Lemma 15 we get
[TABLE]
so . Moreover . Therefore by the result of Lee and Vempala [13] satisfies the exponential concentration with .
Let then and . Let
[TABLE]
By (4) (applied with instead of ) we have . Observe that
[TABLE]
Therefore by Lemma 10 we get
[TABLE]
Integration by parts yields
[TABLE]
and the induction step is shown in this case provided that . ∎
To obtain Corollary 6 we used the following lemma.
Lemma 24**.**
Assume that is a symmetric isotropic log-concave vector in . Then
[TABLE]
and
[TABLE]
Proof.
Observe that
[TABLE]
Thus Lemma 15 implies that for (with ) we have . Moreover, by the Markov inequality
[TABLE]
so . Since is non-increasing, we know that for .
Now we will prove (25). We have
[TABLE]
so it suffices to show that . To this end we fix . By (24) we know that , so the isotropicity of and Markov’s inequality yield for all . We may also assume that . Integration by parts and Lemma 21 yield
[TABLE]
Therefore
[TABLE]
so . ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Artstein-Avidan, A. Giannopoulos, and V. D. Milman, Asymptotic geometric analysis. Part I , Mathematical Surveys and Monographs, 202 , American Mathematical Society, Providence, RI, 2015.
- 2[2] N. Balakrishnan, and A. C. Cohen, Order Statistics and Inference, Academic Press, New York, 1991.
- 3[3] C. Borell, Convex measures on locally convex spaces , Ark. Math. 12 (1974), 239–252.
- 4[4] S. Brazitikos, A. Giannopoulos, P. Valettas, and B. H. Vritsiou, Geometry of isotropic convex bodies , Mathematical Surveys and Monographs 196 , American Mathematical Society, Providence, RI, 2014.
- 5[5] H. A. David, and H. N. Nagaraja, Order Statistics , 3rd ed. Wiley-Interscience, Hoboken, NJ, 2003.
- 6[6] R. Eldan, Thin shell implies spectral gap up to polylog via a stochastic localization scheme , Geom. Funct. Anal. 23 (2013), 532–569.
- 7[7] Y. Gordon, A. Litvak, C. Schütt, and E. Werner, Orlicz norms of sequences of random variables , Ann. Probab. 30 (2002), no. 4, 1833–1853.
- 8[8] Y. Gordon, A. Litvak, C. Schütt, and E. Werner, On the minimum of several random variables , Proc. Amer. Math. Soc. 134 (2006), no. 12, 3665–3675.
