On the convex infimum convolution inequality with optimal cost function
Marta Strzelecka, Micha{\l} Strzelecki, Tomasz Tkocz

TL;DR
This paper proves that symmetric random variables with log-concave tails satisfy an optimal convex infimum convolution inequality, leading to nearly optimal comparison of weak and strong moments for certain random vectors.
Contribution
It establishes the convex infimum convolution inequality with an optimal cost function for symmetric log-concave tail variables, advancing moment comparison theory.
Findings
Symmetric variables with log-concave tails satisfy the inequality.
Nearly optimal comparison of weak and strong moments achieved.
Results apply to symmetric random vectors with independent coordinates.
Abstract
We show that every symmetric random variable with log-concave tails satisfies the convex infimum convolution inequality with an optimal cost function (up to scaling). As a result, we obtain nearly optimal comparison of weak and strong moments for symmetric random vectors with independent coordinates with log-concave tails.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the convex infimum convolution inequality with optimal cost function
Marta Strzelecka
Institute of Mathematics, University of Warsaw, Banacha 2, 02–097 Warsaw, Poland.
,
Michał Strzelecki
Institute of Mathematics, University of Warsaw, Banacha 2, 02–097 Warsaw, Poland.
and
Tomasz Tkocz
Mathematics Department, Princeton University, Fine Hall, Princeton, NJ 08544-1000 USA.
(Date: February 23, 2017)
Abstract.
We show that every symmetric random variable with log-concave tails satisfies the convex infimum convolution inequality with an optimal cost function (up to scaling). As a result, we obtain nearly optimal comparison of weak and strong moments for symmetric random vectors with independent coordinates with log-concave tails.
Key words and phrases:
Infimum convolution, log-concave tails, convex functions, weak and strong moments
2010 Mathematics Subject Classification:
Primary: 60E15. Secondary: 26A51, 26B25.
Research partially supported by the National Science Centre, Poland, grants no. 2015/19/N/ST1/02661 (M. Strzelecka) and 2015/19/N/ST1/00891 (M. Strzelecki) as well as the Simons Foundation (T. Tkocz)
1. Introduction
Functional inequalities such as the Poincaré, log-Sobolev, or Marton-Talagrand inequality to name a few, play a crucial role in studying concentration of measure, an important cornerstone of the local theory of Banach spaces. In this paper we focus on another example of such inequalities, the infimum convolution inequality, introduced by Maurey in [11].
Let be a random vector with values in and let be a measurable function. We say that the pair satisfies the infimum convolution inequality (ICI for short) if for every bounded measurable function ,
[TABLE]
where denotes the infimum convolution of and defined as for . The function is called a cost function and is called a test function. We also say that the pair satisfies the convex infimum convolution inequality if (1.1) holds for every convex function bounded from below.
Maurey showed that Gaussian and exponential random variables satisfy the ICI with a quadratic and quadratic-linear cost function respectively. Thanks to the tensorisation property of the ICI, he recovered the Gaussian concentration inequality as well as the so-called Talagrand two-level concentration inequality for the exponential product measure. Moreover, Maurey proved that bounded random variables satisfy the convex ICI with a quadratic cost function (see also Lemma 3.2 in [14] for an improvement).
Later on, Maurey’s idea was developed further by Latała and Wojtaszczyk who studied comprehensively the ICI in [10]. By testing with linear functions, they observed that the optimal cost function is given by the Legendre transform of the cumulant-generating function (here optimal means largest possible, up to a scaling constant, because the larger the cost function is, the better (1.1) gets). They introduced the notion of optimal infimum convolution inequalities, established them for log-concave product measures and uniform measures on -balls, and put forward important, challenging and far-reaching conjectures (see also [6]).
The recent works [4] and [3] enable to view the ICI from a different perspective. In [4] the authors introduce weak transport-entropy inequalities and establish their dual formulations. The dual formulations are exactly the convex ICIs. In [3] the authors investigate extensively the weak transport cost inequalities on the real line, obtaining a characterisation for arbitrary cost functions which are convex and quadratic near zero, thus providing a tool for studying the convex ICI. Around the same time, the convex ICI for the quadratic-linear cost function was fully understood in [2].
In this paper we go along Latała and Wojtaszczyk’s line of research and study the optimal convex ICI. Using the aforementioned novel tools from [3], we show that product measures with symmetric marginals having log-concave tails satisfy the optimal convex ICI, which complements Latała and Wojtaszczyk’s result about log-concave product measures. This has applications to concentration and moment comparison of any norm of such vectors in the spirit of celebrated Paouris’ inequality (see [13] and [1]) and addresses some questions posed lately in [7]. We also offer an example showing that the assumption of log-concave tails cannot be weakened substantially.
2. Main results
For a random vector in we define
[TABLE]
which is the Legendre transform of the cumulant-generating function
[TABLE]
If is symmetric and the pair satisfies the ICI, then for every (see Remark 2.12 in [10]). In other words, is the optimal cost function for which the ICI can hold. Since this conclusion is obtained by testing (1.1) with linear functions, the same holds for the convex ICI. Following [10] we shall say that satisfies (convex) if the pair satisfies the (convex) ICI.
We are ready to present our first main result.
Theorem 2.1**.**
Let be a symmetric random variable with log-concave tails, i.e. such that the function
[TABLE]
is convex. Then there exists a universal constant such that satisfies convex .
The (convex) ICI tensorises and, consequently, the property (convex) IC tensorises: if independent random vectors satisfy (convex) , , then the vector satisfies (convex) (see [11] and [10]). Therefore we have the following corollary.
Corollary 2.2**.**
Let be a symmetric random vector with values in and independent coordinates with log-concave tails. Then satisfies convex with a universal constant .
Note that the class of distributions from Theorem 2.1 is wider than the class of symmetric log-concave product distributions considered by Latała and Wojtaszczyk in [10]. Among others, it contains measures which do not have a connected support, e.g. a symmetric Bernoulli random variable.
In order to comment on the relevance of the assumptions of Theorem 2.1 and present applications to comparison of weak and strong moments, we need the following definition. Let be a random vector with values in . We say that the moments of grow -regularly if for every and every we have
[TABLE]
where is the -th integral norm of a random variable . Clearly, if the moments of grow -regularly, then has to be at least (unless a.s.).
Remark 2.3*.*
If is a symmetric random variable with log-concave tails, then its moments grow -regularly (this classical fact follows for instance from Proposition 5.5 from [5] and the proof of Proposition 3.8 from [10]).
The assumption of log-concave tails in Theorem 2.1 cannot be replaced by a weaker one of -regularity of moments: if is a symmetric random variable defined by
[TABLE]
then the moments of grow -regularly (for some ), but there does not exists such that the pair satisfies the convex ICI. All the more, cannot satisfy convex with any (see Section 5 for details). Thus it seems that the assumptions of Theorem 2.1 are not far from necessary conditions for the convex ICI to hold with an optimal cost function (random variables with moments growing regularly are akin to random variables with log-concave tails as the former can essentially be sandwiched between the latter, see (4.6) in [9]).
Our second main result is an application of Theorem 2.1 to moment comparison. Recall that for a random vector its -th weak moment associated with a norm is the quantity defined as
[TABLE]
where is the dual norm of . The following version of [10, Proposition 3.15] holds (some non-trivial modifications of the proof are necessary in order to deal with the fact that the inequality (1.1) only holds for convex functions).
Theorem 2.4**.**
Let be a symmetric random vector with values in which moments grow -regularly. Suppose moreover that satisfies convex . Then for every norm on and every we have
[TABLE]
where is a universal constant (one can take ).
Immediately we obtain the following corollary in the spirit of the results from [13, 1, 7, 8]. Similar inequalities for Rademacher sums with the emphasis on exact values of constants have also been studied by Oleszkiewicz (see [12, Theorem 2.1]).
Corollary 2.5**.**
Let be a symmetric random vector with values in and with independent coordinates which have log-concave tails. Then for every norm on and every we have
[TABLE]
where is a universal constant (one can take ).
Note that each of the terms on the right-hand side of (2.2) is, up to a constant, dominated by the left-hand side of (2.2), so (2.2) yields the comparison of weak and strong moments of the norms of .
Note also that the constant standing at is equal to . If we only assume that the coordinates of are independent and their moments grow -regularly, then (2.2) does not always hold (the example here is a vector with independent coordinates distributed like in (2.1); see Section 5 for details), although by [7, Theorem 1.1] it holds if we allow the constant at to be greater than and to depend on . Hence Corollary 2.5 and example (2.1) partially answer the following question raised in [7]: “For which vectors does the comparison of weak and strong moments hold with constant at the first strong moment?”
The organization of the paper is the following. In Section 3 and 4 we present the proofs of Theorem 2.1 and Proposition 2.4 respectively. In Section 5 we discuss example (2.1) in details.
3. Proof of Theorem 2.1
Our approach is based on a characterization – provided by Gozlan, Roberto, Samson, Shu, and Tetali in [3] – of measures on the real line which satisfy a weak transport-entropy inequality. We emphasize that our optimal cost functions need not be quadratic near the origin, therefore we cannot apply their characterization as is, but have to first fine-tune the cost functions a bit. We shall also need the following simple lemma.
Lemma 3.1**.**
If is a symmetric random variable and , then
[TABLE]
Proof.
Since is symmetric, we have
[TABLE]
Moreover, \mathcal{L}\bigl{(}\ln\cosh(\cdot)\bigr{)}(|u|)\leq|u|^{2} for (see for example the proof of [10, Proposition 3.3]). Therefore
[TABLE]
Throughout the proof stands for the generalized inverse of a function defined as
[TABLE]
Proof of Theorem 2.1.
Note that and the function is non-decreasing. First we tweak the assumptions and change the assertion to a more straightforward one.
Step 1 (first reduction). We claim that it suffices to prove the assertion for random variables for which the function is strictly increasing on the set where it is finite (or, in other words, only for ). Indeed, suppose we have done this and let now be any random variable satisfying the assumptions of the theorem. Let be a symmetric random variable such that , where . If and are represented in the standard way by the inverses of their CDFs on the probability space , then a.s. (and also a.s. as ). Hence and therefore also .
The theorem applied to the random variable and the above inequality imply that the pair satisfies the convex ICI. Taking we get the assertion for (in the second integral we just use the fact that the test function is bounded from below and thus is bounded from above; for the first integral it suffices to prove the convergence of integrals on any interval , and on such an interval we have , and thus is a good majorant).
Step 2 (second reduction). We claim that it suffices to prove the assertion for random variables such that . Indeed, suppose we have done this and let be any random variable satisfying the assumptions of the theorem. Let and let be a symmetric random variable such that . Then, similarly as in Step 1., , where is symmetric and . Thus we can apply the proposition to and we continue as in Step 1.
Step 3 (scaling). Due to the scaling properties of the Legendre transform, we can assume that , where (the case where is trivial). Note that then, by Markov’s inequality, , so
[TABLE]
Step 4 (reformulation). For let
[TABLE]
We claim that there exists a universal constant , such that the pair satisfies the convex infimum convolution inequality. Of course the assertion follows immediately from that.
Note that is convex, increasing on (because is convex and symmetric and thus non-decreasing on ). Crucially, for (by Lemma 3.1), so the cost function is quadratic near zero. Moreover, by Lemma 3.1, .
Let , where , are the distribution functions of and the symmetric exponential measure on , respectively. By [3, Theorem 1.1] we know that if there exists such that for every we have
[TABLE]
then the pair , where , satisfies the convex ICI. We will show that (3.2) holds with .
Step 5 (further reformulation). Let . We have three possibilities (recall that is left-continuous):
- •
. Then is continuous, increasing, and transforms onto . Also, is increasing and therefore is the usual inverse of .
- •
and . Then has an atom at . Moreover, .
- •
and .
Of course, in the first case one can extend by putting , so that all formulas below make sense.
Note that
[TABLE]
where denotes the right-sided limit of at (which is different from only if and has an atom at ). Hence, is continuous on the interval , the image of under is the interval \big{(}\frac{1}{2}\exp(-N(a)),1-\frac{1}{2}\exp(-N(a))\big{)}, and we have and . Since the image of under is equal to the image of under , we conclude that if and if . Denote .
When , it suffices to check condition (3.2) for (otherwise one can change , and decrease the right-hand side while not changing the value of the left-hand side of (3.2)). For we can write and . When , is a bijection (on its image), so we can obviously write again for any .
Therefore, in order to verify (3.2) we need to check that
[TABLE]
Since we consider the case when is finite for every , the Chernoff inequality applies, so for we have
[TABLE]
so
[TABLE]
Note that for , since would imply , and hence , and – by (3.4) – also , but for we have when or when and in either case is finite. Therefore for every we have . Since for such that (because is then continuous and increasing on ), the condition (3.3) is implied by
[TABLE]
In the next step we check that this is indeed satisfied.
Step 6 (checking the condition). Let (if we simply do not have to consider Case 2 below). We consider three cases. We repeatedly use the fact that for , , which follows by the convexity of and the property .
Case 1. . Then \varphi\bigl{(}|x-y|\bigr{)}=(x-y)^{2}\leq 1, so (3.5) is trivially satisfied.
Case 2. . Then \varphi\bigl{(}|x-y|\bigr{)}=\Lambda_{X}^{*}(\frac{1}{2\beta_{1}}|x-y|)\leq\Lambda^{*}_{X}(|x-y|/2). Inequality (3.4) implies that in order to prove (3.5) it suffices to show that if , are of the same sign, say , then N\bigl{(}|x-y|/2)\leq|N(x)-N(y)| and if have different signs, we have N\bigl{(}\bigl{(}|x|+|y|\bigr{)}/2\bigr{)}\leq N(|x|)+N(|y|).
By the convexity of , for we have
[TABLE]
and
[TABLE]
This finishes the proof of (3.5) in Case 2.
Case 3. . Then \varphi\bigl{(}|x-y|\bigr{)}=2|x-y|-1. Consider two sub-cases:
- (i)
have different signs. Without loss of generality we may assume . Thus in order to obtain (3.5) it suffices to show that . Note that , so . Thus
[TABLE]
which finishes the proof in case (i).
- (ii)
have the same sign. Without loss of generality we may assume Thus it suffices to show that . Note that due to the assumption of Case 3 we have , so by the convexity of we have
[TABLE]
This ends the examination of case (ii) and the proof of the theorem. ∎
4. Comparison of weak and strong moments
The goal of this section is to establish the comparison of weak and strong moments with respect to any norm for random vectors with independent coordinates having log-concave tails (Corollary 2.5). In view of Theorem 2.1 and Remark 2.3, it is enough to show Theorem 2.4.
Our proof of Theorem 2.4 comprises three steps: first we exploit -regularity of moments of to control the size of its cumulant-generating function , second we bound the infimum convolution of the optimal cost function with the convex test function being the norm properly rescaled, and finally by the property convex we obtain exponential tail bounds which integrated out give the desired moment inequality.
We start with two lemmas corresponding to the first two steps described above and then we put everything together.
Lemma 4.1**.**
Let and suppose that the moments of a random vector in grow -regularly. If for a vector we have , then
[TABLE]
Proof.
Let be the smallest integer larger than . If , then by -regularity we have
[TABLE]
Replace with to get the assertion. ∎
Lemma 4.2**.**
Let be a norm on and let be a random vector with values in and moments growing -regularly. For , , and we have
[TABLE]
where .
Proof.
For with positive being arbitrary for now we bound the infimum convolution as follows
[TABLE]
where in the last inequality we have used Lemma 4.1. Choose with such that . Then clearly and thus
[TABLE]
If we now set , then by the triangle inequality we obtain the desired lower bound
[TABLE]
Proof of Theorem 2.4.
Let with as in Lemma 4.2. Testing the property convex with and applying Lemma 4.2 yields
[TABLE]
By Jensen’s inequality we obtain that both and are bounded above by . Thus Markov’s inequality implies the tail bound
[TABLE]
Consequently,
[TABLE]
Plugging in the value of gives the result (we can take ). ∎
5. An example
Let be a symmetric random variable defined by , where
[TABLE]
or, in other words, let have the distribution
[TABLE]
Let us first show that the moments of grow -regularly, but does not satisfy for any (we also prove a slightly stronger statement later).
Let be a symmetric exponential random variable. Then has log-concave tails, so the moments of grow -regularly (see Remark 2.3). Moreover, if and are constructed in the standard way by the inverses of their CDFs on the probability space , then
[TABLE]
Therefore, for ,
[TABLE]
(we used the fact that in the last inequality). Thus the moments of grow -regularly.
On the other hand, for every there exists such that
[TABLE]
Therefore by [2, Theorem 1] there does not exist a constant such that the pair , where , satisfies the convex infimum convolution inequality. But, by symmetry and the -regularity of moments of ,
[TABLE]
Thus for some we have for and . Hence
[TABLE]
We conclude that cannot satisfy for any .
Remark 5.1*.*
Let us also sketch an alternative approach. Take , , and denote , for . One can check that
[TABLE]
if . It is rather elementary but cumbersome to show that for any there exist and such that (1.1) is violated by the test function . We omit the details.
In fact, the above example shows that even a slightly stronger statement is true: for vectors with independent coordinates with -regular growth of moments the comparison of weak and strong moments of norms does not hold with the constant at the first strong moment. More precisely, let be independent random variables with distribution given by (5.1). We claim that there does not exist any such that
[TABLE]
holds for every and (note that we chose the -norm as our norm). We shall estimate the three expressions appearing in (5.2).
We have
[TABLE]
(this inequality is in fact an equality). Since the moments of grow -regularly, the last term in (5.2) is bounded by for some .
To estimate the remaining two terms we need the following standard fact.
Lemma 5.2**.**
For independent events ,
[TABLE]
In particular, for i.i.d. non-negative random variables ,
[TABLE]
Proof.
The upper bound is just the union bound. The lower bound follows from de Morgan’s laws combined with independence and the inequalities and for , . ∎
Fix and let . Then
[TABLE]
By the above lemma,
[TABLE]
Set . Then
[TABLE]
Similarly,
[TABLE]
Hence
[TABLE]
Putting (5.3), (5.4), and (5.5) together, we see that (5.2) would imply
[TABLE]
for every , , and of the form , . Take and to get
[TABLE]
Since and as this inequality yields , which is a contradiction. Hence inequality (5.2) cannot hold for all and .
Acknowledgments
We thank Radosław Adamczak and Rafał Latała for posing questions which led to the results presented in this note.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Adamczak, R. Latała, A. E. Litvak, K. Oleszkiewicz, A. Pajor, and N. Tomczak-Jaegermann, A short proof of Paouris’ inequality , Canad. Math. Bull. 57 (2014), no. 1, 3–8. MR 3150710
- 2[2] N. Feldheim, A. Marsiglietti, P. Nayar, and J. Wang, A note on the convex infimum convolution inequality , to appear in Bernoulli, preprint (2015), ar Xiv:1505.00240 .
- 3[3] N. Gozlan, C. Roberto, P.M. Samson, Y. Shu, and P. Tetali, Characterization of a class of weak transport-entropy inequalities on the line , to appear in Ann. Inst. Henri Poincaré Probab. Stat., preprint (2015), ar Xiv:1509.04202 v 2 .
- 4[4] N. Gozlan, C. Roberto, P.M. Samson, and P. Tetali, Kantorovich duality for general transport costs and applications , to appear in J. Funct. Anal., preprint (2014), ar Xiv:1412.7480 v 4 .
- 5[5] O. Guédon, P. Nayar, and T. Tkocz, Concentration inequalities and geometry of convex bodies , Analytical and probabilistic methods in the geometry of convex bodies, IMPAN Lect. Notes, vol. 2, Polish Acad. Sci. Inst. Math., Warsaw, 2014, pp. 9–86. MR 3329056
- 6[6] R. Latała, On some problems concerning log-concave random vectors , to appear in IMA Volume “Discrete Structures: Analysis and Applications”, Springer.
- 7[7] R. Latała and M. Strzelecka, Comparison of weak and strong moments for vectors with independent coordinates , preprint (2016), ar Xiv:1612.02407 v 1 .
- 8[8] by same author, Weak and strong moments of ℓ r subscript ℓ 𝑟 \ell_{r} -norms of log-concave vectors , Proc. Amer. Math. Soc. 144 (2016), no. 8, 3597–3608. MR 3503729
