On the convex infimum convolution inequality with optimal cost function

Marta Strzelecka; Michał Strzelecki; Tomasz Tkocz

arXiv:1702.07321·math.PR·May 18, 2021

TL;DR

This paper proves that symmetric random variables with log-concave tails satisfy the convex infimum convolution inequality with an optimal cost function (up to scaling), which yields a nearly optimal comparison of weak and strong moments for symmetric random vectors with independent coordinates with log-concave tails.

Contribution

It establishes the convex infimum convolution inequality with an optimal cost function (up to scaling) for symmetric random variables with log-concave tails, and applies it to the comparison of weak and strong moments.

Findings

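(i) Symmetric random variables with log-concave tails satisfy the convex infimum convolution inequality with an optimal cost function (up to scaling).

(ii) This yields a nearly optimal comparison of weak and strong moments.

(iii) The results apply to symmetric random vectors with independent coordinates with log-concave tails.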

Abstract

We show that every symmetric random variable with log-concave tails satisfies the convex infimum convolution inequality with an optimal cost function (up to scaling). As a result, we obtain nearly optimal comparison of weak and strong moments for symmetric random vectors with independent coordinates with log-concave tails.

Equations

$$\operatorname*{\mathbb{E}}e^{f\square\varphi(X)}\operatorname*{\mathbb{E}}e^{-f(X)}\leq 1,$$

$$\Lambda_{X}^{*}(x):=\mathcal{L}\Lambda_{X}(x):=\sup_{y\in\mathbb{R}^{n}}\bigl\{\langle x,y\rangle-\ln\operatorname*{\mathbb{E}}e^{\langle y,X\rangle}\bigr\},$$

$$\Lambda_{X}(x):=\ln\operatorname*{\mathbb{E}}e^{\langle x,X\rangle},\qquad x\in\mathbb{R}^{n}.$$

$$t\mapsto N(t):=-\ln\mathbb{P}(|X|\geq t),\qquad t\geq 0,$$

$$\|\langle X,\theta\rangle\|_{p}\leq\alpha\frac{p}{q}\|\langle X,\theta\rangle\|_{q},$$

$$\mathbb{P}(|X|>t)=1_{[0,2)}(t)+\sum_{k=1}^{\infty}e^{-2^{k}}1_{[2^{k},2^{k+1})}(t),\qquad t\geq 0,$$

$$\sigma_{\|\cdot\|,X}(p):=\sup_{\|t\|_{*}\leq 1}\|\langle t,X\rangle\|_{p},$$

$$\Bigl(\operatorname*{\mathbb{E}}\bigl|\|X\|-\operatorname*{\mathbb{E}}\|X\|\bigr|^{p}\Bigr)^{1/p}\leq C\alpha\beta\sigma_{\|\cdot\|,X}(p),$$

$$\bigl(\operatorname*{\mathbb{E}}\|X\|^{p}\bigr)^{1/p}\leq\operatorname*{\mathbb{E}}\|X\|+D\sigma_{\|\cdot\|,X}(p),$$

$$\Lambda_{X}^{*}(x/\beta_{1})\leq x^{2}\qquad\text{for }|x|\leq 1.$$

$$\operatorname*{\mathbb{E}}e^{tX}=1+\sum_{k=1}^{\infty}\frac{\|X\|_{2k}^{2k}t^{2k}}{(2k)!}\geq 1+\sum_{k=1}^{\infty}\frac{\|X\|_{2}^{2k}t^{2k}}{(2k)!}=1+\sum_{k=1}^{\infty}\frac{\beta_{1}^{-2k}t^{2k}}{(2k)!}=\cosh(\beta_{1}^{-1}|t|).$$

$$\Lambda_{X}^{*}(x/\beta_{1})=\mathcal{L}\bigl(\Lambda_{X}(\beta_{1}\cdot)\bigr)(x)\leq\mathcal{L}\bigl(\ln\cosh(\cdot)\bigr)(x)\leq x^{2}\qquad\text{for }|x|\leq 1.$$

$$g^{-1}(y):=\inf\{x:g(x)\geq y\}.$$

$$N(1/2)\geq 2.$$

$$\varphi(x):=\bigl(x^{2}1_{\{|x|<1\}}+(2|x|-1)1_{\{|x|\geq 1\}}\bigr)\lor\Lambda_{X}^{*}\bigl(x/(2\beta_{1})\bigr).$$

$$\bigl|U(x)-U(y)\bigr|\leq\frac{1}{b}\varphi^{-1}\bigl(1+|x-y|\bigr),$$

$$F(t)=\begin{cases}\frac{1}{2}\exp(-N(|t|))&\text{if }t<0,\\ 1-\frac{1}{2}\exp(-N_{+}(t))&\text{if }t\geq 0,\end{cases}$$

$$|x-y|\leq\varphi^{-1}\bigl(1+\bigl|N(|x|)\operatorname*{sgn}(x)-N(|y|)\operatorname*{sgn}(y)\bigr|\bigr)\qquad\text{for }x,y\in A.$$

$$\frac{1}{2}e^{-N(t)}=\mathbb{P}(X\geq t)\leq e^{-\Lambda_{X}^{*}(t)},$$

$$N(t)\geq\Lambda_{X}^{*}(t)-\ln 2.$$

$$\varphi\bigl(|x-y|\bigr)\leq 1+\bigl|N(|x|)\operatorname*{sgn}(x)-N(|y|)\operatorname*{sgn}(y)\bigr|\qquad\text{for }x,y\in A.$$

$$N\bigl((s+t)/2\bigr)\leq\frac{1}{2}N(s)+\frac{1}{2}N(t)\leq N(s)+N(t)$$

$$N(s/2)+N(t)\leq N(s)+N(t)\leq\frac{s}{s+t}N(s+t)+\frac{t}{s+t}N(s+t)=N(s+t).$$

$$N(x)\geq N(1/2)\,2x\geq 4x\geq 2x+2|y|,$$

$$\frac{N(x)-N(y)}{x-y}\geq\frac{N(\frac{1}{2})-N(0)}{\frac{1}{2}-0}\geq 4\geq 2$$

$$\Lambda_{X}\bigl((2e\alpha)^{-1}pu\bigr)\leq p.$$

$$\begin{aligned}\Lambda_{X}(pu)&\leq\ln\Bigl(\sum_{0\leq k\leq p}\frac{p^{k}\|\langle u,X\rangle\|_{p}^{k}}{k!}+\sum_{k>p}\bigl(\alpha e\|\langle u,X\rangle\|_{p}\bigr)^{k}\Bigr)\\ &\leq\ln\Bigl(\sum_{0\leq k\leq p}\frac{p^{k}\|\langle u,X\rangle\|_{p}^{k}}{k!}+2\bigl(\alpha e\|\langle u,X\rangle\|_{p}\bigr)^{k_{0}}\Bigr)\\ &\leq\ln\Bigl(\sum_{0\leq k\leq p}\frac{p^{k}\|\langle u,X\rangle\|_{p}^{k}}{k!}+\frac{(2\alpha ep\|\langle u,X\rangle\|_{p})^{k_{0}}}{k_{0}!}\Bigr)\\ &\leq\ln\Bigl(\sum_{0\leq k\leq k_{0}}\frac{(2\alpha ep\|\langle u,X\rangle\|_{p})^{k}}{k!}\Bigr)\leq 2\alpha ep\|\langle u,X\rangle\|_{p}\leq p.\end{aligned}$$

$$\bigl(\Lambda_{X}^{*}(\cdot/\beta)\,\square\,a\|\cdot\|\bigr)(x)\geq a\|x\|-p,$$

Full text

On the convex infimum convolution inequality with optimal cost function

Marta Strzelecka
Institute of Mathematics, University of Warsaw, Banacha 2, 02–097 Warsaw, Poland.

Michał Strzelecki
Institute of Mathematics, University of Warsaw, Banacha 2, 02–097 Warsaw, Poland.

Tomasz Tkocz
Mathematics Department, Princeton University, Fine Hall, Princeton, NJ 08544-1000 USA.

(Date: February 23, 2017)

Key words and phrases:

Infimum convolution, log-concave tails, convex functions, weak and strong moments

2010 Mathematics Subject Classification:

Primary: 60E15. Secondary: 26A51, 26B25.

Research partially supported by the National Science Centre, Poland, grants no. 2015/19/N/ST1/02661 (M. Strzelecka) and 2015/19/N/ST1/00891 (M. Strzelecki) as well as the Simons Foundation (T. Tkocz)

1. Introduction

Functional inequalities such as the Poincaré, log-Sobolev, or Marton–Talagrand inequalities, to name a few, play a crucial role in the study of concentration of measure, an important cornerstone of the local theory of Banach spaces. In this paper we focus on another example of such inequalities, the infimum convolution inequality, introduced by Maurey in [11].

Let $X$ be a random vector with values in $\mathbb{R}^{n}$ and let $\varphi:\mathbb{R}^{n}\to[0,\infty]$ be a measurable function. We say that the pair $(X,\varphi)$ satisfies the infimum convolution inequality (ICI for short) if for every bounded measurable function $f:\mathbb{R}^{n}\to\mathbb{R}$ ,

$$\operatorname*{\mathbb{E}}e^{f\square\varphi(X)}\operatorname*{\mathbb{E}}e^{-f(X)}\leq 1,\tag{1.1}$$

where $f\square\varphi$ denotes the infimum convolution of $f$ and $\varphi$ defined as $f\square\varphi(x)=\inf\{f(y)+\varphi(x-y):y\in\mathbb{R}^{n}\}$ for $x\in\mathbb{R}^{n}$ . The function $\varphi$ is called a cost function and $f$ is called a test function. We also say that the pair $(X,\varphi)$ satisfies the convex infimum convolution inequality if (1.1) holds for every convex function $f:\mathbb{R}^{n}\to\mathbb{R}$ bounded from below.

Maurey showed that Gaussian and exponential random variables satisfy the ICI with a quadratic and quadratic-linear cost function respectively. Thanks to the tensorisation property of the ICI, he recovered the Gaussian concentration inequality as well as the so-called Talagrand two-level concentration inequality for the exponential product measure. Moreover, Maurey proved that bounded random variables satisfy the convex ICI with a quadratic cost function (see also Lemma 3.2 in [14] for an improvement).

Later on, Maurey’s idea was developed further by Latała and Wojtaszczyk who studied comprehensively the ICI in [10]. By testing with linear functions, they observed that the optimal cost function is given by the Legendre transform of the cumulant-generating function (here optimal means largest possible, up to a scaling constant, because the larger the cost function is, the better (1.1) gets). They introduced the notion of optimal infimum convolution inequalities, established them for log-concave product measures and uniform measures on $\ell_{p}$ -balls, and put forward important, challenging and far-reaching conjectures (see also [6]).

The recent works [4] and [3] make it possible to view the ICI from a different perspective. In [4] the authors introduce weak transport-entropy inequalities and establish their dual formulations, which are exactly the convex ICIs. In [3] the authors investigate extensively the weak transport cost inequalities on the real line, obtaining a characterisation for arbitrary cost functions which are convex and quadratic near zero, thus providing a tool for studying the convex ICI. Around the same time, the convex ICI for the quadratic-linear cost function was fully understood in [2].

In this paper we follow Latała and Wojtaszczyk’s line of research and study the optimal convex ICI. Using the aforementioned novel tools from [3], we show that product measures with symmetric marginals having log-concave tails satisfy the optimal convex ICI, which complements Latała and Wojtaszczyk’s result about log-concave product measures. This has applications to concentration and moment comparison of any norm of such vectors in the spirit of the celebrated Paouris inequality (see [13] and [1]) and addresses some questions posed lately in [7]. We also offer an example showing that the assumption of log-concave tails cannot be weakened substantially.

2. Main results

For a random vector $X$ in $\mathbb{R}^{n}$ we define

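$$\Lambda_{X}^{*}(x):=\mathcal{L}\Lambda_{X}(x):=\sup_{y\in\mathbb{R}^{n}}\bigl\{\langle x,y\rangle-\ln\operatorname*{\mathbb{E}}e^{\langle y,X\rangle}\bigr\},$$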

which is the Legendre transform of the cumulant-generating function

$$\Lambda_{X}(x):=\ln\operatorname*{\mathbb{E}}e^{\langle x,X\rangle},\qquad x\in\mathbb{R}^{n}.$$
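As a concrete illustration of these two definitions (added here, not taken from the paper), the following block works them out in dimension one for a standard Gaussian variable $G$ and a symmetric Bernoulli variable $\varepsilon$ with $\mathbb{P}(\varepsilon=\pm 1)=1/2$. The names $G$ and $\varepsilon$ are introduced only for this example.

```latex
% Standard Gaussian G: the cumulant-generating function is quadratic,
% and so is its Legendre transform (supremum attained at y = x).
\[
\Lambda_{G}(x)=\ln\mathbb{E}e^{xG}=\frac{x^{2}}{2},
\qquad
\Lambda_{G}^{*}(x)=\sup_{y\in\mathbb{R}}\Bigl\{xy-\frac{y^{2}}{2}\Bigr\}=\frac{x^{2}}{2}.
\]
% Symmetric Bernoulli: for |x| < 1 the supremum is attained at
% y = artanh(x); the convention 0 ln 0 = 0 covers |x| = 1.
\[
\Lambda_{\varepsilon}(x)=\ln\cosh x,
\qquad
\Lambda_{\varepsilon}^{*}(x)=
\begin{cases}
\frac{1}{2}\bigl[(1+x)\ln(1+x)+(1-x)\ln(1-x)\bigr], & |x|\leq 1,\\
\infty, & |x|>1.
\end{cases}
\]
```

In particular $\Lambda_{\varepsilon}^{*}(\pm 1)=\ln 2$, and the transform is infinite outside the support of $\varepsilon$.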

If $X$ is symmetric and the pair $(X,\varphi)$ satisfies the ICI, then $\varphi(x)\leq\Lambda^{*}_{X}(x)$ for every $x\in\mathbb{R}^{n}$ (see Remark 2.12 in [10]). In other words, $\Lambda^{*}_{X}$ is the optimal cost function $\varphi$ for which the ICI can hold. Since this conclusion is obtained by testing (1.1) with linear functions, the same holds for the convex ICI. Following [10] we shall say that $X$ satisfies (convex) $\text{IC}(\beta)$ if the pair $(X,\Lambda_{X}^{*}(\cdot/\beta))$ satisfies the (convex) ICI.
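The bound $\varphi\leq\Lambda^{*}_{X}$ can be traced through a formal computation. The sketch below is an illustration under extra assumptions that are not stated in the text: $\varphi$ is taken to be convex and lower semicontinuous, and the linear function $f(x)=\langle a,x\rangle$ is plugged into (1.1) directly, although it is not bounded from below (a rigorous argument needs an approximation step; see [10, Remark 2.12]).

```latex
% Infimum convolution of a linear function with the cost (substitute z = x - y);
% here \varphi^{*} denotes the Legendre transform of \varphi.
\[
f\square\varphi(x)
=\inf_{y}\bigl\{\langle a,y\rangle+\varphi(x-y)\bigr\}
=\langle a,x\rangle-\sup_{z}\bigl\{\langle a,z\rangle-\varphi(z)\bigr\}
=\langle a,x\rangle-\varphi^{*}(a).
\]
% Insert into (1.1), take logarithms, and use symmetry: \Lambda_X(-a) = \Lambda_X(a).
\[
\Lambda_{X}(a)-\varphi^{*}(a)+\Lambda_{X}(-a)\leq 0
\quad\Longrightarrow\quad
\varphi^{*}(a)\geq 2\Lambda_{X}(a).
\]
% The Legendre transform reverses inequalities, and \varphi = \varphi^{**}
% under the convexity assumption made above.
\[
\varphi(x)=\varphi^{**}(x)\leq(2\Lambda_{X})^{*}(x)=2\Lambda_{X}^{*}(x/2)\leq\Lambda_{X}^{*}(x).
\]
```

The last inequality uses the convexity of $\Lambda_{X}^{*}$ together with $\Lambda_{X}^{*}(0)=0$, which holds because $\operatorname*{\mathbb{E}}e^{\langle a,X\rangle}=\operatorname*{\mathbb{E}}\cosh\langle a,X\rangle\geq 1$ for symmetric $X$.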

We are ready to present our first main result.

Theorem 2.1.

Let $X$ be a symmetric random variable with log-concave tails, i.e. such that the function

$$t\mapsto N(t):=-\ln\mathbb{P}(|X|\geq t),\qquad t\geq 0,$$

is convex. Then there exists a universal constant $\beta\leq 1680e$ such that $X$ satisfies convex $\text{IC}(\beta)$ .

The (convex) ICI tensorises and, consequently, the property (convex) IC tensorises: if independent random vectors $X_{i}$ satisfy (convex) $\textrm{IC}(\beta_{i})$ , $i=1,\ldots,n$ , then the vector $(X_{1},\ldots,X_{n})$ satisfies (convex) $\textrm{IC}(\max\beta_{i})$ (see [11] and [10]). Therefore we have the following corollary.

Corollary 2.2.

Let $X$ be a symmetric random vector with values in $\mathbb{R}^{n}$ and independent coordinates with log-concave tails. Then $X$ satisfies convex $\text{IC}(\beta)$ with a universal constant $\beta\leq 1680e$ .

Note that the class of distributions from Theorem 2.1 is wider than the class of symmetric log-concave product distributions considered by Latała and Wojtaszczyk in [10]. Among others, it contains measures which do not have a connected support, e.g. a symmetric Bernoulli random variable.
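To make the log-concave tails assumption concrete, here are three one-dimensional cases worked out from the definition of $N$ in Theorem 2.1. They are illustrations added here, with convexity understood for functions taking values in $[0,\infty]$; the exponent $r$ is a parameter introduced only for the third example.

```latex
% Symmetric exponential variable, density (1/2) e^{-|x|}: N is linear, hence convex.
\[
\mathbb{P}(|X|\geq t)=e^{-t},\qquad N(t)=t .
\]
% Symmetric Bernoulli variable, P(X = 1) = P(X = -1) = 1/2:
% N is the convex indicator of [0,1], although the support is disconnected.
\[
N(t)=\begin{cases}0, & 0\leq t\leq 1,\\ \infty, & t>1.\end{cases}
\]
% Weibull-type tails: convex exactly when r >= 1.
\[
\mathbb{P}(|X|\geq t)=e^{-t^{r}},\qquad N(t)=t^{r},\qquad r\geq 1 .
\]
```

For $0<r<1$ the function $t\mapsto t^{r}$ is strictly concave, so such heavier tails fall outside the scope of Theorem 2.1.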

In order to comment on the relevance of the assumptions of Theorem 2.1 and present applications to comparison of weak and strong moments, we need the following definition. Let $X$ be a random vector with values in $\mathbb{R}^{n}$ . We say that the moments of $X$ grow $\alpha$ -regularly if for every $p\geq q\geq 2$ and every $\theta\in S^{n-1}$ we have

$$\|\langle X,\theta\rangle\|_{p}\leq\alpha\frac{p}{q}\|\langle X,\theta\rangle\|_{q},$$

where $\|Y\|_{p}:=(\operatorname*{\mathbb{E}}|Y|^{p})^{1/p}$ is the $p$ -th integral norm of a random variable $Y$ . Clearly, if the moments of $X$ grow $\alpha$ -regularly, then $\alpha$ has to be at least $1$ (unless $X=0$ a.s.).

Remark 2.3.

If $X$ is a symmetric random variable with log-concave tails, then its moments grow $1$ -regularly (this classical fact follows for instance from Proposition 5.5 from [5] and the proof of Proposition 3.8 from [10]).
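A quick sanity check of $1$-regularity, added here as an illustration: for the symmetric exponential variable with density $\frac{1}{2}e^{-|x|}$ (an example chosen for this note) the moments are explicit, and the defining inequality can be verified by hand for $p=4$, $q=2$.

```latex
% Moments of the symmetric exponential distribution.
\[
\mathbb{E}|X|^{p}=\int_{0}^{\infty}t^{p}e^{-t}\,dt=\Gamma(p+1),
\qquad
\|X\|_{p}=\Gamma(p+1)^{1/p}.
\]
% The inequality ||X||_p <= (p/q) ||X||_q with p = 4, q = 2.
\[
\|X\|_{4}=24^{1/4}\approx 2.213
\;\leq\;
\frac{4}{2}\,\|X\|_{2}=2\sqrt{2}\approx 2.828 .
\]
```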

The assumption of log-concave tails in Theorem 2.1 cannot be replaced by a weaker one of $\alpha$ -regularity of moments: if $X$ is a symmetric random variable defined by

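$$\mathbb{P}(|X|>t)=1_{[0,2)}(t)+\sum_{k=1}^{\infty}e^{-2^{k}}1_{[2^{k},2^{k+1})}(t),\qquad t\geq 0,\tag{2.1}$$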

then the moments of $X$ grow $\alpha$-regularly (for some $\alpha<\infty$), but there does not exist $C>0$ such that the pair $(X,x\mapsto\max\{(Cx)^{2},C|x|\})$ satisfies the convex ICI. All the more, $X$ cannot satisfy convex $\text{IC}(\beta)$ with any $\beta<\infty$ (see Section 5 for details). Thus it seems that the assumptions of Theorem 2.1 are not far from necessary conditions for the convex ICI to hold with an optimal cost function (random variables with moments growing regularly are akin to random variables with log-concave tails as the former can essentially be sandwiched between the latter, see (4.6) in [9]).
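It may help to see explicitly why this example lies outside Theorem 2.1. The computation below is an added illustration; it assumes the tail $\mathbb{P}(|X|>t)=1_{[0,2)}(t)+\sum_{k\geq 1}e^{-2^{k}}1_{[2^{k},2^{k+1})}(t)$, so that $|X|$ takes only the values $2,4,8,\ldots$

```latex
% N(t) = -ln P(|X| >= t) is piecewise constant with jumps just after the atoms 2^k.
\[
N(t)=\begin{cases}
0, & 0\leq t\leq 2,\\
2^{k}, & 2^{k}<t\leq 2^{k+1},\quad k\geq 1.
\end{cases}
\]
% Midpoint convexity fails between t = 2 and t = 4.
\[
N(3)=2\;>\;1=\tfrac{1}{2}\bigl(N(2)+N(4)\bigr).
\]
```

So $N$ is not convex, that is, $X$ does not have log-concave tails, even though its moments grow regularly.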

Our second main result is an application of Theorem 2.1 to moment comparison. Recall that for a random vector $X$ its $p$ -th weak moment associated with a norm $\|\cdot\|$ is the quantity defined as

$$\sigma_{\|\cdot\|,X}(p):=\sup_{\|t\|_{*}\leq 1}\|\langle t,X\rangle\|_{p},$$

where $\|\cdot\|_{*}$ is the dual norm of $\|\cdot\|$ . The following version of [10, Proposition 3.15] holds (some non-trivial modifications of the proof are necessary in order to deal with the fact that the inequality (1.1) only holds for convex functions).

Theorem 2.4.

Let $X$ be a symmetric random vector with values in $\mathbb{R}^{n}$ whose moments grow $\alpha$-regularly. Suppose moreover that $X$ satisfies convex $\text{IC}(\beta)$. Then for every norm $\|\cdot\|$ on $\mathbb{R}^{n}$ and every $p\geq 2$ we have

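$$\Bigl(\operatorname*{\mathbb{E}}\bigl|\|X\|-\operatorname*{\mathbb{E}}\|X\|\bigr|^{p}\Bigr)^{1/p}\leq C\alpha\beta\sigma_{\|\cdot\|,X}(p),$$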

where $C$ is a universal constant (one can take $C=4\sqrt{2}e<16$ ).

Immediately we obtain the following corollary in the spirit of the results from [13, 1, 7, 8]. Similar inequalities for Rademacher sums with the emphasis on exact values of constants have also been studied by Oleszkiewicz (see [12, Theorem 2.1]).

Corollary 2.5.

Let $X$ be a symmetric random vector with values in $\mathbb{R}^{n}$ and with independent coordinates which have log-concave tails. Then for every norm $\|\cdot\|$ on $\mathbb{R}^{n}$ and every $p\geq 2$ we have

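$$\bigl(\operatorname*{\mathbb{E}}\|X\|^{p}\bigr)^{1/p}\leq\operatorname*{\mathbb{E}}\|X\|+D\sigma_{\|\cdot\|,X}(p),\tag{2.2}$$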

where $D$ is a universal constant (one can take $D=6720\sqrt{2}e^{2}<70223$ ).
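The value of $D$ can be recovered from the earlier statements. The sketch below is an added illustration; it assumes that the corollary is obtained from Theorem 2.4 with $C=4\sqrt{2}e$, with $\beta=1680e$ from Corollary 2.2, and with $\alpha=1$, which is what the stated constant suggests.

```latex
% Triangle inequality in L^p, followed by Theorem 2.4.
\[
\bigl(\mathbb{E}\|X\|^{p}\bigr)^{1/p}
\leq\mathbb{E}\|X\|+\Bigl(\mathbb{E}\bigl|\|X\|-\mathbb{E}\|X\|\bigr|^{p}\Bigr)^{1/p}
\leq\mathbb{E}\|X\|+C\alpha\beta\,\sigma_{\|\cdot\|,X}(p).
\]
% Resulting constant.
\[
D=C\alpha\beta=4\sqrt{2}e\cdot 1\cdot 1680e=6720\sqrt{2}e^{2}\approx 7.0222\times 10^{4}.
\]
```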

Note that each of the terms on the right-hand side of (2.2) is, up to a constant, dominated by the left-hand side of (2.2), so (2.2) yields the comparison of weak and strong moments of the norms of $X$ .
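The domination claimed in this note follows from two elementary facts, written out here as an added illustration: Jensen's inequality for the first term, and the bound $|\langle t,X\rangle|\leq\|t\|_{*}\|X\|$ for the second.

```latex
\[
\mathbb{E}\|X\|\leq\bigl(\mathbb{E}\|X\|^{p}\bigr)^{1/p},
\qquad
\sigma_{\|\cdot\|,X}(p)
=\sup_{\|t\|_{*}\leq 1}\bigl(\mathbb{E}|\langle t,X\rangle|^{p}\bigr)^{1/p}
\leq\bigl(\mathbb{E}\|X\|^{p}\bigr)^{1/p}.
\]
% Hence the strong moment is sandwiched between the two sides:
\[
\max\bigl\{\mathbb{E}\|X\|,\ \sigma_{\|\cdot\|,X}(p)\bigr\}
\leq\bigl(\mathbb{E}\|X\|^{p}\bigr)^{1/p}
\leq\mathbb{E}\|X\|+D\,\sigma_{\|\cdot\|,X}(p).
\]
```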

Note also that the constant standing at $\operatorname*{\mathbb{E}}\|X\|$ is equal to $1$ . If we only assume that the coordinates of $X$ are independent and their moments grow $\alpha$ -regularly, then (2.2) does not always hold (the example here is a vector with independent coordinates distributed like in (2.1); see Section 5 for details), although by [7, Theorem 1.1] it holds if we allow the constant at $\operatorname*{\mathbb{E}}\|X\|$ to be greater than $1$ and to depend on $\alpha$ . Hence Corollary 2.5 and example (2.1) partially answer the following question raised in [7]: “For which vectors does the comparison of weak and strong moments hold with constant $1$ at the first strong moment?”

The paper is organized as follows. In Sections 3 and 4 we present the proofs of Theorem 2.1 and Theorem 2.4, respectively. In Section 5 we discuss example (2.1) in detail.

3. Proof of Theorem 2.1

Our approach is based on a characterization – provided by Gozlan, Roberto, Samson, Shu, and Tetali in [3] – of measures on the real line which satisfy a weak transport-entropy inequality. We emphasize that our optimal cost functions need not be quadratic near the origin, therefore we cannot apply their characterization as is, but have to first fine-tune the cost functions a bit. We shall also need the following simple lemma.

Lemma 3.1.

If $X$ is a symmetric random variable and $\operatorname*{\mathbb{E}}X^{2}=\beta_{1}^{-2}$ , then

$$\Lambda_{X}^{*}(x/\beta_{1})\leq x^{2}\qquad\text{for }|x|\leq 1.$$

Proof.

Since $X$ is symmetric, we have

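$$\operatorname*{\mathbb{E}}e^{tX}=1+\sum_{k=1}^{\infty}\frac{\|X\|_{2k}^{2k}t^{2k}}{(2k)!}\geq 1+\sum_{k=1}^{\infty}\frac{\|X\|_{2}^{2k}t^{2k}}{(2k)!}=1+\sum_{k=1}^{\infty}\frac{\beta_{1}^{-2k}t^{2k}}{(2k)!}=\cosh(\beta_{1}^{-1}|t|).$$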

Moreover, $\mathcal{L}\bigl{(}\ln\cosh(\cdot)\bigr{)}(|u|)\leq|u|^{2}$ for $|u|\leq 1$ (see for example the proof of [10, Proposition 3.3]). Therefore

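$$\Lambda_{X}^{*}(x/\beta_{1})=\mathcal{L}\bigl(\Lambda_{X}(\beta_{1}\cdot)\bigr)(x)\leq\mathcal{L}\bigl(\ln\cosh(\cdot)\bigr)(x)\leq x^{2}\qquad\text{for }|x|\leq 1.$$

∎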
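The end of this proof relies on two standard properties of the Legendre transform $\mathcal{L}$, spelled out below as an added illustration; $g$, $h$ and $\beta>0$ are generic symbols introduced for the example.

```latex
% Scaling: substitute z = \beta y.
\[
\mathcal{L}\bigl(g(\beta\,\cdot)\bigr)(x)
=\sup_{y}\{xy-g(\beta y)\}
=\sup_{z}\Bigl\{\frac{x}{\beta}\,z-g(z)\Bigr\}
=(\mathcal{L}g)(x/\beta).
\]
% Order reversal.
\[
g\geq h\quad\Longrightarrow\quad\mathcal{L}g\leq\mathcal{L}h .
\]
% Applied with g = \Lambda_X(\beta_1 \cdot) and h = \ln\cosh, using E e^{sX} >= cosh(s/\beta_1):
\[
\Lambda_{X}(\beta_{1}t)=\ln\mathbb{E}e^{\beta_{1}tX}\geq\ln\cosh(t).
\]
```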

Throughout the proof $g^{-1}$ stands for the generalized inverse of a function $g$ defined as

$$g^{-1}(y):=\inf\{x:g(x)\geq y\}.$$

Proof of Theorem 2.1.

Note that $N(0)=0$ and the function $N$ is non-decreasing. First we tweak the assumptions and change the assertion to a more straightforward one.

Step 1 (first reduction). We claim that it suffices to prove the assertion for random variables for which the function $N$ is strictly increasing on the set where it is finite (or, in other words, $N(t)=0$ only for $t=0$ ). Indeed, suppose we have done this and let now $X$ be any random variable satisfying the assumptions of the theorem. Let $X_{\varepsilon}$ be a symmetric random variable such that $\mathbb{P}(|X_{\varepsilon}|\geq t)=\exp(-N_{\varepsilon}(t))$ , where $N_{\varepsilon}(t)=N(t)\lor\varepsilon t$ . If $X$ and $X_{\varepsilon}$ are represented in the standard way by the inverses of their CDFs on the probability space $(0,1)$ , then $|X_{\varepsilon}|\leq|X|$ a.s. (and also $X_{\varepsilon}\to X$ a.s. as $\varepsilon\to 0^{+}$ ). Hence $\Lambda_{X_{\varepsilon}}\leq\Lambda_{X}$ and therefore also $\Lambda^{*}_{X_{\varepsilon}}\geq\Lambda^{*}_{X}$ .

The theorem applied to the random variable $X_{\varepsilon}$ and the above inequality imply that the pair $(X_{\varepsilon},\Lambda^{*}_{X}(\cdot/\beta))$ satisfies the convex ICI. Taking $\varepsilon\to 0^{+}$ we get the assertion for $X$ (in the second integral we just use the fact that the test function $f$ is bounded from below and thus $e^{-f}$ is bounded from above; for the first integral it suffices to prove the convergence of integrals on any interval $[-M,M]$ , and on such an interval we have $f\square\Lambda^{*}_{X}(x/\beta)\leq f(x)+\Lambda^{*}_{X}(0)=f(x)$ , and thus $\exp(\max_{[-M,M]}f)$ is a good majorant).

Step 2 (second reduction). We claim that it suffices to prove the assertion for random variables such that $\Lambda_{X}<\infty$. Indeed, suppose we have done this and let $X$ be any random variable satisfying the assumptions of the theorem. Let $N_{\varepsilon}(t)=N(t)\lor\varepsilon^{2}t^{2}$ and let $X_{\varepsilon}$ be a symmetric random variable such that $\mathbb{P}(|X_{\varepsilon}|\geq t)=\exp(-N_{\varepsilon}(t))$. Then, as in Step 1, $\Lambda_{X_{\varepsilon}}\leq\Lambda_{Y}<\infty$, where $Y$ is symmetric and $\mathbb{P}(|Y|\geq t)=\exp(-\varepsilon^{2}t^{2})$. Thus we can apply the theorem to $X_{\varepsilon}$ and continue as in Step 1.

Step 3 (scaling). Due to the scaling properties of the Legendre transform, we can assume that $\operatorname*{\mathbb{E}}X^{2}=\beta_{1}^{-2}$ , where $\beta_{1}:=2e$ (the case where $X\equiv 0$ is trivial). Note that then, by Markov’s inequality, $e^{-N(1/2)}=\mathbb{P}(|X|\geq\frac{1}{2})\leq 4\operatorname*{\mathbb{E}}X^{2}=e^{-2}$ , so

$$N(1/2)\geq 2.\tag{3.1}$$
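The arithmetic behind the bound $N(1/2)\geq 2$ in Step 3 is short enough to write out in full. This is an added worked computation using only the normalisation $\operatorname*{\mathbb{E}}X^{2}=\beta_{1}^{-2}$ with $\beta_{1}=2e$.

```latex
% Markov's inequality applied to X^2 at level 1/4.
\[
\mathbb{P}\bigl(|X|\geq\tfrac{1}{2}\bigr)
=\mathbb{P}\bigl(X^{2}\geq\tfrac{1}{4}\bigr)
\leq 4\,\mathbb{E}X^{2}
=\frac{4}{(2e)^{2}}
=e^{-2},
\]
% and therefore
\[
N(1/2)=-\ln\mathbb{P}\bigl(|X|\geq\tfrac{1}{2}\bigr)\geq 2 .
\]
```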

Step 4 (reformulation). For $x\in\mathbb{R}$ let

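$$\varphi(x):=\bigl(x^{2}1_{\{|x|<1\}}+(2|x|-1)1_{\{|x|\geq 1\}}\bigr)\lor\Lambda_{X}^{*}\bigl(x/(2\beta_{1})\bigr).$$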

We claim that there exists a universal constant $\widetilde{b}\leq 1/420$ such that the pair $(X,\varphi(\widetilde{b}\cdot))$ satisfies the convex infimum convolution inequality. The assertion follows immediately from that, because $\varphi\geq\Lambda_{X}^{*}(\cdot/(2\beta_{1}))$ and decreasing the cost function preserves (1.1).

Note that $\varphi$ is convex, increasing on $[0,\infty)$ (because $\Lambda_{X}^{*}(\cdot/(2\beta_{1}))$ is convex and symmetric and thus non-decreasing on $[0,\infty)$ ). Crucially, $\varphi(x)=x^{2}$ for $x\in[0,1]$ (by Lemma 3.1), so the cost function $\varphi$ is quadratic near zero. Moreover, by Lemma 3.1, $\varphi^{-1}(3)=2$ .

Let $U=F^{-1}\circ F_{\nu}$ , where $F$ , $F_{\nu}$ are the distribution functions of $X$ and the symmetric exponential measure $\nu$ on $\mathbb{R}$ , respectively. By [3, Theorem 1.1] we know that if there exists $b>0$ such that for every $x,y\in\mathbb{R}$ we have

$$\bigl|U(x)-U(y)\bigr|\leq\frac{1}{b}\varphi^{-1}\bigl(1+|x-y|\bigr),\tag{3.2}$$

then the pair $(X,\varphi(\widetilde{b}\cdot))$ , where $\widetilde{b}=\frac{b}{210\varphi^{-1}(2+1^{2})}=\frac{b}{420}$ , satisfies the convex ICI. We will show that (3.2) holds with $b=1$ .
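The constant $1680e$ in Theorem 2.1 can be read off from this step. The following is an added bookkeeping sketch, using only $\beta_{1}=2e$, $\widetilde{b}=1/420$ (that is, $b=1$), and the definition of $\varphi$ as a maximum involving $\Lambda_{X}^{*}(\cdot/(2\beta_{1}))$; the symbol $\psi$ is introduced here for a generic smaller cost function.

```latex
% The rescaled cost dominates the rescaled optimal cost.
\[
\varphi(\widetilde{b}x)\geq\Lambda_{X}^{*}\Bigl(\frac{\widetilde{b}x}{2\beta_{1}}\Bigr)
=\Lambda_{X}^{*}(x/\beta),
\qquad
\beta=\frac{2\beta_{1}}{\widetilde{b}}=2\cdot 2e\cdot 420=1680e .
\]
% A smaller cost gives a smaller infimum convolution, so (1.1) is inherited.
\[
\psi\leq\varphi
\;\Longrightarrow\;
f\square\psi\leq f\square\varphi
\;\Longrightarrow\;
\mathbb{E}e^{f\square\psi(X)}\,\mathbb{E}e^{-f(X)}
\leq\mathbb{E}e^{f\square\varphi(X)}\,\mathbb{E}e^{-f(X)}\leq 1 .
\]
```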

Step 5 (further reformulation). Let $a=\inf\{t>0:N(t)=\infty\}$ . We have three possibilities (recall that $N$ is left-continuous):

• $a=\infty$. Then $N$ is continuous, increasing, and transforms $[0,\infty]$ onto $[0,\infty]$. Also, $F$ is increasing and therefore $F^{-1}$ is the usual inverse of $F$.

• $a<\infty$ and $N(a)<\infty$. Then $X$ has an atom at $a$. Moreover, $N(a)=\lim_{t\to a^{-}}N(t)$.

• $a<\infty$ and $N(a)=\infty=\lim_{t\to a^{-}}N(t)$.

Of course, in the first case one can extend $N$ by putting $N(a)=\infty$ , so that all formulas below make sense.

Note that

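$$F(t)=\begin{cases}\frac{1}{2}\exp(-N(|t|))&\text{if }t<0,\\ 1-\frac{1}{2}\exp(-N_{+}(t))&\text{if }t\geq 0,\end{cases}$$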

where $N_{+}(t)$ denotes the right-sided limit of $N$ at $t$ (which is different from $N(t)$ only if $t=a$ and $X$ has an atom at $a$ ). Hence, $F$ is continuous on the interval $(-a,a)$ , the image of $(-a,a)$ under $F$ is the interval $\big{(}\frac{1}{2}\exp(-N(a)),1-\frac{1}{2}\exp(-N(a))\big{)}$ , and we have $F(-a)=\frac{1}{2}\exp(-N(a))$ and $F(a)=1$ . Since the image of $\mathbb{R}$ under $U$ is equal to the image of $(0,1)$ under $F^{-1}$ , we conclude that $U(\mathbb{R})=(-a,a)$ if $N(a)=\infty$ and $U(\mathbb{R})=[-a,a]$ if $N(a)<\infty$ . Denote $A:=U(\mathbb{R})$ .

When $N(a)<\infty$ , it suffices to check condition (3.2) for $x,y\in[-N(a),N(a)]$ (otherwise one can change $x$ , $y$ and decrease the right-hand side while not changing the value of the left-hand side of (3.2)). For $x\in[-N(a),N(a)]$ we can write $U^{-1}(x)=N(|x|)\operatorname*{sgn}(x)$ and $U^{-1}(x)\in\mathbb{R}$ . When $N(a)=\infty$ , $U$ is a bijection (on its image), so we can obviously write again $U^{-1}(x)=N(|x|)\operatorname*{sgn}(x)$ for any $x\in\mathbb{R}$ .

Therefore, in order to verify (3.2) we need to check that

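$$|x-y|\leq\varphi^{-1}\bigl(1+\bigl|N(|x|)\operatorname*{sgn}(x)-N(|y|)\operatorname*{sgn}(y)\bigr|\bigr)\qquad\text{for }x,y\in A.\tag{3.3}$$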

Since we consider the case when $\Lambda_{X}(t)$ is finite for every $t\in\mathbb{R}$ , the Chernoff inequality applies, so for $t\geq\operatorname*{\mathbb{E}}X=0$ we have

$$\frac{1}{2}e^{-N(t)}=\mathbb{P}(X\geq t)\leq e^{-\Lambda_{X}^{*}(t)},$$

so

$$N(t)\geq\Lambda_{X}^{*}(t)-\ln 2.\tag{3.4}$$

Note that $\varphi(|x-y|)<\infty$ for $x,y\in A$ , since $\varphi(|x-y|)=\infty$ would imply $\Lambda^{*}_{X}(|x-y|/(2\beta_{1}))=\infty$ , and hence $\Lambda^{*}_{X}(|x-y|/2)=\infty$ , and – by (3.4) – also $N(|x-y|/2)=\infty$ , but for $x,y\in A$ we have $|x-y|/2\in[0,a)$ when $N(a)=\infty$ or $|x-y|/2\in[0,a]$ when $N(a)<\infty$ and in either case $N(|x-y|/2)$ is finite. Therefore for every $x,y\in A$ we have $\varphi(|x-y|)<\infty$ . Since $\varphi^{-1}(\varphi(z))=z$ for $z$ such that $\varphi(z)<\infty$ (because $\varphi$ is then continuous and increasing on $[0,z]$ ), the condition (3.3) is implied by

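$$\varphi\bigl(|x-y|\bigr)\leq 1+\bigl|N(|x|)\operatorname*{sgn}(x)-N(|y|)\operatorname*{sgn}(y)\bigr|\qquad\text{for }x,y\in A.\tag{3.5}$$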

In the next step we check that this is indeed satisfied.

Step 6 (checking the condition). Let $x_{0}=\inf\{x\geq 1:2x-1=\Lambda_{X}^{*}(\frac{x}{2\beta_{1}})\}$ (if $x_{0}=\infty$ we simply do not have to consider Case 2 below). We consider three cases. We repeatedly use the fact that $uN(t)\geq N(ut)$ for $u\leq 1$ , $t\geq 0$ , which follows by the convexity of $N$ and the property $N(0)=0$ .
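The inequality $uN(t)\geq N(ut)$ is a one-line consequence of convexity. It is written out below as an added illustration for $0\leq u\leq 1$, together with the particular instance that gives a linear lower bound on $N$ beyond $1/2$.

```latex
% Convexity of N between the points 0 and t, with N(0) = 0.
\[
N(ut)=N\bigl(ut+(1-u)\cdot 0\bigr)\leq uN(t)+(1-u)N(0)=uN(t).
\]
% Instance: x >= 1/2, t = x, u = 1/(2x) <= 1.
\[
N(1/2)\leq\frac{1}{2x}\,N(x)
\quad\Longleftrightarrow\quad
N(x)\geq 2x\,N(1/2).
\]
```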

Case 1. $|x-y|\leq 1$ . Then $\varphi\bigl{(}|x-y|\bigr{)}=(x-y)^{2}\leq 1$ , so (3.5) is trivially satisfied.

Case 2. $|x-y|\geq x_{0}$ . Then $\varphi\bigl{(}|x-y|\bigr{)}=\Lambda_{X}^{*}(\frac{1}{2\beta_{1}}|x-y|)\leq\Lambda^{*}_{X}(|x-y|/2)$ . Inequality (3.4) implies that in order to prove (3.5) it suffices to show that if $x$ , $y$ are of the same sign, say $x,y\geq 0$ , then $N\bigl{(}|x-y|/2)\leq|N(x)-N(y)|$ and if $x,y$ have different signs, we have $N\bigl{(}\bigl{(}|x|+|y|\bigr{)}/2\bigr{)}\leq N(|x|)+N(|y|)$ .

By the convexity of $N$ , for $s,t\geq 0$ we have

$$N\bigl((s+t)/2\bigr)\leq\frac{1}{2}N(s)+\frac{1}{2}N(t)\leq N(s)+N(t)$$

and

$$N(s/2)+N(t)\leq N(s)+N(t)\leq\frac{s}{s+t}N(s+t)+\frac{t}{s+t}N(s+t)=N(s+t).$$

This finishes the proof of (3.5) in Case 2.
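For completeness, here is how the pieces of Case 2 fit together. This is an added sketch: it uses the bound $N(t)\geq\Lambda_{X}^{*}(t)-\ln 2$ for $t\geq 0$, the fact that $\ln 2<1$, and the two convexity inequalities $N((s+t)/2)\leq N(s)+N(t)$ and $N(s/2)+N(t)\leq N(s+t)$ for $s,t\geq 0$.

```latex
% Same sign, x >= y >= 0: take s = x - y and t = y in N(s/2) + N(t) <= N(s+t).
\[
\varphi(|x-y|)\leq\Lambda_{X}^{*}\bigl(\tfrac{x-y}{2}\bigr)
\leq N\bigl(\tfrac{x-y}{2}\bigr)+\ln 2
\leq 1+N(x)-N(y).
\]
% Different signs: |x - y| = |x| + |y|; take s = |x| and t = |y|
% in N((s+t)/2) <= N(s) + N(t).
\[
\varphi(|x-y|)\leq N\bigl(\tfrac{|x|+|y|}{2}\bigr)+\ln 2
\leq 1+N(|x|)+N(|y|)
=1+\bigl|N(|x|)\operatorname{sgn}(x)-N(|y|)\operatorname{sgn}(y)\bigr| .
\]
```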

Case 3. $1\leq|x-y|\leq x_{0}$ . Then $\varphi\bigl{(}|x-y|\bigr{)}=2|x-y|-1$ . Consider two sub-cases:

(i) $x,y$ have different signs. Without loss of generality we may assume $x\geq|y|\geq 0\geq y$. Thus in order to obtain (3.5) it suffices to show that $N(x)\geq 2x+2|y|$. Note that $1\leq x+|y|\leq 2x$, so $x\geq\frac{1}{2}$. Thus

$$N(x)\geq N(1/2)\,2x\overset{(3.1)}{\geq}4x\geq 2x+2|y|,$$

which finishes the proof in case (i).

(ii) $x,y$ have the same sign. Without loss of generality we may assume $x\geq y\geq 0$. Thus it suffices to show that $2(x-y)\leq N(x)-N(y)$. Note that due to the assumption of Case 3 we have $x\geq x-y\geq 1\geq\frac{1}{2}$, so by the convexity of $N$ we have

$$\frac{N(x)-N(y)}{x-y}\geq\frac{N(\frac{1}{2})-N(0)}{\frac{1}{2}-0}\overset{(3.1)}{\geq}4\geq 2.$$

This ends the examination of case (ii) and the proof of the theorem. ∎

4. Comparison of weak and strong moments

The goal of this section is to establish the comparison of weak and strong moments with respect to any norm $\|\cdot\|$ for random vectors $X$ with independent coordinates having log-concave tails (Corollary 2.5). In view of Theorem 2.1 and Remark 2.3, it is enough to show Theorem 2.4.

Our proof of Theorem 2.4 comprises three steps: first we exploit $\alpha$ -regularity of moments of $X$ to control the size of its cumulant-generating function $\Lambda_{X}$ , second we bound the infimum convolution of the optimal cost function with the convex test function being the norm $\|\cdot\|$ properly rescaled, and finally by the property convex $\text{IC}(\beta)$ we obtain exponential tail bounds which integrated out give the desired moment inequality.

We start with two lemmas corresponding to the first two steps described above and then we put everything together.

Lemma 4.1.

Let $p\geq 2$ and suppose that the moments of a random vector $X$ in $\mathbb{R}^{n}$ grow $\alpha$ -regularly. If for a vector $u\in\mathbb{R}^{n}$ we have $\|\langle u,X\rangle\|_{p}\leq 1$ , then

$$\Lambda_{X}\bigl((2e\alpha)^{-1}pu\bigr)\leq p.$$

Proof.

Let $k_{0}$ be the smallest integer larger than $p$ . If $\alpha e\|\langle u,X\rangle\|_{p}\leq 1/2$ , then by $\alpha$ -regularity we have

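$$\begin{aligned}\Lambda_{X}(pu)&\leq\ln\Bigl(\sum_{0\leq k\leq p}\frac{p^{k}\|\langle u,X\rangle\|_{p}^{k}}{k!}+\sum_{k>p}\bigl(\alpha e\|\langle u,X\rangle\|_{p}\bigr)^{k}\Bigr)\\ &\leq\ln\Bigl(\sum_{0\leq k\leq p}\frac{p^{k}\|\langle u,X\rangle\|_{p}^{k}}{k!}+2\bigl(\alpha e\|\langle u,X\rangle\|_{p}\bigr)^{k_{0}}\Bigr)\\ &\leq\ln\Bigl(\sum_{0\leq k\leq p}\frac{p^{k}\|\langle u,X\rangle\|_{p}^{k}}{k!}+\frac{(2\alpha ep\|\langle u,X\rangle\|_{p})^{k_{0}}}{k_{0}!}\Bigr)\\ &\leq\ln\Bigl(\sum_{0\leq k\leq k_{0}}\frac{(2\alpha ep\|\langle u,X\rangle\|_{p})^{k}}{k!}\Bigr)\leq 2\alpha ep\|\langle u,X\rangle\|_{p}\leq p.\end{aligned}$$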

Replace $u$ with $(2e\alpha)^{-1}u$ to get the assertion. ∎
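The first two estimates in this proof come from bounding the exponential series term by term. The sketch below is an added illustration; it writes $Y=\langle u,X\rangle$ and $r=\alpha e\|Y\|_{p}$ (both abbreviations are introduced here) and uses the elementary bound $k^{k}\leq e^{k}k!$.

```latex
% Terms with k <= p: monotonicity of L^k norms.
\[
\frac{p^{k}\,\mathbb{E}|Y|^{k}}{k!}\leq\frac{p^{k}\|Y\|_{p}^{k}}{k!}.
\]
% Terms with k > p: \alpha-regularity gives ||Y||_k <= \alpha (k/p) ||Y||_p.
\[
\frac{p^{k}\|Y\|_{k}^{k}}{k!}
\leq\frac{(\alpha k\|Y\|_{p})^{k}}{k!}
\leq(\alpha e\|Y\|_{p})^{k}=r^{k}.
\]
% Geometric tail when r <= 1/2, with k_0 the smallest integer larger than p.
\[
\sum_{k>p}r^{k}=\frac{r^{k_{0}}}{1-r}\leq 2r^{k_{0}} .
\]
```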

Lemma 4.2.

Let $\|\cdot\|$ be a norm on $\mathbb{R}^{n}$ and let $X$ be a random vector with values in $\mathbb{R}^{n}$ and moments growing $\alpha$ -regularly. For $\beta>0$ , $p\geq 2$ , and $x\in\mathbb{R}^{n}$ we have

$$\bigl(\Lambda_{X}^{*}(\cdot/\beta)\,\square\,a\|\cdot\|\bigr)(x)\geq a\|x\|-p,$$

where $a=p(2e\alpha\beta\sigma_{\|\cdot\|,X}(p))^{-1}$ .

Proof.

For $f(x)=a\|x\|$ with positive $a$ being arbitrary for now we bound the infimum convolution as follows

[TABLE]

where in the last inequality we have used Lemma 4.1. Choose $u=\sigma_{\|\cdot\|,X}(p)^{-1}v$ with $\|v\|_{*}\leq 1$ such that $\langle y,v\rangle=\|y\|$ . Then clearly $\|\langle u,X\rangle\|_{p}\leq 1$ and thus

[TABLE]

If we now set $a=p(2e\alpha\beta\sigma_{\|\cdot\|,X}(p))^{-1}$ , then by the triangle inequality we obtain the desired lower bound

[TABLE]

Proof of Theorem 2.4.

Let $f(x)=a\|x\|$ with $a=p(2e\alpha\beta\sigma_{\|\cdot\|,X}(p))^{-1}$ as in Lemma 4.2. Testing the property convex $\text{IC}(\beta)$ with $f$ and applying Lemma 4.2 yields

[TABLE]

By Jensen’s inequality we obtain that both $\operatorname*{\mathbb{E}}e^{a(\|X\|-\operatorname*{\mathbb{E}}\|X\|)}$ and $\operatorname*{\mathbb{E}}e^{a(-\|X\|+\operatorname*{\mathbb{E}}\|X\|)}$ are bounded above by $e^{p}$ . Thus Markov’s inequality implies the tail bound

[TABLE]

Consequently,

[TABLE]

Plugging in the value of $a$ gives the result (we can take $C=4\sqrt{2}e<16$ ). ∎

5. An example

Let $X$ be a symmetric random variable defined by $\mathbb{P}(|X|>t)=T(t)$ , where

[TABLE]

or, in other words, let $|X|$ have the distribution

[TABLE]

Let us first show that the moments of $X$ grow $3$ -regularly, but $X$ does not satisfy $\text{IC}(\beta)$ for any $\beta<\infty$ (we also prove a slightly stronger statement later).

Let $Y$ be a symmetric exponential random variable. Then $Y$ has log-concave tails, so the moments of $Y$ grow $1$ -regularly (see Remark 2.3). Moreover, if $X$ and $Y$ are constructed in the standard way by the inverses of their CDFs on the probability space $(0,1)$ , then

[TABLE]

Therefore, for $p\geq q\geq 2$ ,

[TABLE]

(we used the fact that $|X|\geq 2$ in the last inequality). Thus the moments of $X$ grow $3$ -regularly.

On the other hand, for every $h>0$ there exists $t>0$ such that

[TABLE]

Therefore by [2, Theorem 1] there does not exist a constant $C$ such that the pair $(X,\varphi(\cdot/C))$ , where $\varphi(x)=\tfrac{1}{2}x^{2}1_{{\{|x|\leq 1\}}}+(|x|-1/2)1_{{\{|x|>1\}}}$ , satisfies the convex infimum convolution inequality. But, by symmetry and the $3$ -regularity of moments of $X$ ,

[TABLE]

Thus for some $A,\varepsilon>0$ we have $\Lambda_{X}(s)\leq As^{2}$ for $|s|\leq\varepsilon$ and $2A\varepsilon^{2}\geq 1$ . Hence

[TABLE]

We conclude that $X$ cannot satisfy $\text{IC}(\beta)$ for any $\beta$ .

Remark 5.1.

Let us also sketch an alternative approach. Take $a,c>0$ , $b\in\mathbb{R}$ , and denote $\varphi(x)=\min\{x^{2},|x|\}$ , $f(x)=f_{a,b}(x)=a(x-b)_{+}$ for $x\in\mathbb{R}$ . One can check that

[TABLE]

if $a>2c$ . It is rather elementary but cumbersome to show that for any $c>0$ there exist $a>0$ and $b\in\mathbb{R}$ such that (1.1) is violated by the test function $f$ . We omit the details.

In fact, the above example shows that even a slightly stronger statement is true: for vectors with independent coordinates with $\alpha$ -regular growth of moments the comparison of weak and strong moments of norms does not hold with the constant $1$ at the first strong moment. More precisely, let $X_{1},X_{2},\ldots$ be independent random variables with distribution given by (5.1). We claim that there does not exist any $K<\infty$ such that

[TABLE]

holds for every $p\geq 2$ and $n\in\mathbb{N}$ (note that we chose the $\ell^{\infty}$ -norm as our norm). We shall estimate the three expressions appearing in (5.2).

We have

[TABLE]

(this inequality is in fact an equality). Since the moments of $X_{1}$ grow $3$ -regularly, the last term in (5.2) is bounded by $\widetilde{K}p$ for some $\widetilde{K}<\infty$ .

To estimate the remaining two terms we need the following standard fact.

Lemma 5.2.

For independent events $A_{1},\ldots,A_{n}$ ,

[TABLE]

In particular, for i.i.d. non-negative random variables $Y_{1},\ldots,Y_{n}$ ,

[TABLE]

Proof.

The upper bound is just the union bound. The lower bound follows from de Morgan’s laws combined with independence and the inequalities $1-x\leq e^{-x}$ and $1-e^{-y}\geq(1-e^{-1})y$ for $x\in\mathbb{R}$ , $y\in[0,1]$ . ∎
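The argument just described can be written out as follows. This is an added sketch: the exact form of the lemma's display is not reproduced here, so the two-sided bound in terms of $\min\{1,s\}$ with $s=\sum_{i}\mathbb{P}(A_{i})$ is an assumption consistent with the inequalities named in the proof.

```latex
% Upper bound: union bound (and probabilities never exceed 1).
\[
\mathbb{P}\Bigl(\bigcup_{i=1}^{n}A_{i}\Bigr)\leq\min\{1,s\},
\qquad s:=\sum_{i=1}^{n}\mathbb{P}(A_{i}).
\]
% Lower bound: de Morgan's laws and independence, then 1 - x <= e^{-x},
% then 1 - e^{-y} >= (1 - e^{-1}) y with y = min{1, s} in [0,1].
\[
\mathbb{P}\Bigl(\bigcup_{i=1}^{n}A_{i}\Bigr)
=1-\prod_{i=1}^{n}\bigl(1-\mathbb{P}(A_{i})\bigr)
\geq 1-e^{-s}
\geq 1-e^{-\min\{1,s\}}
\geq(1-e^{-1})\min\{1,s\}.
\]
```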

Fix $m\geq 2$ and let $e^{2^{m-1}}\leq n<e^{2^{m}}$ . Then

[TABLE]

By the above lemma,

[TABLE]

Set $\theta=\theta(m,n)=ne^{-2^{m}}\in[e^{-2^{m-1}},1)$ . Then

[TABLE]

Similarly,

[TABLE]

Hence

[TABLE]

Putting (5.3), (5.4), and (5.5) together, we see that (5.2) would imply

[TABLE]

for every $p\geq 2$ , $m\geq 2$ , and $\theta\in[e^{-2^{m-1}},1)$ of the form $ne^{-2^{m}}$ , $n\in\mathbb{N}$ . Take $p=1/\theta$ and $\theta\sim 1/m$ to get

[TABLE]

Since $\theta\to 0$ and $2^{m}\theta\to\infty$ as $m\to\infty$ this inequality yields $2\leq 1$ , which is a contradiction. Hence inequality (5.2) cannot hold for all $p\geq 2$ and $n\in\mathbb{N}$ .

Acknowledgments

We thank Radosław Adamczak and Rafał Latała for posing questions which led to the results presented in this note.

Bibliography

[1] R. Adamczak, R. Latała, A. E. Litvak, K. Oleszkiewicz, A. Pajor, and N. Tomczak-Jaegermann, A short proof of Paouris’ inequality, Canad. Math. Bull. 57 (2014), no. 1, 3–8. MR 3150710
[2] N. Feldheim, A. Marsiglietti, P. Nayar, and J. Wang, A note on the convex infimum convolution inequality, to appear in Bernoulli, preprint (2015), arXiv:1505.00240.
[3] N. Gozlan, C. Roberto, P.M. Samson, Y. Shu, and P. Tetali, Characterization of a class of weak transport-entropy inequalities on the line, to appear in Ann. Inst. Henri Poincaré Probab. Stat., preprint (2015), arXiv:1509.04202v2.
[4] N. Gozlan, C. Roberto, P.M. Samson, and P. Tetali, Kantorovich duality for general transport costs and applications, to appear in J. Funct. Anal., preprint (2014), arXiv:1412.7480v4.
[5] O. Guédon, P. Nayar, and T. Tkocz, Concentration inequalities and geometry of convex bodies, Analytical and probabilistic methods in the geometry of convex bodies, IMPAN Lect. Notes, vol. 2, Polish Acad. Sci. Inst. Math., Warsaw, 2014, pp. 9–86. MR 3329056
[6] R. Latała, On some problems concerning log-concave random vectors, to appear in IMA Volume “Discrete Structures: Analysis and Applications”, Springer.
[7] R. Latała and M. Strzelecka, Comparison of weak and strong moments for vectors with independent coordinates, preprint (2016), arXiv:1612.02407v1.
[8] R. Latała and M. Strzelecka, Weak and strong moments of $\ell_{r}$-norms of log-concave vectors, Proc. Amer. Math. Soc. 144 (2016), no. 8, 3597–3608. MR 3503729
