Transport Proofs Of Some Discrete Variants Of The Pr{\'e}Kopa-leindler   Inequality

Nathael Gozlan (MAP5 - UMR 8145); Cyril Roberto (LAMA); Paul-Marie; Samson (LAMA); Prasad Tetali (School of Mathematics)

arXiv:1905.04038·math.PR·May 13, 2019

Transport Proofs Of Some Discrete Variants Of The Pr{\'e}Kopa-leindler Inequality

Nathael Gozlan (MAP5 - UMR 8145), Cyril Roberto (LAMA), Paul-Marie, Samson (LAMA), Prasad Tetali (School of Mathematics)

PDF

Open Access

TL;DR

This paper provides a transport-based proof of discrete displacement convexity of entropy on integers, leading to new discrete forms of the Prékopa-Leindler inequality, including the Four Functions Theorem and recent results on Z.

Contribution

It introduces a novel transport proof for discrete displacement convexity, deriving new discrete inequalities from continuous analogs.

Findings

01

Established a transport proof for discrete displacement convexity on integers

02

Derived two discrete forms of the Prékopa-Leindler inequality

03

Connected continuous and discrete convexity results

Abstract

We give a transport proof of a discrete version of the displacement convexity of entropy on integers (Z), and get, as a consequence, two discrete forms of the Pr{\'e}kopa-Leindler Inequality : the Four Functions Theorem of Ahlswede and Daykin on the discrete hypercube [1] and a recent result on Z due to Klartag and Lehec [16].

Equations219

f (x)^{1 - t} g (y)^{t} \leq h ((1 - t) x + t y), \forall x, y \in R^{n} .

f (x)^{1 - t} g (y)^{t} \leq h ((1 - t) x + t y), \forall x, y \in R^{n} .

(\int_{R^{n}} f (x) d x)^{1 - t} (\int_{R^{n}} g (y) d y)^{t} \leq \int_{R^{n}} h (z) d z .

(\int_{R^{n}} f (x) d x)^{1 - t} (\int_{R^{n}} g (y) d y)^{t} \leq \int_{R^{n}} h (z) d z .

Vol ((1 - t) A + tB) \geq Vol (A)^{1 - t} Vol (B)^{t},

Vol ((1 - t) A + tB) \geq Vol (A)^{1 - t} Vol (B)^{t},

x \land y := (min (x_{1}, y_{1}), \dots, min (x_{n}, y_{n})) \mbox an d x \lor y := (max (x_{1}, y_{1}), \dots, max (x_{n}, y_{n})) .

x \land y := (min (x_{1}, y_{1}), \dots, min (x_{n}, y_{n})) \mbox an d x \lor y := (max (x_{1}, y_{1}), \dots, max (x_{n}, y_{n})) .

f (x) g (y) \leq h (x \land y) k (x \lor y), \forall x, y \in Ω_{n},

f (x) g (y) \leq h (x \land y) k (x \lor y), \forall x, y \in Ω_{n},

x \in Ω_{n} \sum f (x) x \in Ω_{n} \sum g (x) \leq x \in Ω_{n} \sum h (x) x \in Ω_{n} \sum k (x) .

x \in Ω_{n} \sum f (x) x \in Ω_{n} \sum g (x) \leq x \in Ω_{n} \sum h (x) x \in Ω_{n} \sum k (x) .

f (x) g (y) \leq h (⌊ \frac{x + y}{2} ⌋) k (⌈ \frac{x + y}{2} ⌉), \forall x, y \in Z .

f (x) g (y) \leq h (⌊ \frac{x + y}{2} ⌋) k (⌈ \frac{x + y}{2} ⌉), \forall x, y \in Z .

(x \in Z \sum f (x)) y \in Z \sum g (y) \leq (x \in Z \sum h (x)) y \in Z \sum k (y) .

(x \in Z \sum f (x)) y \in Z \sum g (y) \leq (x \in Z \sum h (x)) y \in Z \sum k (y) .

f (x) = g (T (x)) T^{'} (x), \forall x \in R .

f (x) = g (T (x)) T^{'} (x), \forall x \in R .

\int_{R} h (z) d z

\int_{R} h (z) d z

\geq \int_{R} f (x)^{1 - t} g (T (x))^{t} T^{'} (x)^{t} d x

= \int_{R} f (x)^{1 - t} f (x)^{t} d x = 1,

h^{a} (x) = h (x, a), \forall x \in Ω_{n - 1} .

h^{a} (x) = h (x, a), \forall x \in Ω_{n - 1} .

h_{1} (x) + h_{2} (y) \leq h_{3} (x \land y) + h_{4} (x \lor y), \forall x, y \in Ω_{n} .

h_{1} (x) + h_{2} (y) \leq h_{3} (x \land y) + h_{4} (x \lor y), \forall x, y \in Ω_{n} .

x \in Ω_{n} \sum e^{h_{1} (x)} x \in Ω_{n} \sum e^{h_{2} (x)} \leq x \in Ω_{n} \sum e^{h_{3} (x)} x \in Ω_{n} \sum e^{h_{3} (x)} .

x \in Ω_{n} \sum e^{h_{1} (x)} x \in Ω_{n} \sum e^{h_{2} (x)} \leq x \in Ω_{n} \sum e^{h_{3} (x)} x \in Ω_{n} \sum e^{h_{3} (x)} .

H (ν ∣ m_{n}) = \int lo g (\frac{d ν}{d m _{n}}) d ν .

H (ν ∣ m_{n}) = \int lo g (\frac{d ν}{d m _{n}}) d ν .

lo g \int e^{f} d m_{n} = ν \in P (Ω_{n}) sup {\int f d ν - H (ν ∣ m_{n})} .

lo g \int e^{f} d m_{n} = ν \in P (Ω_{n}) sup {\int f d ν - H (ν ∣ m_{n})} .

π (A \times E) = ν_{1} (A) and π (E \times B) = ν_{2} (B)

π (A \times E) = ν_{1} (A) and π (E \times B) = ν_{2} (B)

\begin{tabular}[]{|c||c|c||c|}\hline\cr\diagbox[dir={NW}]{{\shortstack[l]{$x$}}}{{\shortstack[r]{$y$}}}{}&0&1&\\ \hline\cr$0$&$\pi(0,0)$&$\pi(0,1)$&$\nu_{1}(0)$\\ \hline\cr$1$&$\pi(1,0)$&$\pi(1,1)$&$\nu_{1}(1)$\\ \hline\cr&$\nu_{2}(0)$&$\nu_{2}(1)$&\\ \hline\cr\end{tabular}\quad\stackrel{{\scriptstyle S}}{{\longrightarrow}}\quad\begin{tabular}[]{|c||c|c||c|}\hline\cr\diagbox[dir={NW}]{{\shortstack[l]{$x\wedge y$}}}{{\shortstack[r]{$x\vee y$}}}{}&0&1&\\ \hline\cr$0$&$\pi(0,0)$&$\pi(0,1)+\pi(1,0)$&$\nu_{1}(0)$\\ \hline\cr$1$&$0$&$\pi(1,1)$&$\nu_{1}(1)$\\ \hline\cr&$\nu_{2}(0)$&$\nu_{2}(1)$&\\ \hline\cr\end{tabular}

\begin{tabular}[]{|c||c|c||c|}\hline\cr\diagbox[dir={NW}]{{\shortstack[l]{$x$}}}{{\shortstack[r]{$y$}}}{}&0&1&\\ \hline\cr$0$&$\pi(0,0)$&$\pi(0,1)$&$\nu_{1}(0)$\\ \hline\cr$1$&$\pi(1,0)$&$\pi(1,1)$&$\nu_{1}(1)$\\ \hline\cr&$\nu_{2}(0)$&$\nu_{2}(1)$&\\ \hline\cr\end{tabular}\quad\stackrel{{\scriptstyle S}}{{\longrightarrow}}\quad\begin{tabular}[]{|c||c|c||c|}\hline\cr\diagbox[dir={NW}]{{\shortstack[l]{$x\wedge y$}}}{{\shortstack[r]{$x\vee y$}}}{}&0&1&\\ \hline\cr$0$&$\pi(0,0)$&$\pi(0,1)+\pi(1,0)$&$\nu_{1}(0)$\\ \hline\cr$1$&$0$&$\pi(1,1)$&$\nu_{1}(1)$\\ \hline\cr&$\nu_{2}(0)$&$\nu_{2}(1)$&\\ \hline\cr\end{tabular}

h_{1} (x) + h_{2} (y) \leq h_{3} (x \land y) + h_{4} (x \lor y), \forall x, y \in Ω_{n} .

h_{1} (x) + h_{2} (y) \leq h_{3} (x \land y) + h_{4} (x \lor y), \forall x, y \in Ω_{n} .

h_{1}^{a} (x^{'}) + h_{2}^{b} (y^{'}) \leq h_{3}^{a \land b} (x^{'} \land y^{'}) + h_{4}^{a \lor b} (x^{'} \lor y^{'}), \forall x^{'}, y^{'} \in Ω_{n - 1}

h_{1}^{a} (x^{'}) + h_{2}^{b} (y^{'}) \leq h_{3}^{a \land b} (x^{'} \land y^{'}) + h_{4}^{a \lor b} (x^{'} \lor y^{'}), \forall x^{'}, y^{'} \in Ω_{n - 1}

lo g x \in Ω_{n - 1} \sum e^{h_{1}^{a} (x)} + lo g x \in Ω_{n - 1} \sum e^{h_{2}^{b} (x)} \leq lo g x \in Ω_{n - 1} \sum e^{h_{3}^{a \land b} (x)} + lo g x \in Ω_{n - 1} \sum e^{h_{4}^{a \lor b} (x)} .

lo g x \in Ω_{n - 1} \sum e^{h_{1}^{a} (x)} + lo g x \in Ω_{n - 1} \sum e^{h_{2}^{b} (x)} \leq lo g x \in Ω_{n - 1} \sum e^{h_{3}^{a \land b} (x)} + lo g x \in Ω_{n - 1} \sum e^{h_{4}^{a \lor b} (x)} .

H_{1} (a) + H_{2} (b) \leq H_{3} (a \land b) + H_{4} (a \lor b) \forall a, b \in Ω_{1} .

H_{1} (a) + H_{2} (b) \leq H_{3} (a \land b) + H_{4} (a \lor b) \forall a, b \in Ω_{1} .

lo g (x \in Ω_{1} \sum e^{H_{1} (x)}) + lo g (x \in Ω_{1} \sum e^{H_{2} (x)}) \leq lo g (x \in Ω_{1} \sum e^{H_{3} (x)}) + lo g (x \in Ω_{1} \sum e^{H_{4} (x)}) .

lo g (x \in Ω_{1} \sum e^{H_{1} (x)}) + lo g (x \in Ω_{1} \sum e^{H_{2} (x)}) \leq lo g (x \in Ω_{1} \sum e^{H_{3} (x)}) + lo g (x \in Ω_{1} \sum e^{H_{4} (x)}) .

(\int h_{1} d ν_{1} - H (ν_{1} ∣ m_{1})) + (\int h_{2} d ν_{2} - H (ν_{2} ∣ m_{1})) \leq lo g (x \in Ω_{1} \sum e^{h_{3} (x)}) + lo g (x \in Ω_{1} \sum e^{h_{4} (x)}) .

(\int h_{1} d ν_{1} - H (ν_{1} ∣ m_{1})) + (\int h_{2} d ν_{2} - H (ν_{2} ∣ m_{1})) \leq lo g (x \in Ω_{1} \sum e^{h_{3} (x)}) + lo g (x \in Ω_{1} \sum e^{h_{4} (x)}) .

\int h_{1} d ν_{1} + \int h_{2} d ν_{2}

\int h_{1} d ν_{1} + \int h_{2} d ν_{2}

= \int_{Ω_{1}^{2}} h_{3} (x) + h_{4} (y) d π (x, y) = \int h_{3} d ν_{1} + \int h_{4} d ν_{2} .

(\int h_{1} d ν_{1} - H (ν_{1} ∣ m_{1})) + (\int h_{2} d ν_{2} - H (ν_{2} ∣ m_{1})) \leq (\int h_{3} d ν_{1} - H (ν_{1} ∣ m_{1}))

(\int h_{1} d ν_{1} - H (ν_{1} ∣ m_{1})) + (\int h_{2} d ν_{2} - H (ν_{2} ∣ m_{1})) \leq (\int h_{3} d ν_{1} - H (ν_{1} ∣ m_{1}))

AAAAAAAAAAAA + (\int h_{4} d ν_{2} - H (ν_{2} ∣ m_{1})) \leq lo g (x \in Ω_{1} \sum e^{h_{3} (x)}) + lo g (x \in Ω_{1} \sum e^{h_{4} (x)}),

lo g (x \in Ω_{1} \sum e^{h_{1} (x)}) + lo g (x \in Ω_{1} \sum e^{h_{2} (x)}) \leq lo g (x \in Ω_{1} \sum e^{h_{3} (x)}) + lo g (x \in Ω_{1} \sum e^{h_{4} (x)}) .

lo g (x \in Ω_{1} \sum e^{h_{1} (x)}) + lo g (x \in Ω_{1} \sum e^{h_{2} (x)}) \leq lo g (x \in Ω_{1} \sum e^{h_{3} (x)}) + lo g (x \in Ω_{1} \sum e^{h_{4} (x)}) .

Φ (h) = ν \in P (Ω_{1}) sup {\int h d ν - Ψ (ν)}, h \in F (Ω_{1}),

Φ (h) = ν \in P (Ω_{1}) sup {\int h d ν - Ψ (ν)}, h \in F (Ω_{1}),

Φ^{n} (h) = Φ (a \mapsto Φ^{n - 1} (h^{a})), h \in F (Ω_{n}),

Φ^{n} (h) = Φ (a \mapsto Φ^{n - 1} (h^{a})), h \in F (Ω_{n}),

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGeometric Analysis and Curvature Flows · Mathematical Dynamics and Fractals · Spectral Theory in Mathematical Physics

Full text

Transport Proofs of some discrete variants of the Prékopa-Leindler inequality

Nathael Gozlan, Cyril Roberto, Paul-Marie Samson, Prasad Tetali

N. Gozlan : Université Paris Descartes, MAP5, UMR 8145, 45 rue des Saints Pères, 75270 Paris Cedex 06

P.-M. Samson : Université Paris-Est, Laboratoire d’Analyse et de Mathématiques Appliquées (UMR 8050), UPEM, UPEC, CNRS, F-77454, Marne-la-Vallée, France

C. Roberto : Université Paris Nanterre - Modal’X, 200 avenue de la République 92000 Nanterre, France

P. Tetali : School of Mathematics & School of Computer Science, Georgia Institute of Technology, Atlanta, GA 30332

[email protected], [email protected], [email protected],[email protected]

Abstract.

We give a transport proof of a discrete version of the displacement convexity of entropy on integers ( $\mathbb{Z}$ ), and get, as a consequence, two discrete forms of the Prékopa-Leindler Inequality : the Four Functions Theorem of Ahlswede and Daykin on the discrete hypercube [1] and a recent result on $\mathbb{Z}$ due to Klartag and Lehec [16].

Key words and phrases:

Prékopa-Leindler Inequality, Optimal Transport

1991 Mathematics Subject Classification:

60E15, 32F32 and 26D10

This research is partly funded by the Bézout Labex, funded by ANR, reference ANR-10-LABX-58 and the Labex MME-DII funded by ANR, reference ANR-11-LBX-0023-01. Research of P.T. is supported in part by the NSF grant DMS-1811935

Introduction

The aim of the paper is to develop a transport approach to some discrete versions of the Prékopa-Leindler Inequality [25, 26, 19], namely the Four Functions Theorem due to Ahlswede and Daykin [1] and a recent result of Klartag and Lehec [16] on $\mathbb{Z}$ . Both inequalities will be a consequence of the stronger displacement convexity of entropy on the set of integers. Before presenting these discrete functional inequalities, let us recall the original continuous statement inspiring them.

The classical Prékopa-Leindler Inequality is the following.

Theorem 1 (Prékopa-Leindler).

Suppose that $f,g,h:\mathbb{R}^{n}\to\mathbb{R}^{+}$ are measurable functions such that, for some $t\in(0,1)$ ,

[TABLE]

Then

[TABLE]

The Prékopa-Leindler Inequality is a functional version of the celebrated Brunn-Minkowski Inequality stating that for all Borel sets $A,B\subset\mathbb{R}^{n}$ and all $t\in(0,1)$ it holds

[TABLE]

where $\mathrm{Vol}(\,\cdot\,)$ denotes the Lebesgue measure on $\mathbb{R}^{n}.$ It is more generally intimately related to the study of log-concave measures which is of considerable importance in convex geometry, probability theory and statistics. In particular, many geometric and functional inequalities for uniformly log-concave probability measures can be derived from Theorem 1 (see in particular the paper [4] by Bobkov and Ledoux). We refer to [11] for a thorough presentation of the subject as well as for historical comments on Theorem 1.

The question of extending the Prékopa-Leindler inequality outside the flat space framework has been tackled by many authors in recent years and turned out to be extremely fruitful in Geometry, Analysis and Probability. A first step has been accomplished by Cordero-Erausquin, McCann and Schmuckenschläger in [6, 7], who obtained extensions of the Prékopa-Leindler inequality on Riemannian manifolds with a lower bounded Ricci curvature. Their extension is closely related to displacement convexity properties of entropic functionals, first introduced by McCann in [21] in the flat space framework, and then extended to Riemannian manifolds by Otto and Villani [24] and von Renesse and Sturm [31]. This displacement convexity formulation is actually equivalent to lower bounds on the Ricci curvature and led to the Lott-Sturm-Villani [20, 27, 28] definition of metric measure spaces with lower bounded Ricci curvature which makes sense even in a non-smooth framework.

In a similar vein, it would also be satisfactory to extend the Prékopa-Leindler inequality to discrete frameworks such as graphs (which are not covered by the Lott-Sturm-Villani theory). Several general definitions of discrete spaces with lower bounded curvature were recently proposed, in particular by Bonciocat and Sturm [5], Ollivier [22], Ollivier and Villani [23], Erbar and Maas [9], Hillion [15] or the authors [13]. While these different definitions are all efficient at the level of functional inequalities and are satisfied by a large collection of classical graphs, none of them really succeeds in leading to a satisfactory Prékopa-Leindler or Brunn-Minkowski inequality on those spaces.

However, for at least two specific discrete spaces, convincing Prékopa-Leindler type inequalities already exist.

The first one, is the celebrated Four Functions Theorem on the discrete hypercube $\{0,1\}^{n}$ by Ahlswede and Daykin [1]. To recall its statement, we will need the following notation. The discrete hypercube will be denoted by $\Omega_{n}:=\{0,1\}^{n}$ and for all $x=(x_{1},\dots,x_{n}),\ y=(y_{1},\dots,y_{n})\in\Omega_{n}$ , one defines

[TABLE]

Theorem 2 (Ahlswede-Daykin).

Suppose that $f,g,h,k\colon\Omega_{n}\to\mathbb{R}^{+}$ are such that

[TABLE]

then

[TABLE]

Note that this result mimics the statement of Theorem 1 for $t=1/2$ on $\Omega_{n}$ . Theorem 2 has important implications in terms of correlation inequalities, as it gives back in particular the classical FKG inequality which has a lot of applications in percolation and statistical mechanics [10].

The second discrete form of the Prékopa-Leindler Inequality we will consider is a recent one due to Klartag and Lehec [16], and holds on the space $\mathbb{Z}$ of integers. Denote by $\lceil\cdot\rceil$ and $\lfloor\cdot\rfloor$ the ceiling and floor functions respectively.

Theorem 3.

Suppose that $f,g,h,k:\mathbb{Z}\to\mathbb{R}^{+}$ are such that

[TABLE]

Then

[TABLE]

As we will see in Section 2, Theorem 3 implies Theorem 2 for $n=1$ (which then gives the full conclusion by induction, see the proof of Theorem 2 in Section 1). Moreover Theorem 3 implies back Theorem 1 for $t=1/2$ (and thus for all other values of $t$ ). The proof given by Klartag and Lehec in [16] relies on rather sophisticated tools of stochastic analysis on the Poisson space and in particular on a stochastic representation formula for the relative entropy functional with respect to the Poisson distribution on the (non-negative) integers.

As already stated above, the main objective of the present paper is to recover Theorems 2 and 3 by means of optimal transport tools. In the continuous setting, optimal transport is indeed a very efficient way to establish functional inequalities (see [29, 30] and the references therein) and it is a challenging question to see how these powerful techniques can be adapted to the discrete world. To make this introduction more self-contained and to illustrate the difficulties in dealing with discrete structures, let us briefly recall a classical transport proof of Theorem 1 in dimension $1$ .

Proof of Theorem 1 for $d=1$ .

Without loss of generality, one can assume that $\int_{\mathbb{R}}f(x)\,dx=\int_{\mathbb{R}}g(y)\,dy=1$ , with $f$ and $g$ two positive and continuous functions. Defining $\mu(dx)=f(x)\,dx$ , $\nu(dy)=g(y)\,dy$ , a natural transport map between the probability measures $\mu$ and $\nu$ is given by $T(x)=F_{\nu}^{-1}\circ F_{\mu}(x)$ , where $F_{\mu}(x)=\int_{-\infty}^{x}f(u)\,du$ , $x\in\mathbb{R}$ , and $F_{\nu}(y)=\int_{-\infty}^{y}g(v)\,dv$ , $y\in\mathbb{R}$ , are the cumulative distribution functions of $\mu$ and $\nu$ . The change of variable formula immediately gives the following relation between $f$ and $g$ :

[TABLE]

Plugging $y=T(x)$ into (1) one gets by change of variables ( $z=(1-t)x+tT(x)$ , note that $T$ is increasing by construction)

[TABLE]

where the inequality comes from (1) and the arithmetic-geometric inequality $(1-t)a+tb\geq a^{1-t}b^{t}$ , $a,b\geq 0$ , $t\in[0,1]$ (appplied to $a=1$ and $b=T^{\prime}(x)$ ), while the last equality comes from (3). ∎

The proof for $n\geq 2$ is done by induction (see e.g the proof of [18, Theorem 2.13]). It is also possible to prove this result directly in dimension $n$ , by using the Brenier or the Knothe transport maps and the Monge-Ampère equation. See [29, Chapter 6] for details. Note that the use of coupling arguments for establishing Brunn-Minkowski type inequalities goes back at least to Knothe [17].

Analyzing the proof above immediately reveals two obvious obstacles that prevent to export it easily to the discrete setting:

(1)

Transport maps between probability measures $\mu$ and $\nu$ usually do not exist when the space is discrete and one often needs to cut the mass of atoms of the source measure $\mu$ to reconstruct the target measure $\nu$ ; 2. (2)

Even if there is a transport map $T$ sending $\mu$ on $\nu$ , there is no Jacobian equation such as (3).

In the case of Theorem 2 and 3, it turns out that these difficulties can be circumvented. It would be useless at this point to state general rules, however it seems at least that in both situations choosing $t=1/2$ helps a lot by introducing symmetry and compensations to overcome the lack of Jacobian equation.

In fact, we will go beyond Theorem 2 and 3 by proving, by transport arguments, a stronger statement: namely an entropic version of the Prékopa-Leindler Inequality (that we may also call displacement convexity of entropy), see Theorem 8 for a precise statement. In that sense, since such an entropic statement implies the Klartag-Lehec version of the Prékopa-Leindler Inequality on $\mathbb{Z}$ , which in turn, at the price of an obvious induction step, implies the Four Functions theorem, all results appear to be the consequence of one single (transport) proof. Moreover, our displacement convexity result on the integers, as the mesh size goes to 0, converges to the classical displacement convexity of entropy on the line (for $t=1/2$ ), obtained by McCann in [21] which shows the compatibility of our results to the well-known equivalent statement in the continuous.

The paper is organized as follows.

In Section 1, we give a simple proof of Theorem 2, based on the construction of an explicit coupling in dimension $n=1$ and on the dual formulation of the relative entropy functional. As already explained, Theorem 2 can also be seen as a consequence of Theorem 3. However, the proof is very simple and it seemed to us that it nicely illustrates the power of the transport techniques in discrete and therefore it is worth a separate presentation. Then we show how to recover a significant part of the classical Prékopa-Leindler inequality from Theorem 2, passing from the discrete to the continuous by means of the Central Limit Theorem.

In Section 2, we prove a stronger entropic version of Theorem 3, namely Theorem 8, based on the one-dimensional monotone rearrangement coupling. We also show how to fully recover the Prékopa-Leindler inequality starting from Theorem 3, again passing from discrete to continuous, but here using instead that the mesh size of the grid shrinks to [math].

Finally, Section 3 is devoted to curved versions of Theorem 3 applying to probability measures with a log-concave probability mass function.

1. The Four Functions theorem

1.1. A transport proof of the Four Functions Theorem

In the following, we prove the Four Functions Theorem using transport ingredients and a duality formula.

We will use the following notations. The set of all probability measures on $\Omega_{n}=\{0,1\}^{n}$ will be denoted by $\mathcal{P}(\Omega_{n})$ and the set of functions on $\Omega_{n}$ by $\mathcal{F}(\Omega_{n})$ . For all $a\in\Omega_{1}$ and $h\in\mathcal{F}(\Omega_{n})$ , the function $h^{a}:\Omega_{n-1}\to\mathbb{R}$ is defined by

[TABLE]

For convenience, we restate the Ahlswede-Daykin Theorem with an additive hypothesis (which corresponds to Theorem 2 with $f=e^{h_{1}}$ , $g=e^{h_{2}}$ , $h=e^{h_{3}}$ and $k=e^{h_{4}}$ ).

Theorem 4.

Let $n\geq 1$ . Suppose that $h_{1}$ , $h_{2}$ , $h_{3}$ , $h_{4}\colon\Omega_{n}\to\mathbb{R}$ are such that

[TABLE]

Then

[TABLE]

Recall the following duality formula involving the relative entropy functional. Let $m_{n}$ be the uniform measure on $\Omega_{n}$ and define for all probability measures $\nu$ on $\Omega_{n}$

[TABLE]

Then, for any function $f:\Omega_{n}\to\mathbb{R}$ , it holds

[TABLE]

In the proof of Theorem 4 we will also use the following coupling lemma whose proof is elementary. We recall that if $\nu_{1},\nu_{2}$ are two probability measures on a measurable space $(E,\mathcal{A})$ , a coupling of $\nu_{1}$ and $\nu_{2}$ (in that order) is a probability measure $\pi$ on the product space $E\times E$ having $\nu_{1}$ as first marginal and $\nu_{2}$ as second marginal, that is to say such that

[TABLE]

for all $A,B\in\mathcal{A}.$ Recall also that that if $\mu$ is a probability measure on $(E,\mathcal{A})$ and $S:E\to F$ a measurable map taking values in another measurable space $(F,\mathcal{B})$ , then the image of $\mu$ under the map $S$ (or push forward of $\mu$ under the map $S$ ) is the probability measure denoted by $S_{\#}\mu$ defined as $S_{\#}\mu(B)=\mu(S^{-1}(B))$ , $B\in\mathcal{B}$ .

Lemma 5.

Let $\nu_{1},\nu_{2}\in\mathcal{P}(\Omega_{1})$ and set $S\colon\Omega_{1}^{2}\ni(x,y)\mapsto(x\wedge y,x\vee y)$ .

$(i)$

if $\nu_{2}(0)\leq\nu_{1}(0)$ then there exists a (unique) coupling $\pi$ of $\nu_{1}$ and $\nu_{2}$ such that $\widetilde{\pi}:=S\sharp\pi$ is also a coupling of $\nu_{1}$ and $\nu_{2}$ . Moreover in this case $\pi=\widetilde{\pi}$ and $\pi(0,0)=\nu_{2}(0)$ , $\pi(1,0)=0$ , $\pi(0,1)=\nu_{1}(0)-\nu_{2}(0)$ and $\pi(1,1)=\nu_{1}(1)$ .

$(ii)$

if $\nu_{2}(0)\geq\nu_{1}(0)$ then there exists a (unique) coupling $\pi$ of $\nu_{1}$ and $\nu_{2}$ such that $\widetilde{\pi}=S\sharp\pi$ is a coupling of $\nu_{2},\nu_{1}$ . Moreover $\pi(0,0)=\widetilde{\pi}(0,0)=\nu_{1}(0)$ , $\pi(1,1)=\widetilde{\pi}(1,1)=\nu_{2}(1)$ , $\pi(0,1)=\widetilde{\pi}(1,0)=0$ and $\pi(1,0)=\widetilde{\pi}(0,1)=\nu_{2}(0)-\nu_{1}(0)$ .

Remark 6.

The coupling $\pi$ in $(i)$ (resp. $(ii)$ ) is nothing but the non-decreasing (non-increasing) rearrangement coupling.

The above lemma is very much one-dimensional. In fact, it is easy to construct examples of measures $\nu_{1},\nu_{2}\in\mathcal{P}(\Omega_{n})$ , for $n\geq 2$ , such that there does not exist any coupling $\pi$ of $\nu_{1}$ and $\nu_{2}$ such that $\widetilde{\pi}:=S\sharp\pi$ (with $S$ that acts coordinate by coordinate) is a coupling of $\nu_{1}$ and $\nu_{2}$ or a coupling of $\nu_{2}$ and $\nu_{1}$ .

Proof.

We will first prove Item $(i)$ . In the following diagram we represent the couplings $\pi$ on the left, and $\widetilde{\pi}$ on the right, with their marginals.

[TABLE]

Once one observes that necessarily $\widetilde{\pi}(1,0)=0$ (since there do not exist $x,y\in\Omega_{1}$ with $x\wedge y=1$ and $x\vee 1=0$ ), and $\widetilde{\pi}(0,0)=\pi(0,0)$ and $\widetilde{\pi}(1,1)=\pi(1,1)$ , then all the values of $\widetilde{\pi}(i,j)$ and $\pi(i,j)$ can be deduced from the marginals (details are left to the reader). A similar reasoning leads to the conclusion of Item $(ii)$ . The uniqueness part is obvious from the construction. ∎

Proof of Theorem 4.

The proof goes by induction on $n\geq 1$ . We will prove the base case towards the end of the proof. Assume first that the result holds on $\Omega_{n-1}$ . Then choose four functions $h_{1},h_{2},h_{3},h_{4}\colon\{0,1\}^{n}\to\mathbb{R}$ satisfying

[TABLE]

Fix $a,b\in\{0,1\}$ ; applying Condition (5) to $x=(x_{1}^{\prime},\dots,x_{n-1}^{\prime},a)$ and $y=(y_{1}^{\prime},\dots,y_{n-1}^{\prime},b)$ we get that

[TABLE]

which is precisely the condition of the theorem in dimension $n-1$ for the four functions $h_{1}^{a},h_{2}^{b},h_{3}^{a\wedge b}$ and $h_{4}^{a\vee b}$ . Applying the induction hypothesis we conclude that

[TABLE]

The latter holds for all $a,b\in\Omega_{1}$ . Hence, if we set $H_{i}(a):=\log\left(\sum_{x\in\Omega_{n-1}}e^{h_{i}^{a}(x)}\right)$ , for $i\in\{1,2,3,4\}$ , we have

[TABLE]

Now applying the result on $\Omega_{1}$ , we conclude that

[TABLE]

This leads to the desired conclusion since, by construction, for all $i\in\{1,2,3,4\}$ it holds $\log\left(\sum_{x\in\Omega_{1}}e^{H_{i}(x)}\right)=\log\left(\sum_{x\in\Omega_{n}}e^{h_{i}(x)}\right)$ .

Hence, in order to conclude the proof we need to prove the theorem on $\Omega_{1}$ . To that purpose, fix four functions $h_{1},h_{2},h_{3},h_{4}\colon\Omega_{1}\to\mathbb{R}$ satisfying Condition (5) (with $n=1$ ) and let $\nu_{1},\nu_{2}\in\mathcal{P}(\Omega_{1})$ . Let us show that

[TABLE]

First assume that $\nu_{1}(0)\leq\nu_{2}(0)$ . Thanks to Item $(i)$ of Lemma 5 above, there exists a coupling $\pi$ of $\nu_{1}$ and $\nu_{2}$ such that the coupling $\widetilde{\pi}$ defined as the push forward of $\pi$ under the map $S:\Omega_{1}^{2}\ni(x,y)\mapsto(x\wedge y,x\vee y)$ is still a coupling of $\nu_{1}$ and $\nu_{2}$ . It follows from the very definition of the coupling, from Condition (5), and by definition of the push-forward, that

[TABLE]

Therefore, by (4),

[TABLE]

which proves (6) in this case.

Now, if $\nu_{1}(0)>\nu_{2}(0)$ , then according to Item $(ii)$ of Lemma 5, there exists a coupling $\pi$ of $\nu_{1}$ and $\nu_{2}$ such that the probability $\widetilde{\pi}=S_{\#}\pi$ is now a coupling of $\nu_{2}$ and $\nu_{1}$ (in that order). Therefore, reasoning exactly as in (7), one gets $\int h_{1}\,d\nu_{1}+\int h_{2}\,d\nu_{2}\leq\int h_{3}\,d\nu_{2}+\int h_{4}\,d\nu_{1}$ , from which one concludes that (6) holds also in this case.

Finally, taking the supremum over $\nu_{1}$ and $\nu_{2}$ in (6) gives , thanks to (4),

[TABLE]

and completes the proof of Theorem 4. ∎

A careful reading of the proof of Theorem 4 actually leads to a slightly more general result that we now describe. Consider a functional $\Phi$ on $\mathcal{F}(\Omega_{1})$ and assume that it can be written as follows

[TABLE]

where $\Psi:\mathcal{P}(\Omega_{1})\to\mathbb{R}\cup\{\infty\}$ is a given function. Then, we define by induction a sequence of functions $\Phi^{n}$ on $\mathcal{F}(\Omega_{n})$ as follows: $\Phi^{1}=\Phi$ and for all $n\geq 2$ ,

[TABLE]

where we recall that for all $a\in\Omega_{1}$ and $h\in\mathcal{F}(\Omega_{n})$ , the function $h^{a}:\Omega_{n-1}\to\mathbb{R}$ is defined by $h^{a}(x)=h(x,a)$ , $x\in\Omega_{n-1}$ .

Following the exact same proof of Theorem 4 (details of which are left to the reader), we can conclude that, if $h_{1}$ , $h_{2}$ , $h_{3}$ , $h_{4}\colon\Omega_{n}\to\mathbb{R}$ are such that

[TABLE]

then

[TABLE]

This is a generalization of Theorem 4 since the relative entropy $\Psi(\nu)=H(\nu|m_{n})$ leads to $\Phi(h)=\log(\int_{\Omega_{1}}e^{h}dm_{1})$ by (4), and therefore, by a straightforward induction, to $\Phi^{n}(h)=\log(\int_{\Omega_{n}}e^{h}dm_{n})$ . However, we could not find any other explicit example of functional $\Phi$ and $\Phi^{n}$ of real interest. One of the reasons can be found in Hardy, Littlewood and Polya [14, Chapter 3]. Indeed, studying the generalized mean $F^{-1}(\int F(h)dm_{1})$ , these authors prove that, under some mild assumptions, it must be that $F(x)=\kappa e^{cx}$ for some constants $\kappa,c$ , leading back to the previous example.

Another natural example may be given by $\Psi(\nu)=+\infty$ for all $\nu$ expect one measure, say $m_{1}$ , for which $\Psi(m_{1})=0$ . Then, $\Phi(h)=\int hdm_{1}$ and therefore $\Phi^{n}(h)=\int hdm_{1}^{\otimes n}$ , where $m_{1}^{\otimes n}$ is the $n$ -fold product of $m_{1}$ , i.e. $m_{1}^{\otimes n}=m_{n}$ . In that case, the conclusion above is nontrivial though being a consequence of the classical conclusion of the four functions theorem (by considering $\varepsilon h_{i}$ in the limit $\varepsilon\to 0$ ).

A further generalization may be as follows. Let $U\colon[0,\infty)\to\mathbb{R}$ denote a semi-continuous, strictly convex function satisfying $\lim_{x\to\infty}U(x)/x=\infty$ and $U(1)\geq 0$ . Then, given $\mu,\nu\in\mathcal{P}(\Omega_{n})$ , we set $U_{\mu}(\nu)=\int U(f)d\mu$ , if $\nu$ is absolutely continuous with respect to $\mu$ with density $f$ , and $U_{\mu}(\nu)=+\infty$ otherwise. With such a definition, the special choice $U(x)=x\log x$ amounts to $U_{\mu}(\nu)=H(\nu|\mu)$ . Furthermore, since $U(1)\geq 0$ , by Jensen’s inequality $U_{\mu}(\nu)\geq 0$ for all $\nu\in\mathcal{P}(\Omega_{n})$ . Also, for any $f\colon\{0,1\}^{n}\to\mathbb{R}$ and $\mu\in\mathcal{P}(\Omega_{n})$ , set $\Lambda_{\mu}(f):=\sup_{\nu\in\mathcal{P}(\Omega_{n})}\left(\int_{\Omega_{n}}fd\nu-U_{\mu}(\nu)\right)$ which generalizes (4). For such $U$ ’s, as proved in [12, Proposition 2.9], it holds

[TABLE]

and

[TABLE]

with $U^{*}(y):=\sup_{x>0}\{xy-U(x)\}$ , $y\in\mathbb{R}$ . For instance, the choice $U(x)=x^{2}/2$ , $x\geq 0$ leads to $\Lambda_{m_{1}}(f)=\operatorname{Var}_{m_{1}}(f)+\int fdm_{1}-\frac{1}{2}$ if $f(0)-f(1)\in[-2,2]$ and $\Lambda_{m_{1}}(f)=\max(f(0),f(1))-1$ otherwise. At the price of multiplying $h_{i}$ by a constant, we can assume that $\max h-\inf\leq 2$ so that $\Phi(h)=\operatorname{Var}_{m_{1}}(h)+\int hdm_{1}-\frac{1}{2}$ is explicit so that one can, at least theoretically, express $\Phi^{n}$ in this case.

1.2. From the Four Function Theorem to the Prékopa-Leindler Inequality

Using the Four Functions Theorem, we shall prove the following weak version of the Prékopa-Leindler Inequality. We state and prove the result in dimension one, for simplicity, but it holds in any dimension with no extra complication besides presentation.

Proposition 7.

Let $f,g,h\colon\mathbb{R}\to\mathbb{R}$ be three continuous functions satisfying

[TABLE]

Assume furthermore that $h$ is convex and bounded from below. Then, it holds

[TABLE]

It should be noticed that equality cases are known in the Prékopa-Leindler inequality [8] and correspond to choosing precisely $h$ convex, and $f$ and $g$ proper translation and dilation of $h$ . Of course, the extra assumptions of continuity of $f,g$ and lower boundedness of $h$ could be removed via standard approximation arguments, but we refrain from further discussion, since it does not seem possible to remove the convexity assumption on $h$ and to recover the full conclusion of Theorem 1.

Proof.

Let $f,g,h\colon\mathbb{R}\to\mathbb{R}$ be continuous functions satisfying

[TABLE]

with $h$ convex and bounded from below. First let us assume that $f$ and $g$ are bounded from above. For any $n$ , define the following three functions on $\Omega_{n}$ : for $x=(x_{1},\dots,x_{n})\in\Omega_{n}$ , set

[TABLE]

Then we observe that, for any $x,y\in\Omega_{n}$ , coordinate-wise

[TABLE]

Hence, the condition satisfied by $f,g$ and $h$ transfers to $F_{n},G_{n}$ and $H_{n}$ as follows: for all $x,y\in\Omega_{n}$ ,

[TABLE]

where the last inequality follows from the convexity of $h$ . Let $M>0$ be a constant such that $f\leq M$ , $g\leq M$ and $h\geq-M$ . Then, it holds

[TABLE]

In other words $F_{n}$ , $G_{n}$ and $H_{n}\wedge 3M$ satisfy the condition of the Four Functions Theorem (with $h_{3}=h_{4}$ ) so that, denoting by $m_{n}$ the uniform probability measure on $\Omega_{n}$ ,

[TABLE]

Applying the Central Limit Theorem, one gets

[TABLE]

where $\gamma$ denotes the Standard Gaussian probability measure on $\mathbb{R}.$ Replacing $f,g,h$ by $f_{\lambda}(x):=f(\lambda^{1/2}x)$ , $g_{\lambda}(x):=g(\lambda^{1/2}x)$ and $h_{\lambda}(x):=h(\lambda^{1/2}x)$ , where $\lambda>0$ , one easily gets

[TABLE]

Letting $\lambda\to+\infty$ , the monotone convergence theorem gives the desired inequality. Finally, one can easily remove the upper boundedness assumption on $f,g$ by truncation and monotone convergence. ∎

2. Klartag-Lehec Prékopa-Leindler inequality on $\mathbb{Z}$

2.1. From Klartag-Lehec Inequality to the Four Functions Theorem

To make clear the connection with the preceding section, let us first remark that Theorem 3 implies the one dimensional version of the Four Functions Theorem (and thus the result in all dimensions by tensorization).

Indeed let $f,g,h,k$ be four non-negative functions on $\{0,1\}$ satisfying the hypothesis of the Four Functions Theorem, namely for any $x,y\in\{0,1\}$ ,

[TABLE]

Setting for any $x\in\mathbb{Z}$

[TABLE]

and similarly $\tilde{g},\tilde{h},\tilde{k}$ , one may easily check that that for any $x,y\in\mathbb{Z}$

[TABLE]

Therefore applying Theorem 3 we get the conclusion of the Four Functions Theorem,

[TABLE]

2.2. Transport proof of the Klartag-Lehec Inequality

Our goal is now to establish the following entropic version of Klartag-Lehec Inequality which is actually stronger than Theorem 3. In what follows, we recall that the monotone coupling $\pi$ between two probability measures $\nu_{0}$ and $\nu_{1}$ on $\mathbb{R}$ is defined by

[TABLE]

where $U$ is a random variable uniformly distributed on $(0,1)$ and where for all $i\in\{0,1\}$ , $F_{\nu_{i}}(x)=\nu_{i}((-\infty,x])$ , $x\in\mathbb{R},$ is the cumulative distribution of $\nu_{i}$ and $F_{\nu_{i}}^{-1}(t)=\inf\{x\in\mathbb{R}:F_{\nu_{i}}(x)\geq t\}$ , $t\in(0,1)$ , is the generalized inverse of $F_{\nu_{i}}.$

Theorem 8 (displacement convexity of entropy).

Suppose that $\nu_{0},\nu_{1}$ are two probability measures on $\mathbb{Z}$ with compact supports. Define (recall the definition of the push forward right before Lemma 5)

[TABLE]

where $\pi$ is the monotone coupling between $\nu_{0}$ and $\nu_{1}$ , and for all $x,y\in\mathbb{Z}$ ,

[TABLE]

Then, denoting by $m$ the counting measure on $\mathbb{Z}$ , it holds

[TABLE]

Before turning to the proof of Theorem 8, let us first recall how to recover Theorem 3 from Theorem 8.

Proof of Theorem 3.

The proof uses (again) the dual expression of the log-Laplace transform of any bounded function $\varphi$ :

[TABLE]

where the supremum runs over all probability measures $\nu$ on $\mathbb{Z}$ with bounded support. Let $f,g,h,k$ be four non-negative functions satisfying (2). Given $\varepsilon,\kappa>0$ and setting $f^{\varepsilon,\kappa}(x)=\max(\varepsilon,\min(f(x),\kappa))$ , one may simply check that equivalently for all $x,y\in\mathbb{Z}$ ,

[TABLE]

Integrating this inequality with respect to the monotone coupling $\pi$ of two probability measures on $\mathbb{Z}$ with bounded support $\nu_{0}$ and $\nu_{1}$ implies

[TABLE]

Therefore, applying Inequality (9) of Theorem 8 implies

[TABLE]

where the last inequality is a consequence of Identity (10). Then optimizing over all probability measures with bounded support $\nu_{0}$ and $\nu_{1}$ , and using again (10) one gets

[TABLE]

The conclusion of Theorem 3 follows by monotone convergence as $\varepsilon$ goes to 0 and $\kappa$ goes to infinity. ∎

Now we turn to the proof of Theorem 8 which in turn is a consequence of the following result of independent interest.

Theorem 9.

With the same notation as in Theorem 8, it holds

[TABLE]

Proof of Theorem 8.

The logarithm function being concave one gets by Jensen’s inequality, thanks to (11),

[TABLE]

Now observe that, by definition of $\pi$ , $\nu_{-}$ and $\nu_{+}$ ,

[TABLE]

completing the proof. ∎

In the proof of Theorem 9 we will make repeated use of the following elementary lemma:

Lemma 10.

(1)

*Let $(x_{1},y_{1}),(x_{2},y_{2})\in\mathbb{Z}^{2}$ be such that $(x_{1},y_{1})\neq(x_{2},y_{2})$ with $x_{1}\leq x_{2}$ and $y_{1}\leq y_{2}$ . Then $\lfloor\frac{x_{1}+y_{1}}{2}\rfloor=\lfloor\frac{x_{2}+y_{2}}{2}\rfloor$ if and only if $y_{2}-y_{1}+x_{2}-x_{1}=1$ and $\frac{x_{1}+y_{1}}{2}\in\mathbb{Z}$ . *

In this case, $\lceil\frac{x_{2}+y_{2}}{2}\rceil=\lceil\frac{x_{1}+y_{1}}{2}\rceil+1$ . 2. (2)

Let $(x_{1},y_{1}),(x_{2},y_{2})\in\mathbb{Z}^{2}$ be such that $x_{1}\leq x_{2}$ , $y_{1}\leq y_{2}$ , $\lfloor\frac{x_{1}+y_{1}}{2}\rfloor=a$ and $\lfloor\frac{x_{2}+y_{2}}{2}\rfloor=a^{\prime}$ with $a<a^{\prime}$ .

•

If $a^{\prime}\geq a+2$ , $\lceil\frac{x_{1}+y_{1}}{2}\rceil\neq\lceil\frac{x_{2}+y_{2}}{2}\rceil$ .

•

If $a^{\prime}=a+1$ , $\lceil\frac{x_{1}+y_{1}}{2}\rceil=\lceil\frac{x_{2}+y_{2}}{2}\rceil$ if and only if $y_{2}-y_{1}+x_{2}-x_{1}=1$ with $\frac{x_{1}+y_{1}}{2}\in\mathbb{Z}+\frac{1}{2}$ .

The following figures illustrate the next lemma.

$x_{1}$$x_{2}$$y_{1}=y_{2}$ item (1)= $\lfloor\frac{x_{1}+y_{1}}{2}\rfloor=\lfloor\frac{x_{2}+y_{2}}{2}\rfloor$$y_{2}-y_{1}+x_{2}-x_{1}=1,\;\;\frac{x_{1}+y_{1}}{2}\in\mathbb{Z}$$x$$\frac{x+y}{2}$$y$$x_{1}=x_{2}$$y_{1}$$y_{2}$

$a$$a^{\prime}$$x_{1}$$x_{2}$$y_{1}=y_{2}$ item (2), $a^{\prime}=a+1$$a$$a=\lfloor\frac{x_{1}+y_{1}}{2}\rfloor,\quad a^{\prime}=\lfloor\frac{x_{2}+y_{2}}{2}\rfloor$$a^{\prime}$$y_{2}-y_{1}+x_{2}-x_{1}=1,\;\frac{x_{1}+y_{1}}{2}\in\mathbb{Z}+\frac{1}{2}$$x$$\frac{x+y}{2}$$y$$x_{1}=x_{2}$$y_{1}$$y_{2}$

Proof of Lemma 10.

(1) If $y_{2}-y_{1}+x_{2}-x_{1}\geq 2$ , then $\frac{x_{2}+y_{2}}{2}\geq\frac{x_{1}+y_{1}}{2}+1$ and thus $\lfloor\frac{x_{2}+y_{2}}{2}\rfloor\geq\lfloor\frac{x_{1}+y_{1}}{2}\rfloor+1.$ Hence $y_{2}-y_{1}+x_{2}-x_{1}=1.$ Without loss of generality one can assume that $x_{1}=x_{2}$ and $y_{2}=y_{1}+1$ . But in this case, $\frac{x_{2}+y_{2}}{2}=\frac{x_{1}+y_{1}}{2}+\frac{1}{2}$ . The fact that $\lfloor\frac{x_{1}+y_{1}}{2}\rfloor=\lfloor\frac{x_{2}+y_{2}}{2}\rfloor$ then implies that $\frac{x_{1}+y_{1}}{2}\in\mathbb{Z}.$ The converse is obvious. In this case $\lceil\frac{x_{2}+y_{2}}{2}\rceil=\lceil\frac{x_{1}+y_{1}}{2}+\frac{1}{2}\rceil=\frac{x_{1}+y_{1}}{2}+1=\lceil\frac{x_{1}+y_{1}}{2}\rceil+1$ .

(2) If $a^{\prime}\geq a+2$ , then

[TABLE]

Now let us assume that $a^{\prime}=a+1$ . If $y_{2}-y_{1}+x_{2}-x_{1}=2$ , then $\frac{x_{2}+y_{2}}{2}=\frac{x_{1}+y_{1}}{2}+1$ and so $\lceil\frac{x_{2}+y_{2}}{2}\rceil=\lceil\frac{x_{1}+y_{1}}{2}\rceil+1.$ Therefore $y_{2}-y_{1}+x_{2}-x_{1}=1$ and so $\frac{x_{2}+y_{2}}{2}=\frac{x_{1}+y_{1}}{2}+\frac{1}{2}$ . The condition $\lfloor\frac{x_{2}+y_{2}}{2}\rfloor=\lfloor\frac{x_{1}+y_{1}}{2}\rfloor+1$ then implies that $\frac{x_{1}+y_{1}}{2}\in\mathbb{Z}+\frac{1}{2}$ . Then it holds $\lceil\frac{x_{2}+y_{2}}{2}\rceil=\lceil\frac{x_{1}+y_{1}}{2}+\frac{1}{2}\rceil=\frac{x_{1}+y_{1}}{2}+\frac{1}{2}=\lceil\frac{x_{1}+y_{1}}{2}\rceil.$ The converse is obvious. ∎

Before proving Theorem 9 let us introduce some notation. We will denote

[TABLE]

and for all $a\in\mathbb{Z}$ ,

[TABLE]

(with thus $S(a)=\emptyset$ when $a\notin M_{-}$ ).

Lemma 11.

For any $a\in\mathbb{Z}$ , $\mathrm{Card}(S(a))\in\{0,1,2\}$ .

Proof of Lemma 11.

Let $a\in M_{-}$ . By compactness of the support of $\pi$ , the set $S(a)$ is finite. Suppose that $\mathrm{Card}(S(a))>1$ . Let $x_{0}$ be the minimal first coordinate of the elements of $S(a)$ , and let $y_{0}$ be the minimal second coordinate of the elements of $S(a)$ having $x_{0}$ as first coordinate. If $(x_{1},y_{1})$ is another element of $S(a)$ , then either $x_{0}=x_{1}$ and $y_{0}\leq y_{1}$ , or $x_{0}<x_{1}$ and in this case, by monotonicity of the support of $\pi$ , one has $y_{0}\leq y_{1}$ . According to Item (1) of Lemma 10, one has $\frac{x_{0}+y_{0}}{2}\in\mathbb{Z}$ and ( $x_{0}=x_{1}$ and $y_{1}=y_{0}+1$ ) or ( $y_{0}=y_{1}$ and $x_{1}=x_{0}+1$ ). By monotonicity of the support of $\pi$ , these two cases exclude each other and so $\mathrm{Card}(S(a))=2.$ ∎

For $i\in\{1,2\}$ , we will denote by $M_{-}^{i}$ the set of $a\in M_{-}$ such that $\mathrm{Card}(S(a))=i$ . If $a\in M_{-}^{1}$ , the unique element of $S(a)$ will be denoted by $(x_{0}(a),y_{0}(a))$ . If $a\in M_{-}^{2}$ , we will denote by $(x_{0}(a),y_{0}(a))$ and $(x_{1}(a),y_{1}(a))$ the two elements of $S(a)$ , with the convention that $x_{0}(a)\leq x_{1}(a)$ and $y_{0}(a)\leq y_{1}(a)$ and $\frac{x_{0}(a)+y_{0}(a)}{2}\in\mathbb{Z}$ as in Lemma 10 and the proof above.

Proof of Theorem 9.

Using the notation above, we need to show that the following quantity is less than or equal to $1$ .

[TABLE]

The strategy to bound $P$ by 1 is to show that, in fact,

[TABLE]

For that purpose we consider two cases.

First case. Let $a\in M_{-}$ be such that

[TABLE]

Then let us show that for all $(x,y)\in S(a)$ , it holds

[TABLE]

We distinguish between two sub-cases, $a\in M_{-}^{1}$ and $a\in M_{-}^{2}$ . Suppose first that $a\in M_{-}^{1}$ . Then $S(a)=\{(x_{0},y_{0})\}$ and therefore

[TABLE]

Moreover, since $a$ satisfies (13), Item 2 of Lemma 10 gives that

[TABLE]

Since $\pi(x_{0},y_{0})\leq\min(\nu_{0}(x_{0}),\nu_{1}(y_{0}))$ , this gives (14). Now let us assume that $a\in M_{-}^{2}$ . Then one can assume without loss of generality that $S(a)=\{(x_{0},y_{0}),(x_{0},y_{0}+1)\}$ with $\frac{x_{0}+y_{0}}{2}\in\mathbb{Z}$ and thus $m_{+}(x_{0},y_{0}+1)=m_{+}(x_{0},y_{0})+1.$ In this case,

[TABLE]

and reasoning as above

[TABLE]

which establish (14).

Second case. Let $a_{0}\in M_{-}$ and $p\geq 1$ such that $m_{+}(S(a_{0}+i))\cap m_{+}(S(a_{0}+i+1))\neq\emptyset$ for all $i\in\{0,\ldots,p-1\}$ and such that $m_{+}(S(a_{0}-1))\cap m_{+}(S(a_{0}))=\emptyset$ and $m_{+}(S(a_{0}+p))\cap m_{+}(S(a_{0}+p+1))=\emptyset$ (i.e. $p$ is maximal). Since $m_{+}(S(a_{0}+i))\subset\{a_{0}+i;a_{0}+i+1\}$ , the only possibility is that $m_{+}(S(a_{0}+i))=\{a_{0}+i;a_{0}+i+1\}$ for all $i\in\{1,\ldots,p-1\}$ (this set being empty if $p=1$ ). Let us assume that $a_{0}\in M_{-}^{2}$ and $a_{0}+p\in M_{-}^{2}$ (the other cases are dealt similarly). Let us denote $(x_{0}^{i},y_{0}^{i})=(x_{0}(a_{0}+i),y_{0}(a_{0}+i))$ and $(x_{1}^{i},y_{1}^{i})=(x_{1}(a_{0}+i),y_{1}(a_{0}+i))$ (recall that by definition $x_{1}^{i}\geq x_{0}^{i}$ and $y_{1}^{i}\geq y_{0}^{i}$ ). According to Lemma 10, it holds $x_{1}^{i}-x_{0}^{i}+y_{1}^{i}-y_{0}^{i}=1$ and $y_{0}^{i+1}-y_{1}^{i}+x_{0}^{i+1}-x_{1}^{i}=1$ .

Let us introduce

[TABLE]

and let us show that

[TABLE]

We will use the following facts:

•

Fact 1 : For all $i\in\{0,\ldots,p\}$ it holds

[TABLE]

•

Fact 2 : For all $i\in\{1,\ldots,p\}$

[TABLE]

and $\nu_{+}(a_{0})=\pi(x_{0}^{0},y_{0}^{0})$ and $\nu_{+}(a_{0}+p+1)=\pi(x_{1}^{p},y_{1}^{p})$ .

Observe that

[TABLE]

where, for $i\in\{1,\ldots,p\}$ ,

[TABLE]

and

[TABLE]

According to Fact 2, in order to prove (15), it is enough to show that $\alpha_{i}\leq 1$ for all $i\in\{0,\ldots,p+1\}.$

For $i=p+1$ , one can assume without loss of generality that $x_{0}^{p}=x_{1}^{p}$ . Then, according to Fact 1, $\nu_{-}(a_{0}+p)\leq\nu_{0}(x_{1}^{p})$ and since $\pi(x_{1}^{p},y_{1}^{p})\leq\nu_{1}(y_{1}^{p})$ , it follows that $\alpha_{p+1}\leq 1$ . The case $i=0$ is similar.

Now let us consider the case $i\in\{1,\ldots,p\}$ . Observe that either $x_{1}^{i-1}=x_{0}^{i}$ either $y_{1}^{i-1}=y_{0}^{i}$ . Without loss of generality, one can assume that $x_{1}^{i-1}=x_{0}^{i}$ (the case $y_{1}^{i-1}=y_{0}^{i}$ follows by symmetry in $x$ and $y$ ), so that

[TABLE]

Let us consider the following subcases:

(a)

If $x_{0}^{i-1}=x_{1}^{i-1}=x_{0}^{i}=x_{1}^{i}$ , then $y_{1}^{i-1},y_{0}^{i},y_{1}^{i}$ are pairwise distinct.

: $\,\,a_{0}+i-1$ : $\,\,a_{0}+i$$x$$\frac{x+y}{2}$$y$$x_{0}^{i-1}\!\!\!\!\!=x_{1}^{i-1}\!\!\!\!\!=x_{0}^{i}=x_{1}^{i}$$y_{0}^{i-1}$$y_{1}^{i-1}$$y_{0}^{i}$$y_{1}^{i}$

Since $\pi(x_{1}^{i-1},y_{1}^{i-1})\leq\nu_{1}(y_{1}^{i-1})$ and $\pi(x_{0}^{i},y_{0}^{i})\leq\nu_{1}(y_{0}^{i})$ , by using Fact 1, one gets

[TABLE]

The last inequality holds since $x_{0}^{i-1}=x_{1}^{i-1}=x_{0}^{i}=x_{1}^{i}$ and $y_{1}^{i-1},y_{0}^{i},y_{1}^{i}$ are pairwise distinct.

(b)

If $x_{0}^{i-1}\neq x_{1}^{i-1}=x_{0}^{i}=x_{1}^{i}$ , then necessarily $y_{0}^{i-1}=y_{1}^{i-1}$ .

$x$$\frac{x+y}{2}$$y$$x_{0}^{i-1}$$x_{1}^{i-1}\!\!\!\!\!=x_{0}^{i}=x_{1}^{i}$$y_{0}^{i-1}=y_{1}^{i-1}$$y_{0}^{i}$$y_{1}^{i}$

Using Fact 1, one gets $\nu_{-}(a_{0}+i-1)\leq\nu_{1}(y_{1}^{i-1})$ . Since $\nu_{-}(a_{0}+i)=\pi(x_{0}^{i},y_{0}^{i})+\pi(x_{1}^{i},y_{1}^{i})$ and $\pi(x_{0}^{i},y_{0}^{i})\leq\nu_{1}(y_{0}^{i})$ one gets

[TABLE]

(c)

If $x_{0}^{i-1}=x_{1}^{i-1}=x_{0}^{i}\neq x_{1}^{i}$ , then necessarily $y_{0}^{i}=y_{1}^{i}$ .

$x$$\frac{x+y}{2}$$y$$x_{0}^{i-1}\!\!\!\!\!=x_{1}^{i-1}\!\!\!\!\!=x_{0}^{i}$$x_{1}^{i}$$y_{0}^{i-1}$$y_{1}^{i-1}$$y_{0}^{i}=y_{1}^{i}$

Using Fact 1, one gets $\nu_{-}(a_{0}+i)\leq\nu_{1}(y_{0}^{i})$ . Since $\nu_{-}(a_{0}+i-1)=\pi(x_{0}^{i-1},y_{0}^{i-1})+\pi(x_{1}^{i-1},y_{1}^{i-1})$ and $\pi(x_{1}^{i-1},y_{1}^{i-1})\leq\nu_{1}(y_{0}^{i-1})$ , it follows that

[TABLE]

(d)

If $x_{0}^{i-1}\neq x_{1}^{i-1}$ , $x_{1}^{i-1}=x_{0}^{i}$ $x_{0}^{i}\neq x_{1}^{i}$ , then necessarily $y_{0}^{i-1}=y_{1}^{i-1}$ and $y_{0}^{i}=y_{1}^{i}$ .

$x$$\frac{x+y}{2}$$y$$x_{0}^{i-1}$$x_{1}^{i-1}\!\!\!\!\!\!\!=x_{0}^{i}$$x_{1}^{i}$$y_{0}^{i-1}\!\!\!\!\!=y_{1}^{i-1}$$y_{0}^{i}=y_{1}^{i}$

Reasoning as in the preceding cases, one gets $\nu_{-}(a_{0}+i-1)\leq\nu_{1}(y_{1}^{i-1})$ , $\nu_{-}(a_{0}+i)\leq\nu_{1}(y_{1}^{i})$ and so

[TABLE]

Conclusion : by considering successively the elements $a\in M_{-}$ in increasing order, case 1 can be repeated successively several times and we may pass from case 1 to case 2 or from case 2 to case 1. Therefore after a finite use of cases 1 and 2 described above, (14) and (15) imply (12). This concludes the proof of Theorem 9. ∎

2.3. From the Klartag-Lehec Inequality to the Prékopa-Leindler Inequality

First, let us explain how to recover the conclusion of Theorem 1 for $t=1/2$ and continuous functions using Theorem 3. More precisely we are going to show that if $F,G,H,K:\mathbb{R}\to\mathbb{R}^{+}$ are continuous functions such that

[TABLE]

then

[TABLE]

Then taking in particular $H=K$ gives the conclusion of Theorem 1 for $t=1/2$ .

Proof of (16).

Let $N\geq 1$ and for all positive integer $n$ consider the grid $x_{i}^{n}=-N+2\frac{iN}{n}$ , $i\in\{0,\ldots,n\}$ . Define $f,g,h,k:\mathbb{Z}\to\mathbb{R}^{+}$ as follows :

[TABLE]

If $i,j\in\{0,\ldots,n\}$ then, there is some $\varepsilon\in\{0,1\}$ such that

[TABLE]

and so $H(\frac{x_{i}^{n}+x_{j}^{n}}{2})\leq h(\lfloor\frac{i+j}{2}\rfloor)$ . Similarly, $K(\frac{x_{i}^{n}+x_{j}^{n}}{2})\leq k(\lceil\frac{i+j}{2}\rceil)$ . Therefore, for all $i,j\in\{0,\ldots,n\}$ ,

[TABLE]

The functions $f,g,h,k$ thus satisfy the assumption of Theorem 3 and so

[TABLE]

By uniform continuity of $f,g,h,k$ on $[-2N,2N]$ , multiplying both sides by $(2N/n)^{2}$ and letting $n\to+\infty$ , it follows that

[TABLE]

Finally, letting $N\to+\infty$ gives (16). ∎

2.4. Displacement convexity of entropy : from discrete to continuous

In the same vein as in the previous sub-section, one can deduce from Theorem 8, the following well-known continuous version of the displacement convexity of the relative entropy with respect to Lebesgue measure.

Theorem 12.

Let $\nu_{0},\nu_{1}$ be probability measures on $\mathbb{R}$ with compact supports and define $\nu_{1/2}$ as the law of $\frac{X_{0}+X_{1}}{2}$ , where $(X_{0},X_{1})$ is distributed according to the monotone rearrangement coupling $\pi$ between $\nu_{0}$ and $\nu_{1}$ . Then it holds

[TABLE]

Proof.

Without loss of generality, one can assume that $H(\nu_{0}|\mathrm{Leb})+H(\nu_{1}|\mathrm{Leb})<+\infty$ . Consider $(X_{0},X_{1})$ distributed according to $\pi$ and define, for $n\geq 1$ , $\pi^{n}=\mathrm{Law}\left(\frac{\lfloor nX_{0}\rfloor}{n},\frac{\lfloor nX_{1}\rfloor}{n}\right)$ and $\nu_{0}^{n}=\mathrm{Law}\left(\frac{\lfloor nX_{0}\rfloor}{n}\right)$ and $\nu_{1}^{n}=\mathrm{Law}\left(\frac{\lfloor nX_{1}\rfloor}{n}\right)$ . The coupling $\pi^{n}$ is easily seen to be monotone. Since Theorem 8 immediately extends to probability measures on $\frac{1}{n}\mathbb{Z}$ , one gets

[TABLE]

where $m^{n}$ is the counting measure on $\frac{1}{n}\mathbb{Z}$ and

[TABLE]

Assuming that $\nu_{0}([-K,K[)=\nu_{1}([-K,K[)=1$ , where $K\geq 1$ is an integer and denoting by $\mu^{n}$ the probability measure $\frac{1}{2nK}\mathbf{1}_{[-K,K[}m^{n}$ , (17) is equivalent to

[TABLE]

Let $\mu$ be the uniform (continuous) distribution on $[-K,K[.$ On the one hand, for $i\in\{0,1\}$

[TABLE]

where the inequality comes from Jensen’s inequality applied to the convex function $x\mapsto x\log x.$ On the other hand, it is easy to see that $\nu_{-}^{n}$ and $\nu_{+}^{n}$ both weakly converge to $\nu_{1/2}$ (this comes from the almost sure convergence of the underlying random variables) and that $\mu^{n}$ weakly converges to $\mu$ . Therefore, by lower semicontinuity of $(\alpha,\beta)\mapsto H(\alpha|\beta)$ for the weak convergence topology, one concludes that

[TABLE]

which proves the claim. ∎

3. Inequalities with curvature terms for log-concave distributions.

Finally, let us show how to derive from Theorem 3 other versions adapted to log-concave probability measures. The following result is a straightforward restatement of Theorem 3.

Corollary 13.

*Let $\mu$ be a probability measure on $\mathbb{Z}$ such that $\mu(x)>0$ for all $x\in\mathbb{Z}$ .

If $f,g,h,k:\mathbb{Z}\to\mathbb{R}^{+}$ are such that*

[TABLE]

where

[TABLE]

then it holds

[TABLE]

Proof.

Simply note that the functions $F(x)=f(x)\mu(x)$ , $G(x)=g(x)\mu(x)$ , $H(x)=h(x)\mu(x)$ and $K(x)=k(x)\mu(x)$ , $x\in\mathbb{Z}$ , satisfy the assumptions of Theorem 3. ∎

Note that the cost function $c_{\mu}$ always satisfies

[TABLE]

Let us introduce the optimal transport cost $\mathcal{T}_{c_{\mu}}$ associated to this cost function $c_{\mu}$ :

[TABLE]

with $\Pi(\nu_{0},\nu_{1})$ the set of probability measures on $\mathbb{Z}^{2}$ such that the first marginal of $\pi$ is $\nu_{0}$ and the second is $\nu_{1}.$

Corollary 14.

Let $\mu$ be a probability measure on $\mathbb{Z}$ such that $\mu(x)>0$ for all $x\in\mathbb{Z}$ . Then $\mu$ satisfies the following transport-entropy inequality : for all probability measures $\nu_{0},\nu_{1}$ on $\mathbb{Z}$ ,

[TABLE]

Proof.

Let $u,v:\mathbb{Z}\to\mathbb{R}$ be such that

[TABLE]

Then according to Corollary 13 applied to $f=e^{u}$ , $g=e^{v}$ and $h=k=1$ , it holds

[TABLE]

This is the dual form of (19). ∎

The preceding corollary is the most interesting when the cost function $c_{\mu}$ is non-negative. A natural condition ensuring non-negativity of $c_{\mu}$ is the log-concavity of $\mu$ . We recall that a probability measure $\mu$ on $\mathbb{Z}$ is log-concave if it is such that

[TABLE]

If one defines, for any $t\in\mathbb{R}$ , $V_{\mu}(t)$ as the linear interpolation between $\log\mu(\lfloor t\rfloor)$ and $\log\mu(\lceil t\rceil)$ , then it is easy to check that $\mu$ is log-concave if and only if the function $V_{\mu}$ is concave on $\mathbb{R}$ .

Lemma 15.

Suppose that $\mu$ is log-concave on $\mathbb{Z}$ and such that $\mu(x)>0$ for all $x\in\mathbb{Z}$ , then $c_{\mu}(x,y)\geq 0$ for all $x,y\in\mathbb{Z}.$

Proof.

Without loss of generality, one can assume that $x<y$ . If $(x+y)=2k$ , with $k\in\mathbb{Z}$ , then we have to show that $\mu(k)^{2}\geq\mu(x)\mu(y)$ . With the notation $V_{\mu}$ introduced above, this inequality is equivalent to $\frac{V_{\mu}(k)-V_{\mu}(x)}{k-x}\geq\frac{V_{\mu}(y)-V_{\mu}(k)}{y-k}$ which follows immediately from the concavity of $V_{\mu}$ . If $x+y=2k+1$ , then the inequality $\mu(k)\mu(k+1)\geq\mu(x)\mu(y)$ is equivalent to $\frac{V_{\mu}(k)-V_{\mu}(x)}{k-x}\geq\frac{V_{\mu}(y)-V_{\mu}(k+1)}{y-(k+1)}$ which again follows from the concavity of $V_{\mu}$ . ∎

As an illustration, we end this section with the computation of the cost $c_{\mu}$ for two specific examples of probability measures $\mu$ on $\mathbb{Z}$ . Consider first the double-sided geometric-type measures $\mu(x)=ce^{-|x|}$ , $x\in\mathbb{Z}$ , where $c$ is the normalization constant. Then, an easy computation leads to $c_{\mu}(x,y)=2\min(|x|,|y|)\mathds{1}_{xy<0}$ . While for $\mu(x)=ce^{-2x^{2}}$ (with $c$ again the normalization constant), we get $c_{\mu}(x,y)=(x-y)^{2}\mathds{1}_{x+y\in 2\mathbb{Z}}+[(x-y)^{2}-1]\mathds{1}_{x+y\in 2\mathbb{Z}+1}$ . There is essentially no gain in the first case, which corresponds to a flat situation, while the second example resembles the continuous setting with strictly convex potential for which $\Gamma_{2}$ -calculus applies (see [2, 3, 30]).

Bibliography31

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] R. Ahlswede and D. E. Daykin, An inequality for the weights of two families of sets, their unions and intersections , Z. Wahrsch. Verw. Gebiete 43 (1978), no. 3, 183–185. MR 0491189
2[2] C. Ané, S. Blachère, D. Chafaï, P. Fougères, I. Gentil, F. Malrieu, C. Roberto, and G. Scheffer, Sur les inégalités de Sobolev logarithmiques , Panoramas et Synthèses [Panoramas and Syntheses], vol. 10, Société Mathématique de France, Paris, 2000, With a preface by Dominique Bakry and Michel Ledoux. MR MR 1845806 (2002 g:46132)
3[3] D. Bakry, I. Gentil, and M. Ledoux, Analysis and geometry of Markov diffusion operators , Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 348, Springer, Cham, 2014.
4[4] S. G. Bobkov and M. Ledoux, From Brunn-Minkowski to Brascamp-Lieb and to logarithmic Sobolev inequalities , Geom. Funct. Anal. 10 (2000), no. 5, 1028–1052. MR 1800062
5[5] A.I. Bonciocat and K.T. Sturm, Mass transportation and rough curvature bounds for discrete spaces , J. Funct. Anal. 256 (2009), no. 9, 2944–2966.
6[6] D. Cordero-Erausquin, R. J. Mc Cann, and M. Schmuckenschläger, A Riemannian interpolation inequality à la Borell, Brascamp and Lieb , Invent. Math. 146 (2001), no. 2, 219–257. MR 1865396
7[7] by same author, Prékopa-Leindler type inequalities on Riemannian manifolds, Jacobi fields, and optimal transport , Ann. Fac. Sci. Toulouse Math. (6) 15 (2006), no. 4, 613–635. MR 2295207
8[8] S. Dubuc, Critères de convexité et inégalités intégrales , Ann. Inst. Fourier (Grenoble) 27 (1977), no. 1, x, 135–165. MR 0444863

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Transport Proofs of some discrete variants of the Prékopa-Leindler inequality

Abstract.

Key words and phrases:

1991 Mathematics Subject Classification:

Introduction

Theorem 1** (Prékopa-Leindler).**

Theorem 2** (Ahlswede-Daykin).**

Theorem 3**.**

Proof of Theorem 1 for d=1d=1d=1.

1. The Four Functions theorem

1.1. A transport proof of the Four Functions Theorem

Theorem 4**.**

Lemma 5**.**

Remark 6**.**

Proof.

Proof of Theorem 4.

1.2. From the Four Function Theorem to the Prékopa-Leindler Inequality

Proposition 7**.**

Proof.

2. Klartag-Lehec Prékopa-Leindler inequality on Z\mathbb{Z}Z

2.1. From Klartag-Lehec Inequality to the Four Functions Theorem

2.2. Transport proof of the Klartag-Lehec Inequality

Theorem 8** (displacement convexity of entropy).**

Proof of Theorem 3.

Theorem 9**.**

Proof of Theorem 8.

Lemma 10**.**

Proof of Lemma 10.

Lemma 11**.**

Proof of Lemma 11.

Proof of Theorem 9.

2.3. From the Klartag-Lehec Inequality to the Prékopa-Leindler Inequality

Proof of (16).

2.4. Displacement convexity of entropy : from discrete to continuous

Theorem 12**.**

Proof.

3. Inequalities with curvature terms for log-concave distributions.

Corollary 13**.**

Proof.

Corollary 14**.**

Proof.

Lemma 15**.**

Proof.

Theorem 1 (Prékopa-Leindler).

Theorem 2 (Ahlswede-Daykin).

Theorem 3.

Proof of Theorem 1 for $d=1$ .

Theorem 4.

Lemma 5.

Remark 6.

Proposition 7.

2. Klartag-Lehec Prékopa-Leindler inequality on $\mathbb{Z}$

Theorem 8 (displacement convexity of entropy).

Theorem 9.

Lemma 10.

Lemma 11.

Theorem 12.

Corollary 13.

Corollary 14.

Lemma 15.