Transport Proofs Of Some Discrete Variants Of The Pr{\'e}Kopa-leindler Inequality
Nathael Gozlan (MAP5 - UMR 8145), Cyril Roberto (LAMA), Paul-Marie, Samson (LAMA), Prasad Tetali (School of Mathematics)

TL;DR
This paper provides a transport-based proof of discrete displacement convexity of entropy on integers, leading to new discrete forms of the Prékopa-Leindler inequality, including the Four Functions Theorem and recent results on Z.
Contribution
It introduces a novel transport proof for discrete displacement convexity, deriving new discrete inequalities from continuous analogs.
Findings
Established a transport proof for discrete displacement convexity on integers
Derived two discrete forms of the Prékopa-Leindler inequality
Connected continuous and discrete convexity results
Abstract
We give a transport proof of a discrete version of the displacement convexity of entropy on integers (Z), and get, as a consequence, two discrete forms of the Pr{\'e}kopa-Leindler Inequality : the Four Functions Theorem of Ahlswede and Daykin on the discrete hypercube [1] and a recent result on Z due to Klartag and Lehec [16].
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeometric Analysis and Curvature Flows · Mathematical Dynamics and Fractals · Spectral Theory in Mathematical Physics
Transport Proofs of some discrete variants of the Prékopa-Leindler inequality
Nathael Gozlan, Cyril Roberto, Paul-Marie Samson, Prasad Tetali
N. Gozlan : Université Paris Descartes, MAP5, UMR 8145, 45 rue des Saints Pères, 75270 Paris Cedex 06
P.-M. Samson : Université Paris-Est, Laboratoire d’Analyse et de Mathématiques Appliquées (UMR 8050), UPEM, UPEC, CNRS, F-77454, Marne-la-Vallée, France
C. Roberto : Université Paris Nanterre - Modal’X, 200 avenue de la République 92000 Nanterre, France
P. Tetali : School of Mathematics & School of Computer Science, Georgia Institute of Technology, Atlanta, GA 30332
[email protected], [email protected], [email protected],[email protected]
Abstract.
We give a transport proof of a discrete version of the displacement convexity of entropy on integers (), and get, as a consequence, two discrete forms of the Prékopa-Leindler Inequality : the Four Functions Theorem of Ahlswede and Daykin on the discrete hypercube [1] and a recent result on due to Klartag and Lehec [16].
Key words and phrases:
Prékopa-Leindler Inequality, Optimal Transport
1991 Mathematics Subject Classification:
60E15, 32F32 and 26D10
This research is partly funded by the Bézout Labex, funded by ANR, reference ANR-10-LABX-58 and the Labex MME-DII funded by ANR, reference ANR-11-LBX-0023-01. Research of P.T. is supported in part by the NSF grant DMS-1811935
Introduction
The aim of the paper is to develop a transport approach to some discrete versions of the Prékopa-Leindler Inequality [25, 26, 19], namely the Four Functions Theorem due to Ahlswede and Daykin [1] and a recent result of Klartag and Lehec [16] on . Both inequalities will be a consequence of the stronger displacement convexity of entropy on the set of integers. Before presenting these discrete functional inequalities, let us recall the original continuous statement inspiring them.
The classical Prékopa-Leindler Inequality is the following.
Theorem 1** (Prékopa-Leindler).**
Suppose that are measurable functions such that, for some ,
[TABLE]
Then
[TABLE]
The Prékopa-Leindler Inequality is a functional version of the celebrated Brunn-Minkowski Inequality stating that for all Borel sets and all it holds
[TABLE]
where denotes the Lebesgue measure on It is more generally intimately related to the study of log-concave measures which is of considerable importance in convex geometry, probability theory and statistics. In particular, many geometric and functional inequalities for uniformly log-concave probability measures can be derived from Theorem 1 (see in particular the paper [4] by Bobkov and Ledoux). We refer to [11] for a thorough presentation of the subject as well as for historical comments on Theorem 1.
The question of extending the Prékopa-Leindler inequality outside the flat space framework has been tackled by many authors in recent years and turned out to be extremely fruitful in Geometry, Analysis and Probability. A first step has been accomplished by Cordero-Erausquin, McCann and Schmuckenschläger in [6, 7], who obtained extensions of the Prékopa-Leindler inequality on Riemannian manifolds with a lower bounded Ricci curvature. Their extension is closely related to displacement convexity properties of entropic functionals, first introduced by McCann in [21] in the flat space framework, and then extended to Riemannian manifolds by Otto and Villani [24] and von Renesse and Sturm [31]. This displacement convexity formulation is actually equivalent to lower bounds on the Ricci curvature and led to the Lott-Sturm-Villani [20, 27, 28] definition of metric measure spaces with lower bounded Ricci curvature which makes sense even in a non-smooth framework.
In a similar vein, it would also be satisfactory to extend the Prékopa-Leindler inequality to discrete frameworks such as graphs (which are not covered by the Lott-Sturm-Villani theory). Several general definitions of discrete spaces with lower bounded curvature were recently proposed, in particular by Bonciocat and Sturm [5], Ollivier [22], Ollivier and Villani [23], Erbar and Maas [9], Hillion [15] or the authors [13]. While these different definitions are all efficient at the level of functional inequalities and are satisfied by a large collection of classical graphs, none of them really succeeds in leading to a satisfactory Prékopa-Leindler or Brunn-Minkowski inequality on those spaces.
However, for at least two specific discrete spaces, convincing Prékopa-Leindler type inequalities already exist.
The first one, is the celebrated Four Functions Theorem on the discrete hypercube by Ahlswede and Daykin [1]. To recall its statement, we will need the following notation. The discrete hypercube will be denoted by and for all , one defines
[TABLE]
Theorem 2** (Ahlswede-Daykin).**
Suppose that are such that
[TABLE]
then
[TABLE]
Note that this result mimics the statement of Theorem 1 for on . Theorem 2 has important implications in terms of correlation inequalities, as it gives back in particular the classical FKG inequality which has a lot of applications in percolation and statistical mechanics [10].
The second discrete form of the Prékopa-Leindler Inequality we will consider is a recent one due to Klartag and Lehec [16], and holds on the space of integers. Denote by and the ceiling and floor functions respectively.
Theorem 3**.**
Suppose that are such that
[TABLE]
Then
[TABLE]
As we will see in Section 2, Theorem 3 implies Theorem 2 for (which then gives the full conclusion by induction, see the proof of Theorem 2 in Section 1). Moreover Theorem 3 implies back Theorem 1 for (and thus for all other values of ). The proof given by Klartag and Lehec in [16] relies on rather sophisticated tools of stochastic analysis on the Poisson space and in particular on a stochastic representation formula for the relative entropy functional with respect to the Poisson distribution on the (non-negative) integers.
As already stated above, the main objective of the present paper is to recover Theorems 2 and 3 by means of optimal transport tools. In the continuous setting, optimal transport is indeed a very efficient way to establish functional inequalities (see [29, 30] and the references therein) and it is a challenging question to see how these powerful techniques can be adapted to the discrete world. To make this introduction more self-contained and to illustrate the difficulties in dealing with discrete structures, let us briefly recall a classical transport proof of Theorem 1 in dimension .
Proof of Theorem 1 for .
Without loss of generality, one can assume that , with and two positive and continuous functions. Defining , , a natural transport map between the probability measures and is given by , where , , and , , are the cumulative distribution functions of and . The change of variable formula immediately gives the following relation between and :
[TABLE]
Plugging into (1) one gets by change of variables (, note that is increasing by construction)
[TABLE]
where the inequality comes from (1) and the arithmetic-geometric inequality , , (appplied to and ), while the last equality comes from (3). ∎
The proof for is done by induction (see e.g the proof of [18, Theorem 2.13]). It is also possible to prove this result directly in dimension , by using the Brenier or the Knothe transport maps and the Monge-Ampère equation. See [29, Chapter 6] for details. Note that the use of coupling arguments for establishing Brunn-Minkowski type inequalities goes back at least to Knothe [17].
Analyzing the proof above immediately reveals two obvious obstacles that prevent to export it easily to the discrete setting:
- (1)
Transport maps between probability measures and usually do not exist when the space is discrete and one often needs to cut the mass of atoms of the source measure to reconstruct the target measure ; 2. (2)
Even if there is a transport map sending on , there is no Jacobian equation such as (3).
In the case of Theorem 2 and 3, it turns out that these difficulties can be circumvented. It would be useless at this point to state general rules, however it seems at least that in both situations choosing helps a lot by introducing symmetry and compensations to overcome the lack of Jacobian equation.
In fact, we will go beyond Theorem 2 and 3 by proving, by transport arguments, a stronger statement: namely an entropic version of the Prékopa-Leindler Inequality (that we may also call displacement convexity of entropy), see Theorem 8 for a precise statement. In that sense, since such an entropic statement implies the Klartag-Lehec version of the Prékopa-Leindler Inequality on , which in turn, at the price of an obvious induction step, implies the Four Functions theorem, all results appear to be the consequence of one single (transport) proof. Moreover, our displacement convexity result on the integers, as the mesh size goes to 0, converges to the classical displacement convexity of entropy on the line (for ), obtained by McCann in [21] which shows the compatibility of our results to the well-known equivalent statement in the continuous.
The paper is organized as follows.
In Section 1, we give a simple proof of Theorem 2, based on the construction of an explicit coupling in dimension and on the dual formulation of the relative entropy functional. As already explained, Theorem 2 can also be seen as a consequence of Theorem 3. However, the proof is very simple and it seemed to us that it nicely illustrates the power of the transport techniques in discrete and therefore it is worth a separate presentation. Then we show how to recover a significant part of the classical Prékopa-Leindler inequality from Theorem 2, passing from the discrete to the continuous by means of the Central Limit Theorem.
In Section 2, we prove a stronger entropic version of Theorem 3, namely Theorem 8, based on the one-dimensional monotone rearrangement coupling. We also show how to fully recover the Prékopa-Leindler inequality starting from Theorem 3, again passing from discrete to continuous, but here using instead that the mesh size of the grid shrinks to [math].
Finally, Section 3 is devoted to curved versions of Theorem 3 applying to probability measures with a log-concave probability mass function.
1. The Four Functions theorem
1.1. A transport proof of the Four Functions Theorem
In the following, we prove the Four Functions Theorem using transport ingredients and a duality formula.
We will use the following notations. The set of all probability measures on will be denoted by and the set of functions on by . For all and , the function is defined by
[TABLE]
For convenience, we restate the Ahlswede-Daykin Theorem with an additive hypothesis (which corresponds to Theorem 2 with , , and ).
Theorem 4**.**
Let . Suppose that , , , are such that
[TABLE]
Then
[TABLE]
Recall the following duality formula involving the relative entropy functional. Let be the uniform measure on and define for all probability measures on
[TABLE]
Then, for any function , it holds
[TABLE]
In the proof of Theorem 4 we will also use the following coupling lemma whose proof is elementary. We recall that if are two probability measures on a measurable space , a coupling of and (in that order) is a probability measure on the product space having as first marginal and as second marginal, that is to say such that
[TABLE]
for all Recall also that that if is a probability measure on and a measurable map taking values in another measurable space , then the image of under the map (or push forward of under the map ) is the probability measure denoted by defined as , .
Lemma 5**.**
Let and set .
if then there exists a (unique) coupling of and such that is also a coupling of and . Moreover in this case and , , and .
if then there exists a (unique) coupling of and such that is a coupling of . Moreover , , and .
Remark 6**.**
The coupling in (resp. ) is nothing but the non-decreasing (non-increasing) rearrangement coupling.
The above lemma is very much one-dimensional. In fact, it is easy to construct examples of measures , for , such that there does not exist any coupling of and such that (with that acts coordinate by coordinate) is a coupling of and or a coupling of and .
Proof.
We will first prove Item . In the following diagram we represent the couplings on the left, and on the right, with their marginals.
[TABLE]
Once one observes that necessarily (since there do not exist with and ), and and , then all the values of and can be deduced from the marginals (details are left to the reader). A similar reasoning leads to the conclusion of Item . The uniqueness part is obvious from the construction. ∎
Proof of Theorem 4.
The proof goes by induction on . We will prove the base case towards the end of the proof. Assume first that the result holds on . Then choose four functions satisfying
[TABLE]
Fix ; applying Condition (5) to and we get that
[TABLE]
which is precisely the condition of the theorem in dimension for the four functions and . Applying the induction hypothesis we conclude that
[TABLE]
The latter holds for all . Hence, if we set , for , we have
[TABLE]
Now applying the result on , we conclude that
[TABLE]
This leads to the desired conclusion since, by construction, for all it holds .
Hence, in order to conclude the proof we need to prove the theorem on . To that purpose, fix four functions satisfying Condition (5) (with ) and let . Let us show that
[TABLE]
First assume that . Thanks to Item of Lemma 5 above, there exists a coupling of and such that the coupling defined as the push forward of under the map is still a coupling of and . It follows from the very definition of the coupling, from Condition (5), and by definition of the push-forward, that
[TABLE]
Therefore, by (4),
[TABLE]
which proves (6) in this case.
Now, if , then according to Item of Lemma 5, there exists a coupling of and such that the probability is now a coupling of and (in that order). Therefore, reasoning exactly as in (7), one gets , from which one concludes that (6) holds also in this case.
Finally, taking the supremum over and in (6) gives , thanks to (4),
[TABLE]
and completes the proof of Theorem 4. ∎
A careful reading of the proof of Theorem 4 actually leads to a slightly more general result that we now describe. Consider a functional on and assume that it can be written as follows
[TABLE]
where is a given function. Then, we define by induction a sequence of functions on as follows: and for all ,
[TABLE]
where we recall that for all and , the function is defined by , .
Following the exact same proof of Theorem 4 (details of which are left to the reader), we can conclude that, if , , , are such that
[TABLE]
then
[TABLE]
This is a generalization of Theorem 4 since the relative entropy leads to by (4), and therefore, by a straightforward induction, to . However, we could not find any other explicit example of functional and of real interest. One of the reasons can be found in Hardy, Littlewood and Polya [14, Chapter 3]. Indeed, studying the generalized mean , these authors prove that, under some mild assumptions, it must be that for some constants , leading back to the previous example.
Another natural example may be given by for all expect one measure, say , for which . Then, and therefore , where is the -fold product of , i.e. . In that case, the conclusion above is nontrivial though being a consequence of the classical conclusion of the four functions theorem (by considering in the limit ).
A further generalization may be as follows. Let denote a semi-continuous, strictly convex function satisfying and . Then, given , we set , if is absolutely continuous with respect to with density , and otherwise. With such a definition, the special choice amounts to . Furthermore, since , by Jensen’s inequality for all . Also, for any and , set which generalizes (4). For such ’s, as proved in [12, Proposition 2.9], it holds
[TABLE]
and
[TABLE]
with , . For instance, the choice , leads to if and otherwise. At the price of multiplying by a constant, we can assume that so that is explicit so that one can, at least theoretically, express in this case.
1.2. From the Four Function Theorem to the Prékopa-Leindler Inequality
Using the Four Functions Theorem, we shall prove the following weak version of the Prékopa-Leindler Inequality. We state and prove the result in dimension one, for simplicity, but it holds in any dimension with no extra complication besides presentation.
Proposition 7**.**
Let be three continuous functions satisfying
[TABLE]
Assume furthermore that is convex and bounded from below. Then, it holds
[TABLE]
It should be noticed that equality cases are known in the Prékopa-Leindler inequality [8] and correspond to choosing precisely convex, and and proper translation and dilation of . Of course, the extra assumptions of continuity of and lower boundedness of could be removed via standard approximation arguments, but we refrain from further discussion, since it does not seem possible to remove the convexity assumption on and to recover the full conclusion of Theorem 1.
Proof.
Let be continuous functions satisfying
[TABLE]
with convex and bounded from below. First let us assume that and are bounded from above. For any , define the following three functions on : for , set
[TABLE]
Then we observe that, for any , coordinate-wise
[TABLE]
Hence, the condition satisfied by and transfers to and as follows: for all ,
[TABLE]
where the last inequality follows from the convexity of . Let be a constant such that , and . Then, it holds
[TABLE]
In other words , and satisfy the condition of the Four Functions Theorem (with ) so that, denoting by the uniform probability measure on ,
[TABLE]
Applying the Central Limit Theorem, one gets
[TABLE]
where denotes the Standard Gaussian probability measure on Replacing by , and , where , one easily gets
[TABLE]
Letting , the monotone convergence theorem gives the desired inequality. Finally, one can easily remove the upper boundedness assumption on by truncation and monotone convergence. ∎
2. Klartag-Lehec Prékopa-Leindler inequality on
2.1. From Klartag-Lehec Inequality to the Four Functions Theorem
To make clear the connection with the preceding section, let us first remark that Theorem 3 implies the one dimensional version of the Four Functions Theorem (and thus the result in all dimensions by tensorization).
Indeed let be four non-negative functions on satisfying the hypothesis of the Four Functions Theorem, namely for any ,
[TABLE]
Setting for any
[TABLE]
and similarly , one may easily check that that for any
[TABLE]
Therefore applying Theorem 3 we get the conclusion of the Four Functions Theorem,
[TABLE]
2.2. Transport proof of the Klartag-Lehec Inequality
Our goal is now to establish the following entropic version of Klartag-Lehec Inequality which is actually stronger than Theorem 3. In what follows, we recall that the monotone coupling between two probability measures and on is defined by
[TABLE]
where is a random variable uniformly distributed on and where for all , , is the cumulative distribution of and , , is the generalized inverse of
Theorem 8** (displacement convexity of entropy).**
Suppose that are two probability measures on with compact supports. Define (recall the definition of the push forward right before Lemma 5)
[TABLE]
where is the monotone coupling between and , and for all ,
[TABLE]
Then, denoting by the counting measure on , it holds
[TABLE]
Before turning to the proof of Theorem 8, let us first recall how to recover Theorem 3 from Theorem 8.
Proof of Theorem 3.
The proof uses (again) the dual expression of the log-Laplace transform of any bounded function :
[TABLE]
where the supremum runs over all probability measures on with bounded support. Let be four non-negative functions satisfying (2). Given and setting , one may simply check that equivalently for all ,
[TABLE]
Integrating this inequality with respect to the monotone coupling of two probability measures on with bounded support and implies
[TABLE]
Therefore, applying Inequality (9) of Theorem 8 implies
[TABLE]
where the last inequality is a consequence of Identity (10). Then optimizing over all probability measures with bounded support and , and using again (10) one gets
[TABLE]
The conclusion of Theorem 3 follows by monotone convergence as goes to 0 and goes to infinity. ∎
Now we turn to the proof of Theorem 8 which in turn is a consequence of the following result of independent interest.
Theorem 9**.**
With the same notation as in Theorem 8, it holds
[TABLE]
Proof of Theorem 8.
The logarithm function being concave one gets by Jensen’s inequality, thanks to (11),
[TABLE]
Now observe that, by definition of , and ,
[TABLE]
completing the proof. ∎
In the proof of Theorem 9 we will make repeated use of the following elementary lemma:
Lemma 10**.**
- (1)
*Let be such that with and . Then if and only if and . *
In this case, . 2. (2)
Let be such that , , and with .
- •
If , .
- •
If , if and only if with .
The following figures illustrate the next lemma.
x_{1}$$x_{2}$$y_{1}=y_{2}item (1)= \lfloor\frac{x_{1}+y_{1}}{2}\rfloor=\lfloor\frac{x_{2}+y_{2}}{2}\rfloor$$y_{2}-y_{1}+x_{2}-x_{1}=1,\;\;\frac{x_{1}+y_{1}}{2}\in\mathbb{Z}$$x$$\frac{x+y}{2}$$y$$x_{1}=x_{2}$$y_{1}$$y_{2}
a$$a^{\prime}$$x_{1}$$x_{2}$$y_{1}=y_{2}item (2), a^{\prime}=a+1$$a$$a=\lfloor\frac{x_{1}+y_{1}}{2}\rfloor,\quad a^{\prime}=\lfloor\frac{x_{2}+y_{2}}{2}\rfloor$$a^{\prime}$$y_{2}-y_{1}+x_{2}-x_{1}=1,\;\frac{x_{1}+y_{1}}{2}\in\mathbb{Z}+\frac{1}{2}$$x$$\frac{x+y}{2}$$y$$x_{1}=x_{2}$$y_{1}$$y_{2}
Proof of Lemma 10.
(1) If , then and thus Hence Without loss of generality one can assume that and . But in this case, . The fact that then implies that The converse is obvious. In this case .
(2) If , then
[TABLE]
Now let us assume that . If , then and so Therefore and so . The condition then implies that . Then it holds The converse is obvious. ∎
Before proving Theorem 9 let us introduce some notation. We will denote
[TABLE]
and for all ,
[TABLE]
(with thus when ).
Lemma 11**.**
For any , .
Proof of Lemma 11.
Let . By compactness of the support of , the set is finite. Suppose that . Let be the minimal first coordinate of the elements of , and let be the minimal second coordinate of the elements of having as first coordinate. If is another element of , then either and , or and in this case, by monotonicity of the support of , one has . According to Item (1) of Lemma 10, one has and ( and ) or ( and ). By monotonicity of the support of , these two cases exclude each other and so ∎
For , we will denote by the set of such that . If , the unique element of will be denoted by . If , we will denote by and the two elements of , with the convention that and and as in Lemma 10 and the proof above.
Proof of Theorem 9.
Using the notation above, we need to show that the following quantity is less than or equal to .
[TABLE]
The strategy to bound by 1 is to show that, in fact,
[TABLE]
For that purpose we consider two cases.
First case. Let be such that
[TABLE]
Then let us show that for all , it holds
[TABLE]
We distinguish between two sub-cases, and . Suppose first that . Then and therefore
[TABLE]
Moreover, since satisfies (13), Item 2 of Lemma 10 gives that
[TABLE]
Since , this gives (14). Now let us assume that . Then one can assume without loss of generality that with and thus In this case,
[TABLE]
and reasoning as above
[TABLE]
which establish (14).
Second case. Let and such that for all and such that and (i.e. is maximal). Since , the only possibility is that for all (this set being empty if ). Let us assume that and (the other cases are dealt similarly). Let us denote and (recall that by definition and ). According to Lemma 10, it holds and .
Let us introduce
[TABLE]
and let us show that
[TABLE]
We will use the following facts:
- •
Fact 1 : For all it holds
[TABLE]
- •
Fact 2 : For all
[TABLE]
and and .
Observe that
[TABLE]
where, for ,
[TABLE]
and
[TABLE]
According to Fact 2, in order to prove (15), it is enough to show that for all
For , one can assume without loss of generality that . Then, according to Fact 1, and since , it follows that . The case is similar.
Now let us consider the case . Observe that either either . Without loss of generality, one can assume that (the case follows by symmetry in and ), so that
[TABLE]
Let us consider the following subcases:
- (a)
If , then are pairwise distinct.
::\,\,a_{0}+i$$x$$\frac{x+y}{2}$$y$$x_{0}^{i-1}\!\!\!\!\!=x_{1}^{i-1}\!\!\!\!\!=x_{0}^{i}=x_{1}^{i}$$y_{0}^{i-1}$$y_{1}^{i-1}$$y_{0}^{i}$$y_{1}^{i}
Since and , by using Fact 1, one gets
[TABLE]
The last inequality holds since and are pairwise distinct.
- (b)
If , then necessarily .
x$$\frac{x+y}{2}$$y$$x_{0}^{i-1}$$x_{1}^{i-1}\!\!\!\!\!=x_{0}^{i}=x_{1}^{i}$$y_{0}^{i-1}=y_{1}^{i-1}$$y_{0}^{i}$$y_{1}^{i}
Using Fact 1, one gets . Since and one gets
[TABLE]
- (c)
If , then necessarily .
x$$\frac{x+y}{2}$$y$$x_{0}^{i-1}\!\!\!\!\!=x_{1}^{i-1}\!\!\!\!\!=x_{0}^{i}$$x_{1}^{i}$$y_{0}^{i-1}$$y_{1}^{i-1}$$y_{0}^{i}=y_{1}^{i}
Using Fact 1, one gets . Since and , it follows that
[TABLE]
- (d)
If , , then necessarily and .
x$$\frac{x+y}{2}$$y$$x_{0}^{i-1}$$x_{1}^{i-1}\!\!\!\!\!\!\!=x_{0}^{i}$$x_{1}^{i}$$y_{0}^{i-1}\!\!\!\!\!=y_{1}^{i-1}$$y_{0}^{i}=y_{1}^{i}
Reasoning as in the preceding cases, one gets , and so
[TABLE]
Conclusion : by considering successively the elements in increasing order, case 1 can be repeated successively several times and we may pass from case 1 to case 2 or from case 2 to case 1. Therefore after a finite use of cases 1 and 2 described above, (14) and (15) imply (12). This concludes the proof of Theorem 9. ∎
2.3. From the Klartag-Lehec Inequality to the Prékopa-Leindler Inequality
First, let us explain how to recover the conclusion of Theorem 1 for and continuous functions using Theorem 3. More precisely we are going to show that if are continuous functions such that
[TABLE]
then
[TABLE]
Then taking in particular gives the conclusion of Theorem 1 for .
Proof of (16).
Let and for all positive integer consider the grid , . Define as follows :
[TABLE]
[TABLE]
If then, there is some such that
[TABLE]
and so . Similarly, . Therefore, for all ,
[TABLE]
The functions thus satisfy the assumption of Theorem 3 and so
[TABLE]
By uniform continuity of on , multiplying both sides by and letting , it follows that
[TABLE]
Finally, letting gives (16). ∎
2.4. Displacement convexity of entropy : from discrete to continuous
In the same vein as in the previous sub-section, one can deduce from Theorem 8, the following well-known continuous version of the displacement convexity of the relative entropy with respect to Lebesgue measure.
Theorem 12**.**
Let be probability measures on with compact supports and define as the law of , where is distributed according to the monotone rearrangement coupling between and . Then it holds
[TABLE]
Proof.
Without loss of generality, one can assume that . Consider distributed according to and define, for , and and . The coupling is easily seen to be monotone. Since Theorem 8 immediately extends to probability measures on , one gets
[TABLE]
where is the counting measure on and
[TABLE]
Assuming that , where is an integer and denoting by the probability measure , (17) is equivalent to
[TABLE]
Let be the uniform (continuous) distribution on On the one hand, for
[TABLE]
where the inequality comes from Jensen’s inequality applied to the convex function On the other hand, it is easy to see that and both weakly converge to (this comes from the almost sure convergence of the underlying random variables) and that weakly converges to . Therefore, by lower semicontinuity of for the weak convergence topology, one concludes that
[TABLE]
which proves the claim. ∎
3. Inequalities with curvature terms for log-concave distributions.
Finally, let us show how to derive from Theorem 3 other versions adapted to log-concave probability measures. The following result is a straightforward restatement of Theorem 3.
Corollary 13**.**
*Let be a probability measure on such that for all .
If are such that*
[TABLE]
where
[TABLE]
then it holds
[TABLE]
Proof.
Simply note that the functions , , and , , satisfy the assumptions of Theorem 3. ∎
Note that the cost function always satisfies
[TABLE]
Let us introduce the optimal transport cost associated to this cost function :
[TABLE]
with the set of probability measures on such that the first marginal of is and the second is
Corollary 14**.**
Let be a probability measure on such that for all . Then satisfies the following transport-entropy inequality : for all probability measures on ,
[TABLE]
Proof.
Let be such that
[TABLE]
Then according to Corollary 13 applied to , and , it holds
[TABLE]
This is the dual form of (19). ∎
The preceding corollary is the most interesting when the cost function is non-negative. A natural condition ensuring non-negativity of is the log-concavity of . We recall that a probability measure on is log-concave if it is such that
[TABLE]
If one defines, for any , as the linear interpolation between and , then it is easy to check that is log-concave if and only if the function is concave on .
Lemma 15**.**
Suppose that is log-concave on and such that for all , then for all
Proof.
Without loss of generality, one can assume that . If , with , then we have to show that . With the notation introduced above, this inequality is equivalent to which follows immediately from the concavity of . If , then the inequality is equivalent to which again follows from the concavity of . ∎
As an illustration, we end this section with the computation of the cost for two specific examples of probability measures on . Consider first the double-sided geometric-type measures , , where is the normalization constant. Then, an easy computation leads to . While for (with again the normalization constant), we get . There is essentially no gain in the first case, which corresponds to a flat situation, while the second example resembles the continuous setting with strictly convex potential for which -calculus applies (see [2, 3, 30]).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Ahlswede and D. E. Daykin, An inequality for the weights of two families of sets, their unions and intersections , Z. Wahrsch. Verw. Gebiete 43 (1978), no. 3, 183–185. MR 0491189
- 2[2] C. Ané, S. Blachère, D. Chafaï, P. Fougères, I. Gentil, F. Malrieu, C. Roberto, and G. Scheffer, Sur les inégalités de Sobolev logarithmiques , Panoramas et Synthèses [Panoramas and Syntheses], vol. 10, Société Mathématique de France, Paris, 2000, With a preface by Dominique Bakry and Michel Ledoux. MR MR 1845806 (2002 g:46132)
- 3[3] D. Bakry, I. Gentil, and M. Ledoux, Analysis and geometry of Markov diffusion operators , Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 348, Springer, Cham, 2014.
- 4[4] S. G. Bobkov and M. Ledoux, From Brunn-Minkowski to Brascamp-Lieb and to logarithmic Sobolev inequalities , Geom. Funct. Anal. 10 (2000), no. 5, 1028–1052. MR 1800062
- 5[5] A.I. Bonciocat and K.T. Sturm, Mass transportation and rough curvature bounds for discrete spaces , J. Funct. Anal. 256 (2009), no. 9, 2944–2966.
- 6[6] D. Cordero-Erausquin, R. J. Mc Cann, and M. Schmuckenschläger, A Riemannian interpolation inequality à la Borell, Brascamp and Lieb , Invent. Math. 146 (2001), no. 2, 219–257. MR 1865396
- 7[7] by same author, Prékopa-Leindler type inequalities on Riemannian manifolds, Jacobi fields, and optimal transport , Ann. Fac. Sci. Toulouse Math. (6) 15 (2006), no. 4, 613–635. MR 2295207
- 8[8] S. Dubuc, Critères de convexité et inégalités intégrales , Ann. Inst. Fourier (Grenoble) 27 (1977), no. 1, x, 135–165. MR 0444863
