Further investigations of R\'enyi entropy power inequalities and an entropic characterization of s-concave densities
Jiange Li, Arnaud Marsiglietti, James Melbourne

TL;DR
This paper explores the role of convexity in Renyi entropy power inequalities, showing their validity for s-concave densities and providing new entropic characterizations of such densities.
Contribution
It demonstrates the failure of Renyi entropy power inequalities for certain parameters and establishes their validity for s-concave densities, along with new convergence and characterization results.
Findings
Renyi entropy power inequality fails for r in (0,1) in general
s-concave densities satisfy Renyi entropy power inequalities
Convergence of Renyi entropies in CLT for specific density classes
Abstract
We investigate the role of convexity in R\'enyi entropy power inequalities. After proving that a general R\'enyi entropy power inequality in the style of Bobkov-Chistyakov (2015) fails when the R\'enyi parameter , we show that random vectors with -concave densities do satisfy such a R\'enyi entropy power inequality. Along the way, we establish the convergence in the Central Limit Theorem for R\'enyi entropies of order for log-concave densities and for compactly supported, spherically symmetric and unimodal densities, complementing a celebrated result of Barron (1986). Additionally, we give an entropic characterization of the class of -concave densities, which extends a classical result of Cover and Zhang (1994).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDiffusion and Search Dynamics · Point processes and geometric inequalities · Sparse and Compressive Sensing Techniques
Further investigations of Rényi entropy power inequalities and an entropic characterization of s-concave densities
Jiange Li, Arnaud Marsiglietti, James Melbourne
Abstract
We investigate the role of convexity in Rényi entropy power inequalities. After proving that a general Rényi entropy power inequality in the style of Bobkov-Chistyakov (2015) fails when the Rényi parameter , we show that random vectors with -concave densities do satisfy such a Rényi entropy power inequality. Along the way, we establish the convergence in the Central Limit Theorem for Rényi entropies of order for log-concave densities and for compactly supported, spherically symmetric and unimodal densities, complementing a celebrated result of Barron (1986). Additionally, we give an entropic characterization of the class of -concave densities, which extends a classical result of Cover and Zhang (1994).
1 Introduction
Let be a random vector in . Suppose that has the density with respect to the Lebesgue measure. For , the Rényi entropy of order (or simply, -Rényi entropy) is defined as
[TABLE]
For , the -Rényi entropy can be extended continuously such that the RHS of (1) is for ; for ; and for . The case corresponds to the classical Shannon differential entropy. Here, we denote by the Lebesgue measure of the support of , and represents the essential supremum of . The -Rényi entropy power is defined by
[TABLE]
In the following, we drop the subscript when .
The classical Entropy Power Inequality (henceforth, EPI) of Shannon [39] and Stam [41], states that the entropy power is super-additive on the sum of independent random vectors. There has been recent success in obtaining extensions of the EPI from the Shannon differential entropy to -Rényi entropy. In [9, 10], Bobkov and Chistyakov showed that, at the expense of an absolute constant , the following Rényi EPI of order holds
[TABLE]
Ram and Sason soon after gave a sharpened constant depending on the number of summands [36]. Madiman, Melbourne, and Xu sharpened constants in the case by identifying extremizers in [31, 32]. Savaré and Toscani [38] showed that a modified Rényi entropy power was concave along the solution of a nonlinear heat equation, which generalizes Costa’s concavity of entropy power [19]. Bobkov and Marsiglietti [11] proved the following variant of Rényi EPI
[TABLE]
for and some exponent only depending on . It is clear that (3) holds for more than two summands. Improvement of the exponent was given by Li [27].
One of our goals is to establish analogues of (2) and (3) when the Rényi parameter . Both (2) and (3) can be derived from Young’s convolution inequality in conjunction with the entropic comparison inequality for any . The latter is an immediate consequence of Jensen’s inequality. When the Rényi parameter , analogues of (2) and (3) require a converse of the entropic comparison inequality aforementioned. This technical issue prevents a general Rényi EPI of order for generic random vectors. Our first result shows that a general Rényi EPI of the form (2) indeed fails for all .
Theorem 1**.**
For any and , there exist independent random vectors in , for some and , such that
[TABLE]
We have an explicit construction of such random vectors. They are essentially truncations of some spherically symmetric random vectors with finite covariance matrices and infinite Rényi entropies of order . The key point is the convergence along the Central Limit Theorem (henceforth, CLT) for Rényi entropies of order ; that is, the -Rényi entropy of their normalized sum converges to the -Rényi entropy of a Gaussian. This implies that, after appropriate normalization, the LHS of (4) is finite, but the RHS of (4) can be as large as possible. The entropic CLT has been studied for a long time. A celebrated result of Barron [3] shows the convergence in the CLT for Shannon entropy (see [26] for a multidimensional setting). The recent work of Bobkov and Marsiglietti [12] studies the convergence in the CLT for Rényi entropy of order for real-valued random variables (see also [7] for convergence in Rényi divergence, which is not equivalent to convergence in Rényi entropy unless ). In Section 2, we establish the analogue of [12, Theorem 1.1] in higher dimensions and we prove convergence along the CLT for Rényi entropies of order for a large class of densities.
As mentioned above, the reverse entropic comparison inequality prevents Rényi EPIs of order for generic random vectors. However, a large class of random vectors with the so-called -concave densities do satisfy such a reverse entropic comparison inequality. Our next results show that Rényi EPI of order holds for such densities. This extends the earlier work of Marsiglietti and Melbourne [33, 34] for log-concave densities (which corresponds to the case).
Let . A function is called -concave if the inequality
[TABLE]
holds for all such that and . For , the RHS of (5) is understood in the limiting sense; that is for , for , and for . The case corresponds to log-concave functions. The study of measures with -concave densities was initiated by Borell in the seminal work [13, 14]. One can think of -concave densities, in particular log-concave densities, as functional versions of convex sets. There has been a recent stream of research on a formal parallel relation between functional inequalities of -concave densities and geometric inequalities of convex sets.
Theorem 2**.**
For any and , there exists such that for all independent random vectors with -concave densities in , we have
[TABLE]
In particular, one can take
[TABLE]
where is the Hölder conjugate of .
Theorem 3**.**
Given , there exist and such that for and independent random vectors and in with -concave densities,
[TABLE]
In particular, one can take
[TABLE]
where
[TABLE]
Owing to the convexity, random vectors with -concave densities also satisfy a reverse EPI, which was first proved by Bobkov and Madiman [8]. This can be seen as the functional lifting of Milman’s well known reverse Brunn-Minkowski inequality [35]. Motivated by Busemann’s theorem [17] in convex geometry, Ball, Nayar and Tkocz [2] conjectured that the following reverse EPI
[TABLE]
holds for any symmetric log-concave random vector . The -Rényi entropy analogue was asked in [30], and the case was soon verified in [27]. It was also observed in [27] that the -Rényi entropy analogue is equivalent to the convexity of -cross-section body in convex geometry introduced by Gardner and Giannopoulos [23]. The equivalent linearization of (6) reads as follows. Let be a symmetric log-concave random vector in such that . Then for any we have
[TABLE]
Cover and Zhang [20] proved the above inequality under the stronger assumption that and have the same log-concave distribution. They also showed that this provides a characterization of log-concave distributions on the real line. The following theorem extends Cover and Zhang’s result from log-concave densities to a more general class of -concave densities. This gives an entropic characterization of -concave densities and implies a reverse Rényi EPI for random vectors with the same -concave density.
Theorem 4**.**
Let . Let be a probability density function on . For any fixed integer , the identity
[TABLE]
holds for all such that if and only if the density is -concave.
The paper is organized as follows. In Section 2, we explore the convergence along the CLT for -Rényi entropies. For , the convergence is fully characterized for densities on , while for sufficient conditions are obtained for a large class of densities. More precisely, we prove the convergence for log-concave densities and for compactly supported, spherically symmetric and unimodal densities. As an application, we prove in Section 3 that a general -Rényi EPI fails when , thus establishing Theorem 1. We also complement this result by proving Theorems 2 and 3. In the last section, we provide an entropic characterization of the class of -concave densities, and include a reverse Rényi EPI as an immediate consequence.
2 Convergence along the CLT for Rényi entropies
Let be a sequence of independent identically distributed (henceforth, i.i.d.) centered random vectors in with finite covariance matrix. We denote by the normalized sum
[TABLE]
An important tool used to prove various forms of CLT is the characteristic function. Recall that the characteristic function of a random vector is defined by
[TABLE]
Before providing sufficient conditions for the convergence along the CLT for Rényi entropy of order , we first extend [12, Theorem 1.1] to higher dimensions.
Theorem 5**.**
Let . Let be i.i.d. centered random vectors in . We denote by the density of defined in (7). The following statements are equivalent.
* as , where is a Gaussian random vector with mean [math] and the same covariance matrix as .* 2. 2.
* is finite for some integer .* 3. 3.
* for some .* 4. 4.
* has a bounded density for some integer .*
Proof.
: Assume that as . Then there exists an integer such that
[TABLE]
Since is finite, we conclude that is finite as well.
: Assume that is finite for some integer . Then has a density .
Case 1: If , we have . Using Plancherel’s identity, we have . It follows that
[TABLE]
For , we have
[TABLE]
Case 2: If , we apply the Hausdorff-Young inequality to obtain
[TABLE]
where is the conjugate of such that . Hence, for , we have
[TABLE]
: Since for some , one may apply Gnedenko’s local limit theorems (see [24]), which is valid in arbitrary dimensions (see [5]). In particular, we have
[TABLE]
where denotes the density of a Gaussian random vector with mean [math] and the same covariance matrix as . We deduce that there exists an integer and a constant such that for all .
: Since is bounded, then , and we deduce by Plancherel’s identity that for . Hence, (8) holds and there exists such that for all . Let us show that as , where denotes the density of a Gaussian random vector with mean [math] and the same covariance matrix as . By the CLT, for any , there exists such that for all large enough,
[TABLE]
which implies that
[TABLE]
The function satisfies similar inequalities. Hence, for any , there exists such that for all large enough,
[TABLE]
On the other hand, by (8), for all , the function converges everywhere to as . Since is dominated by the integrable function , one may use the Lebesgue dominated theorem to conclude that
[TABLE]
∎
Remark 6*.*
Theorem 5 fails for . For example, one can consider i.i.d. random vectors with a bounded density such that (e.g., Cauchy-type distributions). The implication (and thus ) will not hold since by Jensen inequality for all . As observed by Barron [3], the implication does not necessarily hold in the Shannon entropy case .
The following result yields a sufficient condition for convergence along the CLT to hold for Rényi entropies of order for a large class of random vectors in .
Theorem 7**.**
Let . Let be i.i.d. centered log-concave random vectors in . Then we have for all , and
[TABLE]
where is the normalized sum in (7) and is a Gaussian random vector with mean [math] and the same covariance matrix as .
Proof.
Since log-concavity is preserved under independent sum, is log-concave for all . Hence, for all , has a bounded log-concave density , which satisfies
[TABLE]
for all , and for some constants , possibly depending on the dimension (see, e.g., [16]). Hence, for all , we have
[TABLE]
We deduce that for all .
The boundedness of implies that (8) holds, and thus there exists an integer such that for all ,
[TABLE]
where is the covariance matrix of (and thus does not depend on ). Moreover, since is log-concave, one has for all that
[TABLE]
Hence, for all , we have
[TABLE]
where the last inequality follows from Markov’s inequality and the fact that
[TABLE]
Hence, for every , one may choose a positive number such that for all large enough,
[TABLE]
and hence
[TABLE]
On the other hand, from (8), we conclude as in the proof of Theorem 5 that for all ,
[TABLE]
∎
A function is called unimodal if the super-level sets are convex for all . Next, we provide a convergence result for random vectors in with unimodal densities under additional symmetry assumptions. First, we need the following stability result.
Proposition 8**.**
The class of spherically symmetric and unimodal random variables is stable under convolution.
Proof.
Let and be two spherically symmetric and unimodal densities. By assumption, satisfy that for an orthogonal map and implies . By the layer cake decomposition, we write
[TABLE]
Apply Fubini’s theorem to obtain
[TABLE]
Notice that by the spherical symmetry and decreasingness of , the super-level set
[TABLE]
is an origin symmetric ball. Thus we can write the integrand in (9) as
[TABLE]
This quantity is clearly dependent only on , giving spherical symmetry. In addition, as the convolution of two log-concave functions, is log-concave as well. It follows that for every , and we have
[TABLE]
Integrating this inequality completes the proof. ∎
Let us establish large deviation and pointwise inequalities for compactly supported, spherically symmetric and unimodal densities.
Theorem 9** (Hoeffding [25]).**
Let be independent random variables with mean 0 and bounded in , respectively. One has for all ,
[TABLE]
The following result is Hoeffding’s inequality in higher dimensions.
Lemma 10**.**
Let be centered independent random vectors in satisfying for some . One has for all that
[TABLE]
Proof.
Let be the -th coordinate of the random vector . Then we have
[TABLE]
where inequality (10) follows from the pigeon-hole principle, (11) from a union bound, and (12) follows from applying Theorem 9 to and . ∎
We deduce the following pointwise estimate for unimodal spherically symmetric and bounded random variables.
Corollary 11**.**
Let be i.i.d. random vectors with spherically symmetric unimodal density supported on the Euclidean ball for some . Let denote the density of the normalized sum . Then there exists such that for all and ,
[TABLE]
Proof.
Stating Lemma 10 in terms of , we have
[TABLE]
Since the class of spherically symmetric unimodal random variables is stable under independent summation by Proposition 8, is spherically symmetric and unimodal, so that
[TABLE]
where represents the Euclidean ball of radius centered at the origin and is the volume of the unit ball. Note that
[TABLE]
since is increasing, so that (14) follows. Now applying (13) we have
[TABLE]
and our result holds with
[TABLE]
∎
We are now ready to establish a convergence result for bounded spherically symmetric unimodal random vectors.
Theorem 12**.**
Let . Let be i.i.d. random vectors in with a spherically symmetric unimodal density with compact support. Then we have
[TABLE]
where is the normalized sum in (7) and is a Gaussian random vector with mean [math] and the same covariance matrix as .
Proof.
Let us denote by the density of . Since is bounded, one may apply (8) together with Lebesgue dominated convergence to conclude that for all ,
[TABLE]
On the other hand, by Corollary 11, one may choose such that for all ,
[TABLE]
and hence
[TABLE]
∎
3 Rényi EPIs of order
A striking difference between Rényi EPIs of orders and is the lack of an absolute constant. Indeed, it was shown in [10] that for Rényi EPI of the form (2) holds for generic independent random vectors with an absolute constant . In the following subsection, we show that such a Rényi EPI does not hold for .
3.1 Failure of a generic Rényi EPI
Definition 13**.**
For , we define as the largest number such that for all and any independent random vectors in , we have
[TABLE]
Then we can rephrase Theorem 1 as follows.
Theorem 14**.**
For , the constant defined in (15) satisfies .
The motivating observation for this line of argument is the fact that for , there exist distributions with finite covariance matrices and infinite -Rényi entropies. One might anticipate that this could contradict the existence of an -Rényi EPI, as the CLT forces the normalized sum of i.i.d. random vectors drawn from such a distribution to become “more Gaussian”. Heuristically, one anticipates that should approach for large , where is the normalized sum in (7) and is a Gaussian vector with the same covariance matrix as , while is infinite.
Proof of Theorem 14.
Let us consider the following density
[TABLE]
with and implicitly determined to make a density. Since the density is spherically symmetric, its covariance matrix can be rewritten as for some , where is the identity matrix. Computing in spherical coordinates one can check that is finite for , and we can thus define a density . What is more, when , the limiting density has a finite covariance matrix, and has finite Rényi entropy if and only if .
For fixed , we take , where guarantees the existence of such . In this case, the limiting density is well defined and it has finite covariance matrix , but the corresponding -Rényi entropy is infinite. Now we select independent random vectors from the distribution . Since is a spherically symmetric and unimodal density with compact support, we can apply Theorem 12 to conclude that
[TABLE]
where is the normalized sum in (7) and is the standard -dimensional Gaussian. Since , we can take large enough such that . Then we can take large enough such that
[TABLE]
Since the limiting density has infinite -Rényi entropy, given , we can take large enough such that
[TABLE]
Combining (16) and (17), we conclude that for inequality (15) to hold we must have
[TABLE]
for all . Then the statement follows from taking the limit . ∎
Remark 15*.*
Random vectors in our proof has identical -concave density with . In the following section, we provide a complementary result by showing that Rényi EPI of order does hold for -concave densities when .
3.2 Rényi EPIs for -concave densities
As showed above, a generic Rényi EPI of the form (2) fails for . In this part, we establish Rényi EPIs of the forms (2) and (3) for an important class of random vectors with -concave densities (see (5)).
Following Lieb [29], we prove Theorems 2 and 3 by showing their equivalent linearizations. The following linearization of (2) and (3) is due to Rioul [37]. The case was used in [27].
Theorem 16** ([37]).**
Let be independent random vectors in . The following statements are equivalent.
There exist a constant and an exponent such that
[TABLE] 2. 2.
For any such that , one has
[TABLE]
where is the discrete entropy defined as
[TABLE]
Inequality (19) is the linearized form of inequality (18). One of the ingredients used to establish (19) is Young’s sharp convolution inequality [4, 15]. Its information-theoretic formulation was given in [21], which we recall below. We denote by the Hölder conjugate of such that .
Theorem 17** ([15, 21]).**
Let . Let such that , and let be positive reals such that . For any independent random vectors in , one has
[TABLE]
The second ingredient is a comparison between Rényi entropies and . When , we have , and Jensen’s inequality implies that . In this case, one can deduce (19) from (20) with replaced by . However, when , the order of and are reversed, i.e., , and we need a reverse entropy comparison inequality. The so-called -concave densities do satisfy such a reverse entropy comparison inequality. The following result of Fradelizi, Li, and Madiman [22] serves this purpose.
Theorem 18** ([22]).**
Let . Let be an integrable -concave function. The function
[TABLE]
is log-concave for , where
[TABLE]
We deduce the following Rényi entropic comparison for random vectors with -concave densities.
Corollary 19**.**
Let be a random vector in with a -concave density. For , we have
[TABLE]
Proof.
Write . Using the log-concavity of the function in Theorem 18, we have
[TABLE]
The above inequality can be rewritten in terms of entropy power as follows
[TABLE]
The desired statement follows from taking the logarithm of both sides of the above inequality. ∎
Theorem 17 together with Corollary 19 yields the following Rényi EPI with a single Rényi parameter for -concave densities.
Theorem 20**.**
Let and . Let be independent random vectors in with -concave densities. For all such that , we have
[TABLE]
where
[TABLE]
Proof.
Let be defined by , where and are Hölder conjugates of and , respectively. Combining Theorem 17 with Corollary 19, we have
[TABLE]
Notice that , where is given in (21) and . Thus,
[TABLE]
Using the identities and , we have
[TABLE]
The last identity follows from . Using (24) and (23), the RHS of (22) can be written as
[TABLE]
This concludes the proof. ∎
Having Theorems 16 and 20 at hand, we are ready to prove Theorems 2 and 3.
3.2.1 Proof of Theorem 2
Put Theorems 16 and 20 together. Then it suffices to find such that the following inequality
[TABLE]
holds for all such that . Hence, we can set
[TABLE]
where the infimum runs over all such that . For fixed , both and are sum of one-dimensional convex functions of the form . Furthermore, both and are permutation invariant. Hence, the minimum is achieved at . This yields the numerical value of in Theorem 2.
3.2.2 Proof of Theorem 3
The following lemma in [33] serves us in the proof of Theorem 3.
Lemma 21** ([33]).**
Let . Let be twice differentiable on , continuous on , such that and . Let us also assume that for , that is strictly increasing, and that is strictly decreasing. Then increasing on implies that is increasing on as well. In particular,
[TABLE]
Proof of Theorem 3.
Apply Theorems 16 and 20 with . Then it suffices to find such that for all we have
[TABLE]
where
[TABLE]
We can set
[TABLE]
We will show that the optimal value is achieved at . Since the function is symmetric about , it suffices to show that
[TABLE]
is increasing on . It has been shown in [27] that is increasing on . We will show that for each the function is also increasing on . One can check that and satisfy the conditions in Lemma 21. Hence, it suffices to show that is increasing on . Elementary calculation yields that
[TABLE]
Define and . Then one can check that
[TABLE]
Hence, we have
[TABLE]
where
[TABLE]
Since , it suffices to show that is increasing on . We rewrite as follows
[TABLE]
where
[TABLE]
We will show that both and are increasing on .
Now let us focus on . Since , one can check that
[TABLE]
Let us denote
[TABLE]
The condition implies that . With these notations, we have
[TABLE]
The last identity follows from
[TABLE]
Since and , it suffices to show that
[TABLE]
[TABLE]
Then the desired statement follows from that and . We conclude that is increasing on .
It remains to show that is increasing on . Recall the definition of in (27), one can check that
[TABLE]
where and are defined in (28) and (29), and
[TABLE]
Since
[TABLE]
it suffices to show that for . Using the identity
[TABLE]
one can check that
[TABLE]
where
[TABLE]
Notice that , which implies that is a constant. Since , we have
[TABLE]
Hence, , i.e., is decreasing. Therefore, since when , we have
[TABLE]
It suffices to have
[TABLE]
which is equivalent to
[TABLE]
This finishes the proof that every is also increasing on . Then the numerical value of in Theorem 3 follows from setting in (25). ∎
Remark 22*.*
Our optimization argument heavily relies on the fact that and are monotonically increasing for . As observed in [27], the monotonicity of does not depend on the value of . Numerical examples show that , even the whole quantity in (26), is not monotone when is small. This is one of the reasons for the restriction .
Remark 23*.*
Note that the condition of Theorem 18 can be rewritten as
[TABLE]
We do not know whether Theorem 3 holds when
[TABLE]
4 An entropic characterization of -concave densities
Let and be real-valued random variables (possibly dependent) with the identical density . Cover and Zhang [20] proved that
[TABLE]
holds for every coupling of and if and only if is log-concave. This yields an entropic characterization of one-dimensional log-concave densities. We will extend Cover and Zhang’s result to Rényi entropies of random vectors with -concave densities (defined in (5)), which particularly include log-concave densities as a special case. This was previously proved in [28] when is continuous.
Firstly, we introduce some classical variations of convexity and concavity which will be needed in our proof.
Definition 24**.**
Let be fixed. A function with convex support is called almost -convex if the following inequality
[TABLE]
holds for almost every pair in the domain of . We say that is -convex if the above inequality holds for every pair in the domain of . Particularly, for , it is usually called mid-convex or Jensen convex. We say that is convex if is -convex for any .
One can define almost -concavity, -concavity and concavity by reversing inequality (30). Adamek [1, Theorem 1] showed that an almost -convex function is identical to a -convex function except on a set of Lebesgue measure 0. (To apply the theorem there, one can take the ideals and as the family of sets with Lebesgue measure 0 in and , respectively). In general, -convexity is not equivalent to convexity, as it is not a strong enough notion to imply continuity, at least not in a logical framework that accepts the axiom of choice. Indeed, counterexamples can be constructed using a Hamel basis for as a vector space over . However, in the case that is Lebesgue measurable, a classical result of Blumberg [6] and Sierpinski [40] (see also [18] in more general setting) shows that -convexity implies continuity, and thus convexity.
Theorem 25**.**
Let and we define . Let be a probability density on . The following statements are equivalent.
The density is -concave. 2. 2.
For any , we have for any random vectors and with the identical density . 3. 3.
We have for any random vectors and with the identical density .
Proof.
We only prove the statement for , or equivalently . The proof for , or equivalently , is similar and sketched below.
: The proof is taken from [28]. We include it for completeness. Let be the density of . Then we have
[TABLE]
This is equivalent to the desired statement. Identity (31) follows from the assumption that and have the same distribution. In inequality (4), we use the concavity of and the fact that is decreasing when . Inequality (4) follows from Hölder’s inequality and the fact that is decreasing when . For , the statement follows from the same argument in conjunction with the convexity of , the converse of Hölder’s inequality and the fact that is increasing when .
: Obvious by taking .
: We will prove the statement by contradiction. We first show an example borrowed from Cover and Zhang [20] to illustrate the “mass transferring” argument used in our proof. Consider the density in the intervals and . It is clear that is not -concave. The joint distribution of with is supported on the diagonal line . The Radon-Nikodym derivative with respect to the one-dimensional Lebesgue measure on the line exists and is shown in Figure 1. We remove some “mass” from the diagonal line to the lines and . The new Radon-Nikodym derivative is shown in Figure 2. Let be a pair of random variables whose joint distribution possesses this new Radon-Nikodym derivative. It is easy to see that and still have the same density . But is uniformly distributed on , and thus . One can check that .
Now we turn to the general case. Suppose that is not -concave, i.e., is not concave (for ). We claim that there exists a set of positive Lebesgue measure on such that the inequality
[TABLE]
holds for all . Otherwise, the converse of (34) holds for almost every pair , and thus is an almost mid-concave function (i.e., 1/2-concave). By Theorem 1 in [1], is identical to a mid-concave function except on a set of Lebesgue measure 0. Without changing the distribution, we can modify such that is mid-concave. Using the equivalence of mid-concavity and concavity (under the Lebesgue measurability), after modification, is concave, i.e., is -concave. This contradicts our assumption. Hence, there exists such a set with positive Lebesgue measure on . Then there exists such that (34) holds for a set of with positive Lebesgue measure on . We rephrase this statement in a form suitable for our purpose. There is such that the set
[TABLE]
has positive Lebesgue measure on . For , we denote by a ball of radius whose intersection with has positive Lebesgue measure on . Consider such that , where and have the identical density . Let be the Radon-Nikodym derivative of with respect to the -dimensional Lebesgue measure on the “diagonal line” . Now we build a new density by translating a small amount of “mass” from “diagonal points” and to “off-diagonal points” and . To be more precise, we define the new joint density as
[TABLE]
where and is the indicator function of the set . The function is supported on the “diagonal line” and “off-diagonal segments” and , which are disjoint for sufficiently small . (This is similar to Figure 2). When is small enough, is non-negative everywhere. Furthermore, our construction preserves the “total mass”. Hence, the function is indeed a probability density with respect to the -dimensional Lebesgue measure on the “diagonal line” and two “off-diagonal segments”. Let be a pair with the joint density . The marginals and have the same distribution as that of , since the “positive mass” on “off-diagonal points” complements the “mass deficit” on “diagonal points” when we project in the and directions. We claim that has larger entropy than . One can check that the density of is
[TABLE]
Let denote the union of , and . Then we have
[TABLE]
Since , for small enough, is the union of disjoint translates of . When is sufficiently small, we have
[TABLE]
where inequality (37) follows from the observation that for (see (35)) the derivative of the integrand at is
[TABLE]
Since , (36) together with (38) implies that
[TABLE]
This is contradictory to our assumption. Hence, has to be -concave. For , we redefine the set by reversing inequality (35), and inequality (37) will be also reversed. We will arrive at the same conclusion. ∎
Remark 26*.*
The proof of is an immediate consequence of Theorem 3.36 in [30]. The theorem there draws heavily on the ideas of [42], where a related study, deriving the Schur convexity of Rényi entropies under the assumption of exchangeability and -concavity of the random variables, generalizing Yu’s results in [43] on the entropies of sums of i.i.d. log-concave random variables. Although we state Theorem 25 for two random vectors, the argument also works for more than two random vectors. Hence, it implies the seemingly stronger Theorem 4.
As an immediate consequence of Theorem 25, we have the following reverse Rényi EPI for random vectors with the same distribution.
Corollary 27**.**
Let and let . Let and be (possibly dependent) random vectors in with the same density being -concave. Then we have
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Adamek. Almost λ 𝜆 \lambda -convex and almost Wright-convex functions. Mathematica Slovaca , 53(1):67–73, 2003.
- 2[2] K. Ball, P. Nayar and T. Tkocz. A reverse entropy power inequality for log-concave random vectors. Studia Math , 235(1):17–30, 2016.
- 3[3] A. R. Barron. Entropy and the central limit theorem. Ann. Probab. , 14:336–342, 1986.
- 4[4] W. Beckner. Inequalities in Fourier analysis. Ann. of Math. (2) , 102(1):159–182, 1975.
- 5[5] R. N. Bhattacharya and R. Ranga Rao. Normal approximation and asymptotic expansions. John Wiley & Sons, Inc. Also: Soc. for Industrial and Appl. Math., Philadelphia, 2010 , 1976.
- 6[6] H. Blumberg, On convex functions. Trans. Amer. Math. Soc. , 20, 40–44, 1919.
- 7[7] S. G. Bobkov, G. P. Chistyakov and F. Götze. Rényi divergence and the central limit theorem. Ann. Probab. 47 (2019), no. 1, pp. 270-323.
- 8[8] S. Bobkov and M. Madiman. Reverse Brunn-Minkowski and reverse entropy power inequalities for convex measures. J. Funct. Anal. , 262:3309–3339, 2012.
