Centered Sobolev inequality and exponential convergence in $\Phi$-entropy
Lingyan Cheng, Liming Wu

TL;DR
This paper establishes a link between Sobolev inequalities and exponential convergence of Markov diffusion semigroups in -entropy, providing estimates and perturbation results, with specific insights in the one-dimensional case.
Contribution
It demonstrates the equivalence between Sobolev inequalities and exponential convergence in -entropy, offering new estimates and perturbation results, especially in one dimension.
Findings
Sobolev inequality is equivalent to exponential convergence in -entropy.
Provides estimates of convergence in total variation.
Derives two-sided bounds of the Sobolev constant via Hardy inequality in 1D.
Abstract
In this short paper we find that the Sobolev inequality () is equivalent to the exponential convergence of the Markov diffusion semigroup to the invariant measure , in some -entropy. We provide the estimate of the exponential convergence in total variation and a bounded perturbation result under the Sobolev inequality. Finally in the one-dimensional case we get some two-sided estimates of the Sobolev constant by means of the generalized Hardy inequality.
Click any figure to enlarge with its caption.
Figure 1Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNonlinear Partial Differential Equations · Numerical methods in inverse problems · Geometric Analysis and Curvature Flows
Centered Sobolev inequality and exponential convergence in -entropy
Lingyan Cheng
Lingyan Cheng. Institute of Applied Mathematics, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, 100190, Beijing China.
and
Liming Wu
Liming Wu. Laboratoire de Mathématiques Appliquées, CNRS-UMR 6620, Université Blaise Pascal, 63177 Aubière, France.
Abstract.
In this short paper we find that the Sobolev inequality
[TABLE]
() is equivalent to the exponential convergence of the Markov diffusion semigroup to the invariant measure , in some -entropy. We provide the estimate of the exponential convergence in total variation and a bounded perturbation result under the Sobolev inequality. Finally in the one-dimensional case we get some two-sided estimates of the Sobolev constant by means of the generalized Hardy inequality.
MSC 2010 : 26D10. 60J60. 47J20.
Keywords: Sobolev inequlity, diffusion process, -entropy, exponential convergence.
1. Introduction
1.1. Centered Sobolev inequality
Let be a probability measure on some Polish space equipped with the Borel -field . The main object of this paper is the following centered version of Sobolev inequality
[TABLE]
where , is a conservative Dirichlet form on with domain and is the best constant. This inequality will be denoted by .
When , this becomes the usual Poincaré inequality
[TABLE]
where . Thus is exactly the best Poincaré constant .
When , the left-hand side (LHS in short) of (1.1), understood as the limit when , equals to , where
[TABLE]
is the entropy of . So the Sobolev-type inequality (1.1) becomes
[TABLE]
the usual log-Sobolev inequality ([2]). Thus coincides with the best log-Sobolev constant .
When , is a centered version of the classic defective Sobolev inequality :
[TABLE]
For example for the Lebesgue measure on and , the above Sobolev inequality holds with , for (, see Aubin [1]). Notice that the defective Sobolev inequality with fails for probability measure .
The centered Sobolev inequality was studied by Aubin [1] and Beckner [6] for the normalized volume measure on the unit sphere in . They obtained the exact result : if (for ). Bakry and Ledoux [4], using the diffusion semigroup method, proved the following sharp and general result of (see also Ledoux [15, Theorem 3.1]) :
Theorem 1.1**.**
([4]) Let be a Markov diffusion generator satisfying the Bakry-Emery’s curvature-dimension condition for some and . Then for every , (1.1) holds with .
This deep theorem of Bakry-Ledoux generalizes the famous Lichrowicz bound about .
When , the LHS of (1.1), understood as the limit when , equals to . Setting , we see that becomes
[TABLE]
Relationship between the Sobolev inequalities for different is summarized in
Theorem 1.2**.**
- (a)
For any , . 2. (b)
* is nondecreasing in .* 3. (c)
For any , the Sobolev inequality is equivalent to the Poincaré inequality, more precisely
[TABLE]
This result is essentially contained in Bakry and Ledoux [4].
In other words this family of Sobolev inequalities for different has four interesting cases : (1) ; (2) ; (3) and (4) .
1.2. Semigroup
Let be a Markov semigroup such that for all (i.e. is an invariant measure), strongly continuous on . Let be the generator of , whose domain in is denoted by (). We always assume that
(A1)* is contained in and dense in w.r.t. the norm (i.e. is a form core of ), and*
[TABLE]
In other words is the symmetrized Dirichlet form of . This assumption holds automatically if is self-adjoint (i.e. is symmetric on ).
It is well known that the Poincaré inequality is equivalent to the exponential convergence of to in :
[TABLE]
And if is a diffusion semigroup, the log-Sobolev inequality is equivalent to the exponential convergence of to in the relative entropy
[TABLE]
See Bakry [2]. Notice that the later equivalence is false in the jump case ([18]).
But unlike Poincaré and log-Sobolev, the role of the Sobolev inequality (1.1) for different from in the exponential convergence of is unknown. Our first purpose of this paper is to fill this gap.
This paper is organized as follows. In the next section we establish the equivalence between the Sobolev inequality and the exponential convergence of to , in some -entropy sense. Several corollaries and applications are derived for illustrating the usefulness of the Sobolev inequality (1.1), especially for the rate of the exponential convergence of to in total variation.
In §3 we recall the relationship between the defective Sobolev inequality and centered Sobolev inequality when and present a bounded perturbation result.
In §4 we present some two-sided estimates of the optimal constant of Sobolev inequality when on the real line, by following Barthe and Roberto [5].
2. Equivalence between Sobolev inequality and exponential convergence
2.1. Framework
Besides (A1), we assume
(A2) (Existence of the carré-du-champs operator)* there is an algebra contained in and dense in w.r.t. the norm . So the carré-du-champs operator*
[TABLE]
is well defined. can be extended as a continuous mapping from .
(A3)* is a diffusion semigroup, i.e. is the transition probability semigroup of a continuous Markov process valued in defined on . *
Under those assumptions, for every ,
[TABLE]
is a -martingale, and
[TABLE]
(this holds at first for , then for by continuous extension). Consequently if and is infinitely differentiable, then by Ito’s formula and
[TABLE]
We write . For any -function on , integrating (2.1) we have
[TABLE]
Next (2.1) implies that is a derivation,
[TABLE]
2.2. Exponential convergence in the -entropy
Definition 2.1**.**
Given a lower bounded convex function , the -entropy of a function is defined as
[TABLE]
The main result of this section is
Theorem 2.2**.**
For the diffusion Markov semigroup with invariant probability measure satisfying (A1), (A2) and (A3), the Sobolev inequality (1.1) is equivalent to the exponential convergence in the -entropy
[TABLE]
where
[TABLE]
We begin with a known result (see Chafai [7]).
Lemma 2.3**.**
Let be a lower bounded -convex function and be a class of functions in , stable for (i.e. if , ). The exponential convergence in the -entropy
[TABLE]
for some positive constant is equivalent to
[TABLE]
Proof.
Since for
[TABLE]
by (2.2), the equivalence above follows from Gronwall’s lemma. ∎
Proof of Theorem 2.2.
For the exponential convergence in the -entropy we may restrict to . In that case as is on , we can apply Lemma 2.3.
At first this equivalence is well known for as recalled in the Introduction.
We begin with the case . By Lemma 2.3, the exponential convergence (2.4) is equivalent to
[TABLE]
Setting , this last inequality is equivalent to
[TABLE]
which is exactly the Sobolev inequality (1.1).
For , by Lemma 2.3, the exponential convergence (2.4) is equivalent to
[TABLE]
Setting , this last inequality is equivalent to
[TABLE]
which is exactly the Sobolev inequality (1.1).
Finally for , by Lemma 2.3, the exponential convergence (2.4) is equivalent to
[TABLE]
which is exactly the Sobolev inequality (1.5) for . ∎
2.3. Exponential convergence in Hellinger metric
Now we present an application to the exponential convergence in the Hellinger metric . Recall that for two probability measures where is some reference measure,
[TABLE]
is in fact independent of the choice of .
Corollary 2.4**.**
Assume that the adjoint operator of satisfies also (A1), (A2), (A3). The Sobolev inequality (1.1) for is equivalent to
[TABLE]
for any -probability density function .
Recall that the distribution of is if the initial distribution of is .
Proof.
We have for any -probability density function ,
[TABLE]
And for the exponential convergence in (2.4) (with ), one may restrict to the functions such that by homogeneity. So this corollary follows directly by Theorem 2.2(a). ∎
Remark 2.5**.**
Let (the total variation). It is known that (Gibbs and Su [9])
[TABLE]
So under the Sobolev inequality (1.1) with , we have
[TABLE]
an explicit estimate of the exponential convergence in total variation. **
2.4. Exponential convergence in total variation
We now generalize the result above to general different from .
Corollary 2.6**.**
Assume that satisfies (A1), (A2), (A3). If the Sobolev inequality holds for some , then for any -probability density ,
[TABLE]
Proof.
It follows from Theorem 2.2 and the lemma below.∎
Lemma 2.7**.**
Let . Then for any such that ,
[TABLE]
and
[TABLE]
Proof.
We have
[TABLE]
that is (2.7).
For (2.8) we may assume that . Letting ,
[TABLE]
(which is the conditional expectation of knowing ), by Jensen’s inequality we have
[TABLE]
So it is enough to prove (2.8) for , a two-valued function. Let be the two values of (so ), and
[TABLE]
Since , . Consider
[TABLE]
We have , and . Hence for (2.8), by Taylor’s formula we have only to show that
[TABLE]
Notice that
[TABLE]
and for all . We now divide our discussion into two cases.
Case 1. . In this case, , then for all , consequently
[TABLE]
which implies (2.9).
Case 2. . Since and , there is a unique such that , i.e. () or . Consequently
[TABLE]
for . The last bound is optimal because it becomes equality if . That completes the proof of (2.9). ∎
3. Defective Sobolev inequality implies centered Sobolev inequality and a bounded perturbation result
3.1. Defective Sobolev inequality implies Sobolev inequality
Theorem 3.1**.**
([3]) If the defective Sobolev inequality (1.4) holds with some positive constants for some , and the Poincaré inequality (1.2) holds with the best constant , then we have
[TABLE]
The above theorem 3.1 is a direct consequence of the following lemma.
Lemma 3.2**.**
Let and be a square integrable function on a probability space . Then for all , we have
[TABLE]
This lemma is also contained in [3] and will be used in the next section.
Notice that if the defective Sobolev inequality holds for some , then with the density bounded ([4]). That implies is a Hilbert-Schmidt operator, then compact on : in particular the Poincaré inequality holds true.
3.2. Bounded perturbation
It is well known that -entropy defined in definition 2.1 has the following variational form :
[TABLE]
for all . The following proposition shows that the Sobolev inequality (1.1) is stable by bounded transformation of the probability measure .
Proposition 3.3**.**
Assume that the Dirichlet form for some carré-du-champs operator which is a derivation, i.e. for all and . Assume that the probability measure satisfies Sobolev inequality (1.1) with the best constant for . Let be the probability measure defined by such that , where is the normalization constant. Then satisfies Sobolev inequality
[TABLE]
Proof.
By the proof of Theorem 2.2, the Sobolev inequality (1.1) is equivalent to
[TABLE]
where
[TABLE]
We have by (3.3),
[TABLE]
which implies the result. ∎
3.3. Reflected Brownian motion
Given a domain of , let be the Sobolev space of the functions on with the norm . Recall the extension theorem on Sobolev space:
Theorem 3.4**.**
Suppose that is a bounded domain with Lipschitz boundary. Then there exist a bounded linear operator and a constant such that
- (1)
* for a.e. ;* 2. (2)
.
According to the well known Sobolev inequality on , we have the following
Corollary 3.5**.**
For any bounded domain () with Lipschitz boundary, the Sobolev inequality (1.1) holds for with the normalized Lebesgue measure on for any (this last quantity is interpreted as if ).
Proof.
By Theorem 3.4, we have for all ,
[TABLE]
where , is the best Sobolev constant. Then the defective Sobolev inequality (1.4) holds with . The result follows by Theorem 3.1. ∎
4. Sobolev inequality in dimension one
In [5], F. Barthe and C. Roberto provide the estimate of the optimal constant of Sobolev inequality when on the real line. In this section we generalize the estimate of the optimal constant to the case on the real line by the method in [5].
Theorem 4.1**.**
Let and (non-negative) be Borel measures on with and , where is the absolutely continuous component of . Let be a median of . Let be the optimal constant satisfying :
[TABLE]
for every smooth function .
Then we have , where
[TABLE]
We will use the following Proposition and Lemmas to prove Theorem 4.1.
Proposition 4.2**.**
(See [5])* Let (non-negative) be Borel measures on , where is a median of and , where is the absolutely continuous component of . Let be a family of non-negative Borel measurable functions on . We set for any measurable function . Let be the smallest constant such that for every smooth function with , we have*
[TABLE]
Then , where
[TABLE]
Lemma 4.3**.**
Let be a non-negative integrable function on a probability space . Let and be some constants, then we have
[TABLE]
Proof.
For any Borel measurable function , by Hölder’s inequality, we have
[TABLE]
Hence
[TABLE]
Using (4.2), we have
[TABLE]
The last inequality is derived by
[TABLE]
Hence the lemma is established. ∎
Lemma 4.4**.**
Let , be a finite measure on . Let be a measurable subset with and K be a constant with . Then we have
[TABLE]
Proof.
Simply, we denote by the right hand side of the above equality. Without loss of generality, we can assume on , hence
[TABLE]
For any , by Jensen’s inequality, we have
[TABLE]
Hence
[TABLE]
We take , then equality in (4.3) holds. Hence
[TABLE]
which is the desired result. ∎
Now we prove Theorem 4.1.
Proof of Theorem 4.1..
Step 1. We estimate the upper bound of . For any smooth function , let , and . It is easy to see they are all continuous and , when . We set and , then . By Lemma 3.2 and Lemma 4.3, we have
[TABLE]
Now we deal with . Since on , we have by Proposition 4.2,
[TABLE]
where
[TABLE]
By Lemma 4.4, we have
[TABLE]
Similarly, we have
[TABLE]
Since on , we have
[TABLE]
Hence .
Step 2. We estimate the lower bound of . At first, we suppose that is a continuous function which vanishes on and is smooth on . By approximation, satisfies (4.1). Noting that in order to approach the supremum, the test function on . By Lemma 4.3, we have
[TABLE]
Since , we have for many non-negative functions . By (4.1), for such functions with , we have
[TABLE]
By Proposition 4.2, we have
[TABLE]
Since , using Lemma 4.4, we have
[TABLE]
Then we suppose that is a continuous function which vanishes on and is smooth on . Similarly, we have
[TABLE]
Hence . The proof is completed. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Th. Aubin. Nonlinear analysis on manifolds. Monge-Ampère equations. Springer , 1982.
- 2[2] D. Bakry. L’hypercontractivité et son utilisation en théorie des semigroupes. In Ecole d’Eté de Probabilités de Saint-Flour (1992) , number 1581 in Lecture Notes in Mathematics. Springer-Verlag, 1994.
- 3[3] D. Bakry, I. Gentil and M. Ledoux. Analysis and Geometry of Markov diffusion operators. In Grundlehren der mathematischen Wissenschaften 348 , Springer 2014.
- 4[4] D. Bakry and M. Ledoux. Sobolev inequalities and Myers’s diameter theorem for an abstract Markov generator. In Duke Math. J. , 85: 253–270, 1996.
- 5[5] F. Barthe and C. Roberto. Sobolev inequalities for probability measures on the real line. Studia Math. , 159, No 3: 481–497, 2003.
- 6[6] W. Beckner. Sobolev inequalities, the Poisson semigroup and analysis on the sphere S n superscript 𝑆 𝑛 S^{n} . Proc. Nat. Acad. Sci. , 89: 4816–4819, 1992.
- 7[7] D. Chafai. Entropies, convexity, and functional inequalities: on Φ Φ \Phi -entropies and Φ Φ \Phi -Sobolev inequalities. J. Math. Kyoto Univ. , 44 (2004), no. 2, 325-363.
- 8[8] M.F. Chen. Eigenvalues, inequalities, and ergodic theory. Probability and its Applications , Springer-Verlag, 2005.
