Diffusion Hypercontractivity via Generalized Density Manifold
Wuchen Li

TL;DR
This paper introduces a new framework connecting diffusion hypercontractivity with various inequalities through a generalized density manifold, employing advanced calculus and transport metrics.
Contribution
It develops a one-parameter family of diffusion hypercontractivity results and introduces a novel PH^{-1}I inequality linking divergence, transport, and information measures.
Findings
Established a new family of diffusion hypercontractivity inequalities.
Derived the PH^{-1}I inequality connecting divergence, transport, and Fisher information.
Presented a mean-field Bakry-Emery calculus and Yano's volume measure formula.
Abstract
We prove a one-parameter family of diffusion hypercontractivity and present the associated Log-Sobolev, Poincare and Talagrand inequalities. A mean-field type Bakry-Emery iterative calculus and volume measure based integration formula (Yano's formula) are presented. Our results are based on the interpolation among divergence functional, generalized diffusion process, and generalized optimal transport metric. As a result, an inequality among Pearson divergence (P), negative Sobolev metric H^-1 and generalized Fisher information functional (I), named PH^{-1}I inequality, is derived.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Diffusion Hypercontractivity via Generalized Density Manifold
Wuchen Li
Abstract.
We prove a one-parameter family of diffusion hypercontractivity and present the associated Log-Sobolev, Poincaré and Talagrand inequalities. A mean-field type Bakry-Émery iterative calculus and volume measure based integration formula (Yano’s formula) are presented. Our results are based on the interpolation among divergence functional, generalized diffusion process, and generalized optimal transport metric. As a result, an inequality among Pearson divergence (P), negative Sobolev metric ( ) and generalized Fisher information functional (I), named PI inequality, is derived.
Key words and phrases:
Information theory; Mean-field Bakry-Émery calculus; Generalized Log-Sobolev inequality; Generalized Poincaré inequality; Generalized Talagrand inequality; Generalized Yano’s formula.
Wuchen Li is supported by AFOSR MURI (Grant No. 18RT0073).
1. Introduction
Diffusion hypercontractivity plays essential roles in functional inequalities [2, 10] and information theory [6, 7]. It is often used to estimate the convergence rates of Markov chain Monte Carlo methods. Among these studies, Bakry–Émery criterions [1] provide sufficient conditions for showing convergences rates of diffusion processes and related inequalities. Recently, optimal transport provides the other viewpoint on this topic [22]. In this viewpoint, the probability density space is embedded with an infinite-dimensional Riemannian metric, named Wasserstein metric [19]. Here the density space with Wasserstein metric is named density manifold [13]. Following this metric viewpoint, the diffusion hypercontractivity, in particular, the Bakery–Émery criterions, can be derived by Hessian operators of divergence functionals in density manifold [20]. This study has been extended to general base metric spaces [18, 21]. On the other hand, the relation between local behavior of diffusion hypercontractivity (such as Poincaré inequality) and integral formula, known as Yano’s formula [23], has been discovered in [5, 14]. It reveals the connection between integration formula on the base manifold and Riemannian calculus in density manifold.
In this paper, we study the hypercontractivity of generalized diffusion processes, named Hessian transport stochastic differential equations [17]. See related information theory background in [25]. Following the generalized (mobility) density manifold proposed in [4, 8], the density manifold’s Riemannian calculus [15] and geometric insights of inequalities provided in [20], we introduce the generalized Bakry–Émery criterions in Theorem 1. These criterions provide sufficient conditions for showing convergence rates of generalized diffusion processes and establishing generalized Log-Sobolev and Talagrand inequalities. In addition, a generalized Yano’s formula in Theorem 2 is derived, which provides a reference measure related integral formulas. Using Yano’s formula, we establish the generalized Poincaré inequality in Corollary 3. More importantly, a PI inequality is presented in Theorem 4.
In literature, the generalized optimal transport metric has been proposed in [8] and many groups have studied associated generalized functional inequalities [3, 9]. Firstly, [9] studies functional inequalities for the classical Kolmogorov-Fokker-Planck equation, where the Bakry–Émery criterions are classical and generalized optimal transport metrics depend on the reference measure. Here we study transport metrics related stochastic processes and then build new functional inequalities. We introduce new Bakry–Émery criterions, in which the metric of density manifold in inequalities do not depend on the reference measure. For example, we obtain several functional inequalities related to the classical metric. In addition, [3] formulates divergence functional related inequalities for a type of drift-diffusion processes. In this study, they apply the classical Bakry-Émery iterative calculus. While in this paper, we introduce a new mean field Bakry-Émery iterative calculus.
This paper is organized as follows. In section 2, we state the main result of this paper. We establish hypercontractivity for generalized drift-diffusion processes and prove several functional inequalities. A generalized Yano’s formula is also derived. In section 3, we formulate the primary tool of the proof. In section 4, the proof is presented. In section 5, a generalized Bakry-Émery iterative calculus is presented.
2. Main result
Given a compact and smooth Riemannian manifold without boundary. Denote its volume form by , the Ricci curvature tensor by Ric, the gradient, divergence, Laplacian operators by , , respectively, and the Hessian operator by Hess.
Given a reference probability density function with , consider the –drift diffusion process
[TABLE]
where is the standard Brownian motion in with the infinitesimal generator
[TABLE]
Consider the –divergence functional111It is often named –divergence with . We use notation for the simplicity of presentation.
[TABLE]
where has the form
[TABLE]
Consider the –Fisher information functional
[TABLE]
Consider the *–Wasserstein distance222This notation of Wasserstein-distance is first studied in [8]. When , we notice that the notation of represents the classical -Wasserstein distance, not the -Wasserstein distance. *
[TABLE]
where , and the infimum is over all potential function .
We next provide sufficient conditions for the hypercontractivity of –drift diffusion process and functional inequalities among –divergence, –Fisher information and –Wasserstein distance.
Theorem 1** (Generalized hypercontractivity).**
Let , if there exists a constant , such that
[TABLE]
Let be a smooth initial distribution and be the probability density function of –drift diffusion process, then
[TABLE]
Moreover, for any smooth probability density function with , the generalized Log-Sobolev inequality holds
[TABLE]
and the generalized Talagrand inequality holds
[TABLE]
Example 1** (Kullback–Leibler divergence).**
Consider , . Here the forms the classical Kullback–Leibler divergence function (relative entropy), and is the classical relative Fisher information, is the classical Langevin process, and is the classical -Wasserstein distance. Here the condition (1) forms
[TABLE]
which is the classical Bakry–Émery criterion. Under this condition, the distribution of drift diffusion process converges to ; the Log–Sobolev inequality (3) holds
[TABLE]
and the Talagrand inequality holds
[TABLE]
Here, if is Ricci flat, i.e. and we denote , the condition (1) forms .
Example 2** (Pearson divergence).**
Consider , . Here is named Pearson divergence function, is the [math]–Fisher information and is the [math]–drift diffusion process. The condition (1) forms
[TABLE]
Under this condition, the distribution of drift diffusion process converges to and the generalized Log–Sobolev inequality (3) holds
[TABLE]
We also show a new integration identity, which is found in the proof of Theorem 1.
Theorem 2** (Generalized Yano’s formula).**
Denote , then
[TABLE]
Remark 1*.*
When is a uniform measure, i.e. , the above formula is the classical Yano’s formula [23]:
[TABLE]
When , it is the generalized Yano’s formula studied in [5, 14]
[TABLE]
Our derivation extends these classical Yano’s formulas with general volume measure and its power . For example, when , we obtain
[TABLE]
Later on, using the generalized Yano’s formula, we prove the following Poincaré inequality.
Corollary 3** (Generalized Poincaré inequality).**
If there exists a constant , such that when ,
[TABLE]
or when ,
[TABLE]
then
[TABLE]
for any with .
Remark 2*.*
When , Corollary 3 recovers the classical Poincaré inequality
[TABLE]
Remark 3*.*
Here we derive the formulation of generalized Poincaré inequality by the approximation of generalized Log-Sobolev inequality. In details, denote , where and . The L.H.S. of (5) comes from the Hessian metric tensor for the –divergence :
[TABLE]
While the R.H.S. of (5) is from the second order approximation in term of for the –relative Fisher information:
[TABLE]
Example 3** (Reverse Kullback–Leibler divergence).**
Consider , . Here is named reverse Kullback–Leibler divergence function or Cross entropy. In this case, the condition in Corollary 3 forms
[TABLE]
Under this condition, the generalized Poincaré inequality holds
[TABLE]
where and . Again, if is Ricci flat, i.e. , then the condition in Corollary 3 forms .
Last, notice the fact that when , the -Wasserstein distance is exactly the distance:
[TABLE]
where is the negative Sobolev distance between and , i.e.
[TABLE]
We next show an inequality among Pearson divergence (P), metric and [math]–Fisher information (I), named inequality.
Theorem 4** (Inequalities for metric).**
Suppose , where , then the inequality holds
[TABLE]
In addition, if , then the -Talagrand inequality holds
[TABLE]
Remark 4*.*
The PI inequality is an analog of inequalities among (H), Wasserstein-2 metric and –Fisher information, known as HWI inequality; see details in [20].
Remark 5*.*
If , the PI inequality shows
[TABLE]
In addition, using the fact that and , we have
[TABLE]
In next sections, we apply geometric tools in probability space to prove the above inequalities and integral formulas.
3. Generalized Density manifold
In this section, we introduce the main tool to prove above results. We first review a class of Riemannian metrics in probability space, introduced by Wasserstein distance. We then present its Riemannian calculus, including gradient and Hessian operators. By using gradient operators in this metric, we connect –divergence, –Fisher information and –drift diffusion process.
3.1. Density manifold and its Riemannian calculus
Consider the set of smooth and strictly positive densities
[TABLE]
The tangent space of at is given by
[TABLE]
Consider the –Wasserstein metric tensor in the probability space as follows.
Definition 5**.**
The inner product is defined as for any and :
[TABLE]
where is a weighted elliptic operator. In addition, denote , , with , , then
[TABLE]
Remark 6*.*
An observation is that if , the proposed metric is the metric [8]. If , the proposed metric is the -Wasserstein metric [13, 19].
We note that the characterization of geodesics in has been studied in [4, 8]. In this paper, we focus on the Riemannian calculus for density manifold , using both in tangent bundle and in cotangent bundle.
Proposition 6**.**
The Christoffel symbol in satisfies
[TABLE]
where , , and
[TABLE]
Proof.
The proof follows the study in [15]. We derive the Christoffel symbol by using the Lagrangian formulation of geodesics. Consider the minimization of the geometric action functional in density space
[TABLE]
where is a density path with fixed boundary points , . The geodesics in satisfies the Euler-Lagrange equation
[TABLE]
i.e.
[TABLE]
where is a function depending only on . Using the fact that
[TABLE]
then equation (6) forms
[TABLE]
By timing both sides with and comparing with the geodesics equation,
[TABLE]
we derive the Christoffel symbol. Similarly, we can formulate the Christoffel symbol (raised Christoffel symbol) in term of . ∎
Proposition 7**.**
The geodesics equation in satisfies
[TABLE]
Denote Legendre transform , then the co-geodesics equation satisfies
[TABLE]
Proof.
The geodesics equation follows . We next demonstrate the Hamiltonian formulation of geodesics flow. Consider the Legendre transform in :
[TABLE]
Then , and
[TABLE]
Then the co-geodesic flow satisfies
[TABLE]
which is the equation pair (7). ∎
For completeness of this paper, we also present the Lagrangian coordinates of geodesics in generalized density manifold.
Proposition 8** (Lagrangian coordinates).**
Denote , where is the diffeomorphism and is the push-forward operator. Then the Lagrangian coordinates of geodesic equation (7) satisfies
[TABLE]
Remark 7*.*
Here we present three examples of geodesics in Lagrangian coordinates for generalized density manifold.
- (i)
If , the geodesic equation forms
[TABLE]
which is a well-known result in optimal transport.
- (ii)
If , the geodesic equation satisfies
[TABLE]
- (iii)
If , the , i.e. , geodesic equation satisfies
[TABLE]
Proof.
Denote
[TABLE]
where satisfies (7). Then
[TABLE]
Notice the fact that
[TABLE]
where the equality holds since . Substituting the above and into (8), we obtain
[TABLE]
where the second last equality uses the fact and . ∎
Proposition 9**.**
Consider a functional .
- (i)
The Riemannian gradient operator of in satisfies
[TABLE]
where is the first variation. The squared norm of gradient operator forms
[TABLE]
- (ii)
The Riemannian Hessian operator of in satisfies
[TABLE]
where , , and is the second variation operator.
Remark 8*.*
Several interesting examples of Hessian operators of have been studied in [4], including linear and interaction potential energies.
Proof.
(i) The Riemannian gradient operator satisfies
[TABLE]
Then
[TABLE]
(ii) The Riemannian Hessian operator satisfies
[TABLE]
We next formulate the terms , separately. Notice the fact that
[TABLE]
where the second and third equalities are shown by integration by parts with respect to , . In addition, we estimate three terms in (h2).
[TABLE]
where the first and second equality holds by integration by parts with respect to . Similarly,
[TABLE]
And
[TABLE]
Combining the above three terms in (h2), we finish the proof. ∎
3.2. Gradient systems and –drift diffusion process
In this sequel, we present the relation among Riemannian gradient operators in , –divergence functional, –Fisher information and –drift diffusion process, see details in [17].
Given a -divergence functional, the Kolomogrov forward operator of –drift diffusion process is the negative gradient descent direction in . And the squared gradient norm of divergence functional in forms the –Fisher information functional.
Lemma 10**.**
The following statements hold.
- (i)
[TABLE]
where is the adjoint operator of in .
- (ii)
[TABLE]
Proof.
We first prove (i). On the one hand, the the Kolomogrov forward operator forms
[TABLE]
We need to show
[TABLE]
Notice the fact
[TABLE]
where the second equality uses the fact and the fourth equality applies the fact . On the other hand, the negative gradient operator of in satisfies
[TABLE]
Comparing the above, we finish the proof.
We next prove (ii). Notice the fact that
[TABLE]
where the second equality uses the fact that .
∎
Shortly, we shall apply the above two geometric relations to give sufficient conditions for hypercontractivity of –diffusion process and generalized functional inequalities.
4. Proof
Here we present the proof of generalized diffusion hypercontractivity using the geometric tools built in previous section.
4.1. Sketch of proof
Consider the gradient flow of –divergence functional
[TABLE]
where is the probability density function at time . Then the first time derivative of –divergence along the gradient flow forms
[TABLE]
And the second time derivative of –divergence becomes
[TABLE]
If we can bound the ratio between the first and second derivative, i.e.
[TABLE]
we prove Theorem 1. This is true if we integrate (10) on both sides for , then
[TABLE]
By Grownwall’s inequality, we obtain the hypercontractivity of –drift diffusion process
[TABLE]
In addition, notice the fact that , then inequality (11) forms
[TABLE]
By choosing with arbitrary , the Log-Sobolev inequality (3) is proven.
From above arguments, the proof boils down to estimate the ratio in (10). Here the formulation (10) is equivalent to
[TABLE]
Next, our goal is to derive the Hessian operators of –divergence in .
4.2. Hessian operator estimation
Lemma 11** (Hessian of –divergence in ).**
Denote , then
[TABLE]
Remark 9*.*
In fact, there are several interesting cases for Hessian operators of –divergence in density manifold. Denote .
- (i)
If , then
[TABLE]
- (ii)
If is a uniform measure [4], then
[TABLE]
- (iii)
If , then
[TABLE]
Proof.
From proposition 9, we can compute the Hessian operator of –Divergence directly. For readers who are not family with geometric computations, the following direct method is also given. The Hessian operator is given by taking the second order time derivative of along the co-geodesics flow (7). Consider the first order time derivative
[TABLE]
And the second order time derivative satisfies
[TABLE]
We estimate (a), (b) and (c) separately.
For (a), we denote . Then
[TABLE]
We next estimate (b).
[TABLE]
Here we derive (b1), (b2), (b3), (b4) more explicitly. Notice the fact that
[TABLE]
and
[TABLE]
In addition,
[TABLE]
and
[TABLE]
We last derive (c).
[TABLE]
We estimate (c1), (c2), (c3), (c4) explicitly. Notice the fact that
[TABLE]
and
[TABLE]
In addition,
[TABLE]
We now summarize all the formulas.
[TABLE]
Notice the fact that
[TABLE]
where the last equality is from Bochner’s formula, i.e.
[TABLE]
By substituting (d) into (12), we obtain
[TABLE]
We lastly reformulate the term (e):
[TABLE]
Notice that (e1) has a formulation
[TABLE]
where the second equality holds by the fact that .
Substituting (e1) into (e), we obtain
[TABLE]
Substituting the above formula into (13), we derive
[TABLE]
∎
We observe that the Hessian operator in is more complicated than the one with . Since when , there is no interaction bilinear term between the Hessian operator and the squared gradient norm. We overcome this by the following estimates. Denote the bilinear form:
[TABLE]
Lemma 12**.**
Denote , then for any ,
[TABLE]
Proof.
The proof is based on an estimation for the bilinear form . Notice
[TABLE]
Then
[TABLE]
and
[TABLE]
Denote , , then , and thus
[TABLE]
We further denote , then
[TABLE]
which finishes the proof. ∎
4.3. Proof
Proof of Theorem 1.
Firstly, following Lemma 11 and Lemma 12, we prove that condition (1) implies both the convergence result (10) and the functional inequality (3).
Secondly, the generalized Talagrand inequality (4) follows directly from the gradient flow interpolation of inequality in Proposition of [20]. For completeness of this paper, we present it here. Consider the real value function
[TABLE]
where is the density function at time . Notice that and , since following (10).
We next claim . If so, we finish the proof. To prove it, we show that
[TABLE]
Notice the fact that
[TABLE]
and along the gradient flow ,
[TABLE]
In addition
[TABLE]
Thus
[TABLE]
which finishes the proof. ∎
Proof of Theorem 2.
We prove the equality by using the Hessian operator of in at the point . Notice that for any , then
[TABLE]
Following the Hessian operator formula in (9), denote , then
[TABLE]
where the second equality uses the fact and (14). Comparing the above with formula at in Lemma 11, we prove the equality. ∎
Proof of Corollary 3.
We first prove the following claim.
Claim:
[TABLE]
Proof of Claim.
The first equality holds by the definition of Hessian operator at as in the proof of Theorem 2. We next focus on the second equality. Denote . Then the minimization in the second equation of (15) forms
[TABLE]
The minimizer of above minimization satisfies the following eigenvalue problem
[TABLE]
i.e.
[TABLE]
In other words, , where represents the smallest non-zero eigenvalue.
On the other hand, denote , then the minimizer of minimization (15) in the third equality forms
[TABLE]
Similarly, the minimizer of above minimization satisfies the following eigenvalue problem
[TABLE]
i.e.
[TABLE]
Thus . From the above, we have , which finishes the proof of claim. ∎
From the above claim, the smallest eigenvalue of Hessian operator of in at is precisely the lower bound for the Poincaré inequality. Here from the generalized Yano’s formula, we have
[TABLE]
where
[TABLE]
Thus
[TABLE]
From the above, we can estimate the smallest eigenvalue of Hessian operator, which finishes the proof. ∎
Proof of Theorem 4.
We first prove the inequality. Denote be a geodesic curve of least energy in , with metric, where and . Then from Proposition 7, , i.e. . Thus
[TABLE]
By taking the Taylor expansion of in at , we obtain
[TABLE]
where . From the Cauchy-Schwarz inequality, we have
[TABLE]
In addition, the condition implies , thus
[TABLE]
Substituting (17) and (18) into (16), we prove the inequality. In addition, the -Talagrand inequality follows directly from Theorem 1. ∎
Remark 10*.*
The current method fails when or . In these cases, there is no finite lower bound for the bilinear form and squared gradient norm for any . One can not obtain the finite ratio between and . Thus we can not establish the exponential decay results in term of –divergence.
However, the current method fails does not mean that we can not find the convergence guarantee condition of –diffusion processes when . In fact, we can always formulate –divergence as the gradient flow of 1-divergence (relative entropy) w.r.t. density manifold metric \Big{(}-\nabla\cdot(\rho\mu^{\gamma-1}\nabla)\Big{)}^{-1}. In this case, the study of diffusion hypercontractivity forms a classical Bakry–Émery method. In other words, one can always apply the entropy method or entropy-entropy production as in [24] to find the associated diffusion hypercontractivity and convergence rate in 1-divergence. See related details in [3].
Remark 11*.*
We comment on the proof of different types of inequalities. (i) For Log-Sobolev and Talagrand inequalities, we only need the Hessian operator along the gradient flow to have a lower bound. (ii) For Poinćare inequality, we require the Hessian operator at the equilibrium measure to have a lower bound. (iii) For the divergence, metric and information inequality, such as HWI or PI inequality, we require the Hessian operator to have a lower bound for any tangent directions in density manifold. Interestingly, the above three conditions coincide in the case of .
5. Generalized Bakry–Émery Calculus
In this section, we propose the generalized Bakry–Émery iterative calculus. This definition follows the connection of Hessian operator in density manifold with the generator (Kolmogorov backward operator) of –drift diffusion process.
We first define the generalized iterative Bakry–Émery Gamma operators.
Definition 13** (–Bakry–Émery calculus).**
Denote the –Gamma one operator by
[TABLE]
where , .
Denote the –Gamma two operator by
[TABLE]
where , .
Remark 12*.*
We note that when , we recover the classical iterative Bakry–Émery operators. Here and are independent of with
[TABLE]
where is the generator of classical Langevin drift diffusion process. In addition, when , the generalized Bakery–Émery Gamma one and Gamma two operators depend on the current density . In other words, they are mean-field formulations of Gamma operators.
We next prove an equality to bridge generalized Bakry–Émery calculus and Hessian operator of –divergence in density manifold.
Proposition 14**.**
[TABLE]
where and with .
Proof.
For simplicity of presentation, in the proof, we omit the notation of with the generalized Gamma operators, e.g. .
Let us recalculate the Hessian operator of in by using (9) directly. Using the generalized iterative operators, we reformulate (9) as follows:
[TABLE]
We next rewrite (19) in three terms. First, we prove the following claim.
Claim 1:
[TABLE]
Proof of Claim 1.
Notice
[TABLE]
and
[TABLE]
The above two facts show that
[TABLE]
Using the fact that and .
[TABLE]
where the second last equality holds by the integration by parts formula. ∎
Secondly, by switching and in Claim 1, we have
[TABLE]
Thirdly, we show the following claim.
Claim 2:
[TABLE]
Proof of Claim 2.
Here
[TABLE]
where the second equality uses the fact that
[TABLE]
and the last equality holds because is the adjoint operator in . ∎
By summing (20), (21) and times (22) and using (19), we have
[TABLE]
∎
We last point out that the generalized Bakry–Émery iterative calculus implies generalized hypercontractivity.
Proposition 15** (Generalized Bakry–Émery criterion).**
If there exists a constant , such that
[TABLE]
for , with respect to any . Then the generalized hypercontractivity (2) and the generalized Log-Sobolev inequality (3) hold.
Remark 13*.*
Our generalized Bakry-Émery operators follow the proof in proposition 19 of [15]. In other words, when the divergence functional is the relative entropy, i.e. , we have the classical Bakry-Émery iterative calculus. For generalized divergence functional, we introduce the generalized iterative Bakry-Émery calculus.
Remark 14*.*
When or , the ratio between generalized Gamma two operator and Gamma one operator gives the bound in (23). This is not the case for . In general, we need to apply the mean field (integral formula w.r.t ) of the Gamma two operator to bound the Gamma one operator, and then derive the related Log-Sobolev inequalities.
Proof.
Here the proof applies Proposition 14 and the gradient flow formulation (10) to prove Theorem 1. ∎
As a summary, we show the generalized Bakry–Émery criterion (23) here, and estimate its precise bound in Theorem 1. Besides, we comment on major differences and difficulties between generalized Bakry–Émery criterions and classical ones. The Hessian operator in generalized density manifold involves an additional quadratic form . Thus the smallest eigenvalue of Hessian operator in density manifold is not enough to provide a lower bound for the convergence rate of generalized drift-diffusion processes. Here, we carefully derive the global behavior of dynamics. This is to control the additional quadratic form along with the gradient flow for any . Besides, a local viewpoint is provided for establishing the Poincaré inequality, which follows local behavior of dynamics, i.e., the Hessian operator in density manifold at the minimizer . This local property relates to the integral formula, known as the Yano’s formula.
6. Discussion
In this paper, we study the diffusion hypercontractivity for –drift-diffusion process, and prove generalized Log-Sobolev, Poincaré and Talagrand inequalities. Firstly, using as the Lyapunov function, the global exponential convergence of –drift-diffusion process is presented for . It is to estimate the smallest eigenvalue of Hessian operator in density manifold along with the gradient flow for any . Secondly, the local behavior of -drift diffusion process is shown for any . It is to estimate the smallest eigenvalue of Hessian operator at the reference measure . This local property allows us to obtain a class of generalized Poincaré inequalities. Besides, we identify the generalized Poincaré inequality and Yano’s formula. Lastly, our approach can be formulated into the generalized Bakry–Émery iterative operators. Here, the Gamma one and Gamma two operators are mean-field based, which depend on the current density; see related studies in probability models [16]. In future work, we will study more general mean-field Bakry–Émery conditions for related diffusion hypercontractivity and functional inequalities.
Notations
We apply the following notations in this paper.
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] D. Bakry and M. Émery. Diffusions hypercontractives. Séminaire de probabilités de Strasbourg , 19:177–206, 1985.
- 2[2] D. Bakry, I. Gentil, M. Ledoux. Logarithmic Sobolev Inequalities. In: Analysis and Geometry of Markov Diffusion Operators. Grundlehren der mathematischen Wissenschaften (A Series of Comprehensive Studies in Mathematics), vol 348. Springer, Cham, 2014.
- 3[3] F. Bolley, I. Gentil. Phi-entropy inequalities for diffusion semigroups. Journal de Math matiques Pures et Appliqu es , 93(5): 449-473, 2010.
- 4[4] J. A. Carrillo, S. Lisini, G. Savar and D. Slepcev. Nonlinear mobility continuity equations and generalized displacement convexity. Journal of Functional Analysis 258(4):1273-1309, 2009.
- 5[5] S.-N. Chow, W. Li, and H. Zhou. Entropy dissipation of Fokker-Planck equations on graphs. Discrete & Continuous Dynamical Systems, series A , 38(10): 4929-4950, 2018.
- 6[6] T. M. Cover and J. A. Thomas. Elements of Information Theory . Wiley Series in Telecommunications. Wiley, New York, 1991.
- 7[7] I. Csiszár and P. C. Shields. Information Theory and Statistics: A Tutorial. Foundations and Trends™ in Communications and Information Theory , 1(4):417–528, 2004.
- 8[8] J. Dolbeault, B. Nazaret, and G. Savare A new class of transport distances. Calculus of Variations and Partial Differential Equations , (2):193–231, 2009.
