Peierls-Bogolyubov's inequality for deformed exponentials
Frank Hansen, Jin Liang, Guanghua Shi

TL;DR
This paper explores convexity and concavity of trace functions involving deformed logarithms and exponentials, deriving new inequalities that generalize Peierls-Bogolyubov's inequality and improve bounds for Tsallis relative entropy.
Contribution
It introduces new trace inequalities for deformed exponentials, extending Peierls-Bogolyubov's inequality and enhancing bounds for Tsallis relative entropy.
Findings
Derived new trace inequalities for deformed exponentials.
Generalized Peierls-Bogolyubov's inequality.
Improved lower bounds for Tsallis relative entropy.
Abstract
We study convexity or concavity of certain trace functions for the deformed logarithmic and exponential functions, and obtain in this way new trace inequalities for deformed exponentials that may be considered as generalizations of Peierls-Bogolyubov's inequality. We use these results to improve previously known lower bounds for the Tsallis relative entropy.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Peierls-Bogolyubov’s inequality for deformed exponentials
Frank Hansen
Institute for Excellence in Higher Education, Tohoku University, Sendai, Japan
Jin Liang
School of Mathematical Sciences, Shanghai Jiao Tong University, Shanghai, China
Guanghua
School of Mathematical Sciences, Shanghai Jiao Tong University, Shanghai, China
(April 3, 2017
Revised May 31, 2017)
Abstract
We study convexity or concavity of certain trace functions for the deformed logarithmic and exponential functions, and obtain in this way new trace inequalities for deformed exponentials that may be considered as generalizations of Peierls-Bogolyubov’s inequality. We use these results to improve previously known lower bounds for the Tsallis relative entropy.
Keywords: Deformed exponential function; Peierls-Bogolyubov’s inequality; Tsallis relative entropy.
1 Introduction
In statistical mechanics and in quantum information theory the calculation of the partition function of the Hamiltonian of a physical system is an important issue, but the computation is often difficult. However, it may be simplified by first computing a related quantity where is an easier to handle component of the Hamiltonian. Usually, the Hamiltonian is written as a sum of two operators, and the Peierls-Bogolyubov inequality states that
[TABLE]
which then provides information about the difficult to calculate partition function. We give in this paper generalizations of Peierls-Bogolyubov’s inequality in terms of the so-called deformed exponential and logarithmic functions. We formulate the results for operators on a finite dimensional Hilbert space but note that the results with proper modifications extend also to infinite dimensional spaces.
Main Theorem**.**
Let be self-adjoint operators, and let be a positive functional on
- (i)
If and and both and are bounded from above by then
[TABLE] 2. (ii)
If and and both and are bounded from above by then
[TABLE] 3. (iii)
If and and both and are bounded from below by then
[TABLE] 4. (iv)
If and and both and are bounded from below by then
[TABLE] 5. (v)
If and and both and are bounded from below by then
[TABLE]
If in particular is the trace this inequality reduces to
[TABLE]
In subsection 5.2 we give explicit formulae for the Fréchet differential operators in the parameter ranges and Note that the left-hand sides in the above theorem may be written as
[TABLE]
where in is replaced by the trace. If we in let tend to one, we obtain the inequality
[TABLE]
for and arbitrary self-adjoint operators and If we furthermore let tend to one we recover Peierls-Bogolyubov’s inequality (1.1).
Furuichi [4, Corollary 3.2] proved in the case by very different methods. It may be instructive to compare the above results with the first author’s study [7] of the deformed Golden-Thompson trace inequality.
We obtain, in Theorem 3.1, another variant Peierls-Bogolyubov type of inequality, and we improve, in Theorem 4.2, previously known lower bounds for the Tsallis relative entropy.
The Peierls-Bogolyubov inequality has been widely used in statistical mechanics and quantum information theory. Recently, Bikchentaev [1] proved that the Peierls-Bogolyubov inequality characterizes the tracial functionals among all positive functionals on a algebra. Moreover, Carlen and Lieb in [2] combined this inequality with the Golden-Thompson inequality to discover sharp remainder terms in some quantum entropy inequalities.
1.1 Deformed exponentials
The deformed logarithm is defined by setting
[TABLE]
for The deformed logarithm is also denoted the -logarithm. The inverse function is called the -exponential. It is denoted by and is given by the formula
[TABLE]
for The -logarithm is a bijection of the positive half-line onto the open interval Furthermore,
[TABLE]
Note also that
[TABLE]
for If tends to one then the -logarithm and the -exponential functions converge, respectively, toward the logarithmic and the exponential functions.
2 Preliminaries
Proposition 2.1**.**
Let be a real positive function defined in the cone of positive definite operators acting on a Hilbert space and assume is homogeneous of degree
- (i)
If is convex and then is convex. 2. (ii)
If is convex and then is concave. 3. (iii)
If is convex and and then is convex. 4. (iv)
If is concave and then is concave. 5. (v)
If is concave and then is convex. 6. (vi)
If is concave and and then is convex.
Proof.
Assume first that is a convex function. The level set
[TABLE]
is then convex. Take and assume Let and be any choice of positive numbers such that and We note that and obtain
[TABLE]
Therefore and by homogeneity, we conclude that is convex. If we choose such that and This is possible since is assumed to be positive. Since the exponent is negative we obtain and and therefore by homogeneity
[TABLE]
It follows that and thus
[TABLE]
where we again used that the exponent is negative. Therefore and by homogeneity we conclude that is concave. This proves and Under the assumptions in we proceed as under to obtain
[TABLE]
By homogeneity and since the exponent is negative, we obtain the inequality
[TABLE]
implying convexity of We obtain and by a variation of the reasoning used to obtain and ∎
Proposition 2.2**.**
Consider the function
[TABLE]
defined in positive definite operators. Then
- (i)
* is concave for * 2. (ii)
* is convex for and * 3. (iii)
* is concave for and * 4. (iv)
* is convex for and * 5. (v)
* is convex for and *
Proof.
Since the real function is convex in positive numbers for and and concave for it is well known that the trace function retains the same properties. A historic account of this result may be found in [8, Introduction]. By and in Proposition 2.1 we thus obtain that the function
[TABLE]
is concave for and convex for Furthermore, since the real function is concave and increasing for we derive in the assertion. Part then follows by Proposition 2.1 and Part follows from Proposition 2.1 by noting that Part follows from Proposition 2.1 by noting that and part finally follows from Proposition 2.1. ∎
Note that for is the Schatten -norm of the positive definite matrix The convexity in this case may also be derived by noting that a norm satisfies the triangle inequality and is positively homogeneous.
Proposition 2.3**.**
Let be an arbitrary operator and consider the function
[TABLE]
defined in positive definite operators. Then
- (i)
* is concave for and * 2. (ii)
* is convex for and * 3. (iii)
* is concave for and * 4. (iv)
* is convex for and * 5. (v)
* is convex for and *
Proof.
By continuity we may assume invertible. Since the function is operator convex for or it follows that the trace function is convex for these parameter values. It then follows by and in Proposition 2.1 that the function
[TABLE]
is concave for and convex for Furthermore, since the real function is concave and increasing for we derive part of the assertion. Part then follows by Proposition 2.1. Parts to now follow by minor variations of the reasoning in the preceding proposition. ∎
2.1 Some deformed trace functions
Theorem 2.4**.**
Consider the function
[TABLE]
defined in self-adjoint for and in self-adjoint for Then
- (i)
If and then is convex, 2. (ii)
If and then is convex, 3. (iii)
If and then is concave.
Proof.
Note that the conditions on ensure that for both and By calculation we obtain
[TABLE]
Under the assumptions in we obtain
[TABLE]
for By Proposition 2.2 and since the factor is negative, it follows that is convex. If then and the convexity of follows by Proposition 2.2. This proves the first statement. Under the assumptions in we obtain
[TABLE]
thus is convex by Proposition 2.2. Under the assumptions in we first consider the case and obtain
[TABLE]
thus is concave by Proposition 2.2. If then we use Proposition 2.2 to obtain that is convex. Since we conclude that is concave also in this case. ∎
Theorem 2.5**.**
Let be arbitrary and consider the function
[TABLE]
defined in self-adjoint for and in self-adjoint for
- (i)
If and then is convex, 2. (ii)
If and then is convex, 3. (iii)
If and then is concave.
Proof.
By calculation we obtain
[TABLE]
Under the assumptions in we obtain
[TABLE]
for By Proposition 2.3 and since the factor is negative, it follows that is convex. If then and the convexity of follows by Proposition 2.3. This proves the first statement. Under the assumptions in we obtain
[TABLE]
thus is convex by Proposition 2.3. The last case is argued as in the preceding theorem by considering the cases and separately. ∎
Note in the above theorem there is a gap between [math] and for the possible values of
3 Peierls-Bogolyubov type inequalities
We first obtain a variant Peierls-Bogolyubov type inequality as a consequence of Proposition 2.2. Take positive definite operators and define the function
[TABLE]
Since is convex for and we obtain the inequality,
[TABLE]
for these parameter values. By concavity we obtain the opposite inequality for the parameter values and and for the parameter values for and
Theorem 3.1**.**
For positive definite operators we have
- (i)
If and then
[TABLE] 2. (ii)
If and or if and then
[TABLE]
Proof.
With the parameter values in we may let tend to zero in (3.1) and obtain the inequality We note that is the left hand side in the desired inequality. Furthermore,
[TABLE]
where we used the chain rule for Fréchet differentiation, the linearity of the trace, and the formula in [5, Theorem 2.2]. This proves case Case follows by virtually the same argument using the opposite inequality in (3.1). ∎
We then explore consequences of Theorem 2.4. If we take self-adjoint operators such that both and are bounded from above by For we note that such that The function
[TABLE]
is thus well-defined and convex for and Therefore,
[TABLE]
for these parameter values.
For we take self-adjoint operators such that both and are bounded from below by For we note that such that The function defined in (3.2) is thus well-defined. It is convex for and and it is concave for and In the first case we thus retain the inequality in (3.3), while the inequality is reversed in the latter case.
Theorem 3.2**.**
Let be self-adjoint operators.
- (i)
If and and both and are bounded from above by then
[TABLE] 2. (ii)
If and and both and are bounded from below by then
[TABLE] 3. (iii)
If and and both and are bounded from below by then
[TABLE]
Proof.
With the parameter values in we may let tend to zero in (3.3) and obtain the inequality We note that is the left hand side in the desired inequality. Furthermore,
[TABLE]
where we used the chain rule for Fréchet differentiation, the derivatives of the deformed logarithmic and exponential functions, the linearity of the trace, and the formula in [5, Theorem 2.2]. This proves case The other cases follow by a variation of this reasoning. ∎
By a similar line of arguments as in the two previous theorems we finally obtain the following consequences of Theorem 2.5.
Theorem 3.3**.**
Let be arbitrary and be self-adjoint.
- (i)
If and and both and are bounded from above by then
[TABLE] 2. (ii)
If and and both and are bounded from below by then
[TABLE] 3. (iii)
If and and both and are bounded from below by then
[TABLE]
Proof.
We follow a similar path as in the proof of Theorem 3.2 and consider the function
[TABLE]
which by Theorem 2.5 is convex for the parameter values in We obtain by an argument similar to the one given in the proof of Theorem 3.2 that and we note that is the left hand side in the desired inequality. Furthermore,
[TABLE]
where we used the chain rule for Fréchet differentiation, the derivative of the deformed logarithmic function, and the linearity of the trace. This proves case Since the function in (3.4) is convex for the parameter values in and concave for the parameter values in these cases follow by virtually the same line of arguments as in ∎
Note that in Theorem 3.3 is a generalization of in Theorem 3.2. Since is arbitrary in the above theorem we may replace the trace by any other positive functional on The main theorem now follows from Theorem 3.2 and Theorem 3.3.
4 The Tsallis relative entropy
In this section we study lower bounds for the (generalized) Tsallis relative entropy. For basic information about the Tsallis entropy and the Tsallis relative entropy we refer the reader to references [9, 10].
The Tsallis relative entropy is for positive definite operators and defined by setting
[TABLE]
By letting tend to one this expression converges to the relative quantum entropy
[TABLE]
introduced by Umegaki [11]. It is known [3, Proposition 2.4] that the Tsallis relative entropy is non-negative for states. This also follows directly from the following:
Lemma 4.1**.**
Let and be states. Then
[TABLE]
for
Proof.
Consider states and and let be the set of exponents such that We take and obtain
[TABLE]
where we used Cauchy-Schwarz’ inequality. This shows that is midpoint-convex. Since also is closed and we conclude that ∎
Theorem 4.2**.**
Let and take Then, for positive definite operators the inequality
[TABLE]
is valid, where by convention
Proof.
Let be positive definite operators and take and By setting
[TABLE]
we obtain self-adjoint such that both and are bounded from below by We may thus apply of Theorem 3.2 and obtain after a little calculation the inequality
[TABLE]
By setting and renaming by we obtain the stated inequality for and ∎
The lower bound of the Tsallis relative entropy in Theorem 4.2 was obtained in [3, Theorem 3.3] in the special case The family of lower bounds given above is in general not an increasing function in the parameter and may therefore, depending on and provide better lower bounds.
5 Various Fréchet differentials
In order to obtain a more detailed understanding of the bounds obtained in the Main Theorem we need to provide explicit formulae for the Fréchet differential operator in the parameter range The integral representation
[TABLE]
valid for is well-known. Since is operator monotone the representation may be quite easily derived by calculating the representing measure, see for example [6, Theorem 5.5]. Furthermore, since by an elementary calculation
[TABLE]
we obtain the integral representation
[TABLE]
valid for positive definite Since by (5.1) we have
[TABLE]
for and
[TABLE]
we obtain the integral representation
[TABLE]
valid for positive definite By using the rule for the Fréchet differential of a product, or by an elementary direct calculation, we obtain the general identity
[TABLE]
which combined with (5.2) provides a formula for the Fréchet differential of for If is self-adjoint the formula in (5.5) may be written on the form
[TABLE]
which is then manifestly self-adjoint.
5.1 The deformed logarithm
By setting in (5.1) we obtain
[TABLE]
and thus
[TABLE]
for and We therefore obtain the following integral representation of the deformed logarithm
[TABLE]
valid for Since by an elementary calculation
[TABLE]
we derive the formula
[TABLE]
valid for positive definite and Note that
[TABLE]
for all by the definition of the deformed logarithm. If we in formula (5.7) let tend to we obtain
[TABLE]
as expected. If we instead set we recover the classical integral
[TABLE]
valid for and
5.2 The deformed exponential
We next derive integral representations for the deformed exponential in the parameter interval We first note that
[TABLE]
for Therefore,
[TABLE]
for positive definite We divide the analysis into four cases:
If then
[TABLE]
and we may therefore calculate by the formula in (5.4). 2. 2.
If then thus
[TABLE] 3. 3.
If then we have
[TABLE]
We may therefore calculate by the formulae in (5.5) and (5.2). 4. 4.
If then thus
[TABLE] 5. 5.
If then
[TABLE]
and we may calculate by the formula in (5.2).
Note that we for any and have the identity
[TABLE]
Likewise,
[TABLE]
for commuting and
Acknowledgments
The authors would like to thank the anonymous referees for helpful suggestions. The first author acknowledges support by the Japanese Grant-in-Aid for scientific research 17K05267 and by the National Science Foundation of China 11301025. The second and third authors acknowledge support from the National Science Foundation of China 11571229.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. M. Bikchentaev, The Peierls-Bogoliubov inequality in C*-algebras and characterization of tracial functionals. Lobachevskii J. Math. 32(3): 175-179, 2011.
- 2[2] E. A. Carlen, E. H. Lieb, Remainder terms for some quantum entropy inequalities. J. Math. Phys. 55(4): 042201, 2014.
- 3[3] S. Furuichi, K. Yanagi, K. Kuriyama, Fundamental properties of Tsallis relative entropy. J. Math. Phys. 45(12): 4868-4877, 2004.
- 4[4] S. Furuichi. Trace inequalities in non-extensive statistical mechanics. Linear Algebra and Its Applications 418:821–827, 2006.
- 5[5] F. Hansen and G.K. Pedersen. Perturbation formulas for traces on C ∗ superscript 𝐶 C^{*} -algebras. Publ. RIMS, Kyoto Univ. 31:169–178, 1995.
- 6[6] F. Hansen. The fast track to Löwner’s theorem. Linear Algebra Appl. 438:4557–4571, 2013.
- 7[7] F. Hansen, Golden-Thompson’s inequality for deformed exponentials. J. Stat. Phys. 159(5): 1300-1305, 2015.
- 8[8] E. Lieb and G.K. Pedersen, Convex multivariable trace functions, Reviews in Mathematical Physics 14: 631–648, 2002.
