Interpolation of Operators With Trace Inequalities Related To The Positive Weighted Geometric Mean
Victoria Chayes

TL;DR
This paper explores the interpolation of weighted geometric means of positive matrices using trace inequalities related to the Golden-Thompson inequality, extending known results to the full real line and characterizing equality cases.
Contribution
It extends inequalities for the weighted geometric mean of positive matrices to all real t, providing a comprehensive view of interpolation with Golden-Thompson inequalities and equality characterizations.
Findings
Extended inequalities to the entire real line for weighted geometric means.
Provided new proofs for existing inequalities using exterior inequalities.
Characterized equality cases for strictly increasing unitarily invariant norms.
Abstract
There are various generalizations of the geometric mean for to positive matrices, and we consider the standard positive geometric mean . Much research in recent years has been devoted to relating the weighted version of this mean for with operators and in Golden-Thompson-like inequalities. These inequalities are of interest to mathematical physicists for their relationship to quantum entropy, relative quantum entropy, and R\'{e}nyi divergences. However, the weighted mean is well-defined for the full range of . In this paper we examine the value of and variations thereof in comparison to and β¦
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematical Inequalities and Applications Β· Matrix Theory and Algorithms Β· Numerical methods in inverse problems
Interpolation of Operators With Trace Inequalities Related To The Positive Weighted Geometric Mean
Victoria M.Β Chayes
Department of Mathematics
Rutgers University
Piscataway, NJ 08854
Abstract
There are various generalizations of the geometric mean for to positive matrices, and we consider the standard positive geometric mean . Much research in recent years has been devoted to relating the weighted version of this mean for with operators and in Golden-Thompson-like inequalities. These inequalities are of interest to mathematical physicists for their relationship to quantum entropy, relative quantum entropy, and RΓ©nyi divergences. However, the weighted mean is well-defined for the full range of . In this paper we examine the value of and variations thereof in comparison to and for any unitarily invariant norm and in particular the trace norm, creating for the first time the full picture of interpolation of the weighted geometric mean with the Golden-Thompson Inequality. We expand inequalities known for with , to the entire real line, and comment on how the exterior inequalities can be used to provide elegant proofs of the known inequalities for . We also characterize the equality cases for strictly increasing unitarily invariant norms.
K****eywordsβoperator interpolation Β trace inequalities Β geometric matrix mean Β log majorization Β quantum entropy
1 Introduction
β¦.Consider the following functions on positive definite complex matrices:
[TABLE]
Each can be seen as an operator analogue of the geometric mean function defined on . In fact, the third is known as the geometric mean of positive matrices, first introduced by Pusz and Woronowicz [14] in 1975 as a way of generalizing to sesquilinear forms.
β¦.Mathematicians and mathematical physicists will recognize the first two operator functions from their placement in the Golden-Thompson Inequality, proven by Golden for non-negative definite matrices in 1965 [7] and independently the same year by Thompson [16] for all Hermitian matrices and ,
[TABLE]
Characterizing the general relationship between these operator functions in a similar manner will be the goal of this paper.
β¦.Each of the operators functions can be extended as a function of as
[TABLE]
The third operator is referred to as the weighted geometric mean for , and Hiai and Petz proved in 1993 [10] the famous complement to the Golden-Thompson Inequality for Hermitian matrices and
[TABLE]
This is a corollary of the more general identity proven in 1994 with log majorization techniques [1] [2] for all , , and unitarily invariant norms
[TABLE]
β¦.The relationship between and has been fully characterized: first in [12] for the range then in [9] for the range , from which negative reciprical inequalities can be deduced. This paper for the first time pulls together the full range of the comparison of all three operators. We do so by proving a similar complimentary identity:
Theorem 1.1**.**
Let and be Hermitian, and . Then
[TABLE]
for any unitarily invariant norm .
Theorem 1.1 is proven in Section 4. It should be noted that Theorem 1.1 can be deduced in the in [12]; here we provide proof for the entire range together.
β¦.This allows us for the first time to characterize the relationship between the three operator functions for all :
Theorem 1.2**.**
For Hermitian matrices and , and and any unitarily invariant norm , then
[TABLE]
β¦.Note that as the trace is a unitarily invariant norm for positive matrices, that the above inequalities all hold for the traces of the matrices. Complimentary negative inequalities can be found taking .
Proof.
Equation 1.11 comes from taking the case of Equation 1.9, combined with the Golden-Thompson Inequality. Equation 1.12 comes taking the case in Equation 1.10 and Equation 4.12 (proven in [9] and discussed in Section 4) and the Golden-Thompson Inequality for the positive case, and using Relationship 2.6 to extend to the negative case. Equation 1.13 comes from taking the case in Equations 1.10 and Equation 4.13 (also proven in [9] and discussed in Section 4) and the Golden-Thompson Inequality for the positive case, and once more using Relationship 2.6 to extend to the negative case.
Theorem 1 in the case of the trace norm is illustrated in Figure 1.
β
β¦.Finally, we characterize the equality cases for all of the inequalities in Theorems 1 and 2:
Theorem 1.3**.**
For any strictly increasing unitarily invariant norm , there is equality in any of the inequalities of Equations 1.11, 1.12, or 1.13 if and only if and commute. Furthermore, for any strictly increasing unitarily invariant norm , there is equality in Equation 1.10 if and only if and commute.
β¦.Section 2 introduces the geodesic interpretation of the weighted geometric mean operator, which gives more significance and clarity to its various properties. Section 3 introduces the technique of log majorization and its relevant applications, which instrumental in prior work in this subject and in proving our main theorems. Section 4 contains the proof of Theorems 1.1 and 1.3, commentary on its connection to quantum relative entropy, and the interconnectedness of the , , and cases.
2 The Weighted Geometric Mean As Geodesics
β¦.Let denote the space of positive definite complex matrices. We consider the Riemannian metric with arc length of the smooth path defined by
[TABLE]
where denotes the Hilbert-Schmidt norm. Then the corresponding distance is
[TABLE]
β¦.This metric, first introduced by Skovgaard [15] for its applications in statistics, has the particular nice property that it is invariant under conjugation:
[TABLE]
for all invertible matrices . It is shown in [4] [3] that there is a unique constant speed geodesic for any running between and in unit time, namely
[TABLE]
β¦.These geodesics satisfy [4] for all and
[TABLE]
Of particular interest to us will be the , , which gives the identity
[TABLE]
This will allow us to extend all inequalities proven for to , and in particular to connect the positive and negative cases of Theorem 1.2.
β¦.Ando and Kubo prove in [13] that the map
[TABLE]
is jointly concave and monotone increasing in X and Y for . Carlen and Lieb prove [4] that it is jointly convex for . However, it is challenging to expand these results as is not operator monotone for and neither concave nor convex for . Therefore, proofs of trace inequalities cannot rely solely on operator monotonicty or convexity, so we turn to our main tool: log majorization.
3 Log Majorization
β¦.Let and , with and . Then weakly majorizes , written , when
[TABLE]
and is majorized when the final inequality is an equality. Weak log majorization is similarly defined for non-negative vectors as
[TABLE]
with log majorization when the final inequality is an equality.
β¦.We define all of the above majorization for matrices, ie and all variations, when the singular values in descending order considered as a vector .
β¦.We list the following results for convenience:
Lemma 3.1**.**
Let . Then the following conditions are equivalent: , for every unitarily invariant norm , and for every unitarily invariant norm and continuous increasing function such that and is convex.
β¦.A full proof can be found in [11].
Corollary 3.1.1**.**
When , and for a function that satisfies the hypotheses of Lemma 3.1 such that , then .
Proof.
Lemma 1 implies , then gives . β
Theorem 3.2**.**
(Araki [2], Theorem 1) For all ,
[TABLE]
or equivalently
[TABLE]
Corollary 3.2.1**.**
For all self-adjoint and unitarily invariant norm ,
[TABLE]
or equivalently
[TABLE]
Proof.
Take in Theorem 3.2 and apply the Lie-Trotter product formula. β
4 Trace Inequalities
Proof of Theorem 1.1.
Let , and . For , then , is convex and increasing, and . Then applying Corollary 3.1.1 and Theorem 3.2,
[TABLE]
We explicitly write
[TABLE]
As , we apply Corollary 3.2.1:
[TABLE]
Then by Lemma 1,
[TABLE]
for any unitarily invariant norm. Relationship 2.6 extends this to . β
Proof of Theorem 1.3.
When and commute, all of the expressions being considered are equal, and hence their norms will be equal. Therefore, it remains to consider the implications of equality for strictly increasing unitarily invariant norms.
β¦.It is known from Hiai [8] (Theorem 3.1) that there is equality in the inequalities in Equation 1.11 for a strictly increasing unitarily invariant norm if and only if and commute, and in [9] for all the relationships between and . Therefore, it remains to prove that there is equality in Equation 1.10 (re-stated above in Equation 4.6) if and only if and commute; then Relationship 2.6 extends the characterization to all , taking .
β¦.We use the following Lemma from Hiai [8]:
Lemma 4.1**.**
(Hiai 1994) Let . Then is not strictly increasing in if and only if and commute.
β¦.From the proof of Theorem 1.1, we have
[TABLE]
Then for a strictly increasing unitarily invariant norm, it follows that we have
[TABLE]
We then apply Lemma 4.1 to the second equality considering the Lie-Trotter product formula. β
β¦.It is interesting to note that Equation 1.10, the case actually implies Equation 1.9, the case. In [10], Hiai and Petz show that Equation 1.9 is equivalent to
[TABLE]
for all . The forward implication comes from noting the inequality trivially becomes equality at , so taking the limit from above one can relate the derivatives of each side. The backwards inequality is far more involved. Hiai and Petz prove Equation 1.9 by proving Equation 4.9 then making use of their equivalence.
β¦.The exact same reasoning behind the forward implication applies substituting in knowledge of the trace inequality for . Therefore, it is an immediate consequence of [10] that Equation 1.10 implies Equation 1.9: taking the derivative of Equation 1.10 at with some changes of variable produces Equation 4.9 directly from Equation 1.10.
β¦.As mentioned in Section 1, Hiai [9] recently published work regarding log majorization and RΓ©nyi divergences, including that for ,
[TABLE]
β¦.It then follows from Lemma 3.1 that for ,
[TABLE]
for any unitarily invariant norm .
β¦.It is possible to prove a simplified version of Equation 4.10 (the special case of r=1) using similar methods as in [9], with the Furuta inequality:
Theorem 4.2**.**
Let be Hermitian, and . Then
[TABLE]
Proof.
We use the standard antisymmetric tensor power technique to prove log majorization, noting that for and then
[TABLE]
and that the antisymmetric tensor product has the properties , , and for all and , [Hiai20142]. Then as
[TABLE]
it suffices to show that
[TABLE]
β¦.Suppose . We use the following relation [6] for all , invertible, and real number :
[TABLE]
Then
[TABLE]
β¦.We now apply the Furuta Inequality [5] for , , , with :
[TABLE]
We choose , , , , . Note that as then , and . Furthermore,
[TABLE]
so all the hypotheses of the inequality are satisfied. Then
[TABLE]
and
[TABLE]
Therefore, we conclude
[TABLE]
which implies that
[TABLE]
and so
[TABLE]
β
β¦.Hiai provides a full analysis on the connection to RΓ©nyi divergences by noting that in the case, , and one can use the same technique taking the derivative:
[TABLE]
which using
[TABLE]
gives
[TABLE]
β¦.Theorem 4.2 gives the same derivative identity at . We thus conjecture that similar methods to [10] it may be possible to show that
[TABLE]
also implies
[TABLE]
5 Acknowledgements
Thank you to Professor Eric A Carlen for suggesting the problem and providing an introduction to the general subject.
This research was funded by the NDSEG Fellowship, Class of 2017.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Ando, T., Hiai, F.: Log majorization and complementary Golden-Thompson type inequalities. Linear Algebra and its Applications 197-198 , 113 β 131 (1994). DOI https://doi.org/10.1016/0024-3795(94)90484-7 . URL http://www.sciencedirect.com/science/article/pii/0024379594904847
- 2[2] Araki, H.: On an inequality of Lieb and Thirring. Letters in Mathematical Physics 19 (2), 167β170 (1990). DOI 10.1007/BF 01045887 . URL https://doi.org/10.1007/BF 01045887 Β· doiΒ β
- 3[3] Bhatia, R., Holbrook, J.: Riemannian geometry and matrix geometric means. Linear Algebra and its Applications 413 (2), 594 β 618 (2006). DOI https://doi.org/10.1016/j.laa.2005.08.025 . URL http://www.sciencedirect.com/science/article/pii/S 0024379505004350 . Special Issue on the 11th Conference of the International Linear Algebra Society, Coimbra, 2004
- 4[4] Carlen, E.A., Lieb, E.H.: Some trace inequalities for exponential and logarithmic functions. Bulletin of Mathematical Sciences (2018). DOI 10.1007/s 13373-018-0123-3 . URL https://doi.org/10.1007/s 13373-018-0123-3 Β· doiΒ β
- 5[5] Furuta, T.: a β₯ b β₯ 0 π π 0 a\geq b\geq 0 assures ( b r β a p β b r ) 1 / q β₯ b ( p + 2 β r ) / q superscript superscript π π superscript π π superscript π π 1 π superscript π π 2 π π (b^{r}a^{p}b^{r})^{1/q}\geq b^{(p+2r)/q} for r β₯ 0 , p β₯ 0 , q β₯ 1 formulae-sequence π 0 formulae-sequence π 0 π 1 r\geq 0,p\geq 0,q\geq 1 with ( 1 + 2 β r ) β q β₯ p + 2 β r 1 2 π π π 2 π (1+2r)q\geq p+2r . Proceedings of the American Mathematical Society 101 (1), 85β88 (1987). URL http://ww
- 6[6] Furuta, T.: Extension of the Furuta inequality and Ando-Hiai log-majorization. Linear Algebra and its Applications 219 , 139 β 155 (1995). DOI https://doi.org/10.1016/0024-3795(93)00203-C . URL http://www.sciencedirect.com/science/article/pii/002437959300203 C
- 7[7] Golden, S.: Lower bounds for the helmholtz function. Phys. Rev. 137 , B 1127βB 1128 (1965). DOI 10.1103/Phys Rev.137.B 1127 . URL https://link.aps.org/doi/10.1103/Phys Rev.137.B 1127
- 8[8] Hiai, F.: Equality cases in matrix norm inequalities of Golden-Thompson type. Linear and Multilinear Algebra 36 (4), 239β249 (1994). DOI 10.1080/03081089408818297 . URL https://doi.org/10.1080/03081089408818297 Β· doiΒ β
