Superadditivity of quantum relative entropy for general states
Angela Capel, Angelo Lucia, David P\'erez-Garc\'ia

TL;DR
This paper extends the superadditivity property of quantum relative entropy to arbitrary states, introducing a new inequality involving a state-dependent factor that bounds the sum of marginal divergences.
Contribution
It generalizes the superadditivity inequality for quantum relative entropy to all density operators, incorporating a state-dependent coefficient for broader applicability.
Findings
Established a new inequality for quantum relative entropy with arbitrary states.
Derived a state-dependent coefficient that bounds the superadditivity relation.
Provides theoretical foundation for analyzing quantum correlations in general states.
Abstract
The property of superadditivity of the quantum relative entropy states that, in a bipartite system , for every density operator one has . In this work, we provide an extension of this inequality for arbitrary density operators . More specifically, we prove that holds for all bipartite states and , where .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Superadditivity of quantum relative entropy for general states
Ángela Capel
,
Angelo Lucia
and
David Pérez-García
Instituto de Ciencias Matemáticas (CSIC-UAM-UC3M-UCM), C/ Nicolás Cabrera 13-15, Campus de Cantoblanco, 28049 Madrid, Spain
QMATH, Department of Mathematical Sciences, University of Copenhagen, Universitetsparken 5, 2100 Copenhagen, Denmark and NBIA, Niels Bohr Institute, University of Copenhagen, Blegdamsvej 17, 2100 Copenhagen, Denmark
Departamento de Análisis Matemático, Universidad Complutense de Madrid, 28040 Madrid, Spain and Instituto de Ciencias Matemáticas (CSIC-UAM-UC3M-UCM), C/ Nicolás Cabrera 13-15, Campus de Cantoblanco, 28049 Madrid, Spain
Abstract.
The property of superadditivity of the quantum relative entropy states that, in a bipartite system , for every density operator one has . In this work, we provide an extension of this inequality for arbitrary density operators . More specifically, we prove that holds for all bipartite states and , where .
1. Introduction and notation
The quantum relative entropy between two density operators and in a finite dimensional Hilbert space, , is given by if and by otherwise111It can also be defined in infinite dimensions, as well as generalized von Neumann algebras [17]. However, in this work, for simplicity we will restrict to finite dimensions.. It constitutes a measure of distinguishability between two quantum states and is a fundamental tool in quantum information theory [17], [26].
The quantum relative entropy is the quantum analogue of the Kullback-Leibler divergence [12], the probabilistic relative entropy. Its origin lies in mathematical statistics, where it is used to measure how much two states differ in the sense of statistical distinguishability. The larger the relative entropy of two states is, the more information for discriminating between the hypotheses associated to them can be obtained from an observation.
One of the main properties of quantum relative entropy is superadditivity, which states that in a bipartite system one has:
[TABLE]
for all , where we use the standard notation and is the partial trace.
Since (Proposition 2)
[TABLE]
(1) is equivalent to the fact that the mutual information is always non-negative, a fact that appears ubiquitously in quantum information theory.
In the form (1), superadditivity of the quantum relative entropy has found applications in e.g. quantum thermodynamics [7], statistical physics [17, Chapter 13] or hypothesis testing [9]. Indeed, as proven recently in [27] (building on results from [15]), the property of superadditivity, along with the properties of continuity with respect to the first variable, monotonicity and additivity (Proposition 1), characterizes axiomatically the quantum relative entropy.
The main aim of this work is to provide a quantitative extension of (1) for an arbitrary density operator . Note that for all and , as a consequence of monotonicity of the quantum relative entropy for the partial trace, the following holds:
[TABLE]
Therefore we aim to give a constant at the LHS of (1) that measures how far is from .
Following [5] we will consider as the distance from to “ multiplied by the inverse of ”. In the case in which and commute there is a unique way to define this: . In the non-commutative case, however, there are many possible ways to define the multiplication by the inverse. The one we will take in the result below is a symmetric analogue of the commutative case, . Another one that will appear in the proof of this result is the derivative of the matrix logarithm on evaluated on , , whose explicit equivalent expressions shown in [13] and [21] will be presented later.
{tBox}
Theorem 1**.**
For any bipartite states :
[TABLE]
where
,
and denotes the identity operator in .
Note that if .
This result constitutes an improvement to (2) whenever (and, hence, ). Then, it is likely to be relevant for situations where it is natural to assume . This is the case of (quantum) many body systems where such property is expected to hold for spatially separated regions in the Gibbs state above the critical temperature. Indeed, a classical version of Theorem 1 proven by Cesi [3] and Dai Pra, Paganoni and Posta [5], was the key step to provide the arguably simplest proof of the seminal result of Martinelli and Olivieri [14] connecting the decay of correlations in the Gibbs state of a classical spin model with the mixing time of the associated Glauber dynamics, via a bound on the log-Sobolev constant.
1.1. Notation
We consider a finite dimensional Hilbert space . We denote the set of bounded linear operators acting on by (whose elements we denote by lowercase Latin letters: …), and its subset of Hermitian operators by (whose elements we call observables). The set of positive semidefinite Hermitian operators is denoted by . We also denote the set of density operators by (whose elements we also call states and denote by lowercase Greek letters: …).
A linear map is called a superoperator. We say that a superoperator is positive if it maps positive operators to positive operators. Moreover, we denote as completely positive if is positive for every , where is the space of complex matrices. We also say that is trace preserving if for all . Finally, if verifies all these properties, i.e., is a completely positive and trace preserving map, it is called a quantum channel (for more information on this topic, see [28]).
We denote by the trace norm and by the operator norm . In the following section, we will make use of this (Hölder) inequality [1]:
[TABLE]
In most of the paper, we consider a bipartite finite dimensional Hilbert space . When this is the case, we use the previous notation placing the subindex (resp. , ) in each of the previous sets to denote that the operators considered act on (resp. , ). There is a natural inclusion of in by identifying .
1.2. Relative entropy
Let be a finite dimensional Hilbert space, , verifying . The quantum relative entropy of and is defined by [23]:
[TABLE]
Remark 1*.*
In most of the paper we only consider density matrices (with trace ). Let . In this case, the quantum relative entropy is given by:
[TABLE]
In the following proposition, we collect some well-known properties of the relative entropy, which will be of use in the following section.
Proposition 1** (Properties of the relative entropy, [24]).**
s
Let be a bipartite finite dimensional Hilbert space, . Let . The following properties hold:
- (1)
Non-negativity.* and .* 2. (2)
Finiteness.* if, and only if, , where supp stands for support.* 3. (3)
Monotonicity.* for every quantum channel .* 4. (4)
Additivity.* .*
These properties, especially the property of non-negativity, allow to consider the relative entropy as a measure of separation of two states, even though, technically, it is not a distance (with its usual meaning), since it is not symmetric and lacks a triangle inequality.
Let us prove now the property of superadditivity, whenever .
Proposition 2**.**
Let and . If , then
,
where is the mutual information [20].
As a consequence,
.
Proof.
Since , we have
[TABLE]
Now, since is a relative entropy, it is greater or equal than zero (property 1 of Proposition 1), so
.
∎
We prove now a lemma for observables (non necessarily of trace ) which yields a relation between the relative entropy of two observables and the relative entropy of some dilations of each of them. In particular, it is a useful tool to express the relative entropy of two observables in terms of the relative entropy of their normalizations (i.e., the quotient of each of them by their trace).
Lemma 2**.**
Let be a finite dimensional Hilbert space and let such that . For all positive real numbers and , we have:
[TABLE]
Proof.
[TABLE]
where, in the first and third equality, we are using the linearity of the trace, and we are denoting by for every . ∎
Since the relative entropy of two density matrices is non-negative (property 1 of Proposition 1), we have the following corollary:
Corollary 1**.**
Let be a finite dimensional Hilbert space and let such that and . Then, the following inequality holds:
[TABLE]
Proof.
Since and are density matrices, we have that
,
and we can apply Lemma 2:
.
∎
2. Proof of main result
We divide the proof of Theorem 1 in four steps.
In the first step, we provide a lower bound for the relative entropy of and in terms of , and an error term, which we will further bound in the following steps.
{sBox}
Step 1**.**
For density matrices , it holds that
[TABLE]
*where and equality holds (both sides being equal to zero) if .
Moreover, if , then .*
Proof.
It holds that:
[TABLE]
where is defined as in the statement of the step and in the first equality we have used the fourth property of Proposition 1.
We can now apply Corollary 1 to obtain that
.
It is easy to check, given the definition of , that if , so both sides are equal to zero in this case.
Also, if , is equal to . In both cases we have . ∎
Our target now is to bound the error term, , in terms of the relative entropy of and times a constant which depends only on , and , and represents how far is from being a tensor product. In the second step of the proof, we will bound this term by the trace of the product of a term which contains this ‘distance’ between and and another term which depends on and not on . However, before that, we need to introduce some concepts and results.
First, we recall the Golden-Thompson inequality, proven independently in [8] and [22] (and extended to the infinite dimensional case in [19] and [2]), which says that for Hermitian operators and ,
[TABLE]
where we denote by the exponential of , given by
.
The trivial generalization of the Golden-Thompson inequality to three operators instead of two in the form is false, as Lieb mentioned in [13]. However, in the same paper, he provides a correct generalization of this inequality for three operators. This result has recently been extended by Sutter et al. in [21] via de so-called multivariate trace inequalities (see also the subsequent paper by Wilde [25], where similar inequalities are derived following the statements of [6]).
Theorem 3**.**
Let be positive semidefinite operators, and recall the definition of :
[TABLE]
* is positive semidefinite if is. We have that*
[TABLE]
This superoperator provides a pseudo-inversion of the operator with respect to the operator where it is evaluated. In particular, if and commute, it is exactly the standard inversion, as we can see in the following corollary.
Corollary 2**.**
If and commute, then
[TABLE]
and therefore
[TABLE]
This shows that Lieb’s theorem is really a generalization of Golden-Thompson inequality.
We use an alternative definition of this superoperator to obtain a necessary tool for the proof of Step 2. In [21, Lemma 3.4], Sutter, Berta and Tomamichel prove the following result:
Lemma 4**.**
For a positive semidefinite operator and a Hermitian operator the following holds:
**
with
.
Using this expression for , we can prove the following result, which is a quantum version of a result used in [5].
Lemma 5**.**
For every operator and the following holds:
**
where
.
Proof.
We only prove
,
since the other equality is completely analogous.
[TABLE]
because , the integral commutes with the trace, is a scalar for every and the exponent in the power of a tensor product can be split into both terms.
Now, since the trace is cyclic and using the fact that any operator in commutes with every operator in , we have:
[TABLE]
where we have used
,
and the fact that, for every and , the following holds:
.
∎
We are now in position to develop the second step of the proof.
{sBox}
Step 2**.**
With the same notation of 1, we have that
[TABLE]
where
.
Proof.
We apply Lieb’s theorem to the error term of inequality (8):
[TABLE]
where we are adding and substracting inside the trace in the last equality.
Now, using the fact , we have
.
Finally, in virtue of Lemma 5, it is clear that
.
Therefore,
.
Notice that if , then , so . ∎
In the third step of the proof, we need to bound in terms of the relative entropy of and times a constant depending only on (since represents how entangled is between the regions and ). The first well-known result we will use in this step is Pinsker’s inequality [4, 18].
Theorem 6**.**
For and density matrices, it holds that
[TABLE]
This result will be of use at the end of the proof to finally obtain the relative entropy in the right-hand side of the desired inequality. However, it is important to notice the different scales of the -norm of the difference between and and the relative entropy of and in Pinsker’s inequality. Since we are interested in obtaining the relative entropy with exponent one, we will need to increase the degree of the term with the trace we already have and from which we will construct an -norm (since, for the moment, its degree is one). We will see later that the fact that in we have split into two regions, the multiplicativity of the trace with respect to tensor products and the monotonicity of the relative entropy play a decisive role in the proof.
Another important fact that we notice in the left-hand side of Pinsker’s inequality is that there is a difference between two states (in fact, the ones appearing in the relative entropy). This justifies the use of Lemma 5 at the end of Step 2, to obtain something similar to the difference between and .
We are now ready to prove the third step in the proof of Theorem 1. {sBox}
Step 3**.**
With the notation of Theorem 1,
[TABLE]
Proof.
We use the multiplicativity with respect to tensor products of the trace norm and Hölder’s inequality between the trace norm and the operator norm. Thus,
[TABLE]
Finally, Pinsker’s inequality (Theorem 6) implies that
.
Therefore,
where in the last inequality we have used monotonicity of the relative entropy with respect to the partial trace (Proposition 1). ∎
If we now put together Steps 1, 2 and 3, we obtain the following expression
[TABLE]
with
.
This inequality already constitutes a quantitative extension of (1) for arbitrary density operators in the sense that if is a tensor product between and , we recover the usual superadditivity, and in general measures how far is from .
In the fourth and final step of the proof, we bound by
,
a quantity from which the closeness to [math] whenever is near from being a tensor product is directly deduced. It also has some physical interpretation in quantum many body systems that will be discussed after proving Step 4.
First, we need to introduce the setting of non-commutative spaces with a -weighted norm [11]. The central property of these non-commutative spaces is that they are equipped with a weighted norm which, for a full rank state , is given by
for every .
Analogously, the -weighted inner product is given by
for every .
Some fundamental properties of these spaces are collected in the following proposition.
Proposition 3**.**
Let . The following properties hold for -weighted norms:
- (1)
Order.* , with , we have .* 2. (2)
Duality.* , we have for .* 3. (3)
Operator norm.* , we have , the usual operator norm.*
Another tool we will use in the proof of Step 4 is the following result.
Lemma 7**.**
Consider and let be a quantum channel verifying , where denotes the dual of with respect to the Hilbert-Schmidt scalar product. Then, is contractive between and , i.e., the following inequality holds for every :
[TABLE]
Proof.
Using the property of duality for the -weighted norms of -spaces (property 2 of Proposition 3), we can write:
[TABLE]
where in the first step we have used the fact that, for every , coincides with the operator norm.
Recalling now that is the dual of with respect to the Hilbert-Schmidt scalar product, we have:
[TABLE]
Since we are considering the supremum over the observables verifying , if we apply to these inequalitites , we have (because of the assumption ).
Hence, if we denote , it is clear that whenever , also . Therefore,
[TABLE]
where the last equality comes again from the property of duality of weighted -norms.
∎
In the proof of the previous lemma we have made strong use of the property of duality of . Indeed, considering the -norm as dual of the operator norm, has been essential to obtain the desired result. Using similar tools, we can now prove the last step in the proof of Theorem 1.
{sBox}
Step 4**.**
With the notation of the previous steps, we have
[TABLE]
Proof.
The strategy we follow in this proof is the opposite to the one used in the previous lemma, i.e., we study now the -norm as the dual of the -norm. Since coincides with the usual -norm (operator norm) for every , we can write
.
Using the aforementioned property of duality for the -weighted norms of -spaces, we have:
[TABLE]
Let us analyze the terms and separately. For , we have:
[TABLE]
where in the third and last equality we have used the fact that the integral and the trace commute, and the fourth equality is due to the cyclicity of the trace. We have also defined:
.
If we were able to express in terms of , we could simplify the expression that appears in the supremum above. We can do that in the following way:
[TABLE]
where we have used again the properties of cyclicity of the trace and commutativity of the integral and the trace.
Placing now the values for and that we have just computed in the supremum of the first part of the proof, we have:
[TABLE]
This expression looks much simpler than the one we had before. However, we need to prove that in order to see as one of the terms where the supremum is taken. Indeed, if we consider the map given by
,
it is clearly a quantum channel and also verifies . Hence, in virtue of Lemma 7, we have
,
and, therefore,
In this last supremum over elements of -norm, we can undo the previous transformations in order to obtain again an -norm. First, we need to write the term in the supremum as a -product of two terms:
[TABLE]
Finally, using again the property of duality for the norms of and , we have:
[TABLE]
where we have used again the fact that coincides with the usual -norm for every .
In conclusion,
∎
By putting together Step 1, Step 2, Step 3 and Step 4, we conclude the proof of Theorem 1.
Remark 2*.*
This result constitutes an extension of the superadditivity property, i.e., the constant that appears in the statement of the main theorem is [math] when and is small whenever . A trivial upper bound can be found with respect to the trace distance as follows,
[TABLE]
Remark 3*.*
The term is also closely related to certain forms of decay of correlations of states that have already appeared in quantum many body systems, such as LTQO (Local Topological Quantum Order) [16], or the concept of local indistinguishability as a strengthened form of weak clustering in [10].
Let us suppose that for a certain small scalar that decays sufficiently fast as a function of the distance between regions and in a many body system, and denote by the expected value of an observable with respect to a state (usually the ground or thermal state of the system). Then, for every observable of the form , if the reduced density matrix on of is , the previous condition can be rewritten as
.
One can now compare this expression with the definition of decay of correlations
,
or LTQO
.
3. Conclusion
In this work, we have proven an extension of the property of superadditivity of the quantum relative entropy for general states. Our result constitutes an improvement to the usual lower bound for the relative entropy of two bipartite states, given by the property of monotonicity, in terms of the relative entropies in the two constituent spaces, whenever the second state is near to be a tensor product. Therefore, it might be relevant for situations where this property is expected to hold, such as quantum many body systems, in which it is likely that the Gibbs state satisfies this property in spatially separated systems.
In [10], Kastoryano and Brandao proved, for certain Gibbs samplers, the existence of a positive spectral gap for the dissipative dynamics, via a quasi-factorization result of the variance. This provides a bound for the mixing time of the evolution of the semigroup that drives the system to thermalization which is polynomial in the system size. We leave for future work the possibility of using the result of the present paper to obtain a quasi-factorization of the relative entropy in quantum many body systems, which could allow us to prove, under some conditions of decay of correlations on the Gibbs state, the existence of a positive log-Sobolev constant, obtaining an exponential improvement in the bound for the mixing time obtained in [10].
Acknowledgment
We are very grateful to D. Sutter and M. Tomamichel, who detected an error in a previous version of the paper. We also thank M. Junge for fruitful discussions. AC and DPG acknowledge support from MINECO (grant MTM2014-54240-P), from Comunidad de Madrid (grant QUITEMAD+- CM, ref. S2013/ICE-2801), and the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 648913). AC is partially supported by a La Caixa-Severo Ochoa grant (ICMAT Severo Ochoa project SEV-2011-0087, MINECO). AL acknowledges financial support from the European Research Council (ERC Grant Agreement no 337603), the Danish Council for Independent Research (Sapere Aude) and VILLUM FONDEN via the QMATH Centre of Excellence (Grant No. 10059). This work has been partially supported by ICMAT Severo Ochoa project SEV-2015-0554 (MINECO).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Bhatia , Matrix Analysis, Springer Science & \& Business Media 169 (1997), doi:10.1007/978-1-4612-0653-8.
- 2[2] M. Breitenbecker and H.R. Gruemm , Note on trace inequalities, Commun. Math. Phys. 26 (1972), 276-279, doi:10.1007/BF 01645522.
- 3[3] F. Cesi , Quasi-factorization of the entropy and logarithmic Sobolev inequalities for Gibbs random fields, Probab. Theory Related Fields 120 (2001), 569-584, doi:10.1007/PL 00008792.
- 4[4] I. Csiszár , Information-type measures of difference of probability distributions and indirect observations, Stud. Sci. Math. Hung. 2 (1967), 299-318.
- 5[5] P. Dai Pra, A.M. Paganoni and G. Posta , Entropy inequalities for unbounded spin systems, Ann. Probab. 30 (2002), 1959-1976, doi:10.1214/aop/1039548378.
- 6[6] F. Dupuis and M.M. Wilde , Swiveled Rényi entropies, Quantum Inf. Process. 15 (3) (2016), 1309-1345, doi:10.1007/s 11128-015-1211-x.
- 7[7] R. Gallego, J. Eisert and H. Wilming , Thermodynamic work from operational principles, New J. Phys. 18 (2016), 103017, doi:10.1088/1367-2630/18/10/103017.
- 8[8] S. Golden , Lower Bounds for the Helmholtz Function, Phys. Rev., Series II, 137 (1965), B 1127-B 1128, doi:10.1103/Phys Rev.137.B 1127.
