Gleason-type Theorems from Cauchy's Functional Equation
Victoria J Wright, Stefan Weigert

TL;DR
This paper offers an alternative proof of Gleason-type theorems in quantum mechanics by leveraging solutions to Cauchy's functional equation, connecting probability additivity to functional equation techniques.
Contribution
It introduces a novel proof method for Gleason-type theorems using functional equations, expanding the mathematical tools available for foundational quantum theory.
Findings
Provides a new proof of Gleason-type theorems
Links additivity in quantum probabilities to Cauchy's functional equation
Enhances understanding of the mathematical structure underlying quantum measurement
Abstract
Gleason-type theorems derive the density operator and the Born rule formalism of quantum theory from the measurement postulate, by considering additive functions which assign probabilities to measurement outcomes. Additivity is also the defining property of solutions to Cauchy's functional equation. This observation suggests an alternative proof of the strongest known Gleason-type theorem, based on techniques used to solve functional equations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Gleason-type Theorems from
Cauchy’s Functional Equation
Victoria J Wright and Stefan Weigert
Department of Mathematics, University of York
York YO10 5DD, United Kingdom
[email protected], [email protected]
(May 2019)
Abstract
Gleason-type theorems derive the density operator and the Born rule formalism of quantum theory from the measurement postulate, by considering additive functions which assign probabilities to measurement outcomes. Additivity is also the defining property of solutions to Cauchy’s functional equation. This observation suggests an alternative proof of the strongest known Gleason-type theorem, based on techniques used to solve functional equations.
1 Introduction
Gleason’s theorem [1] is a fundamental result in the foundations of quantum theory simplifying the axiomatic structure upon which the theory is based. The theorem shows that quantum states must correspond to density operators if they are to consistently assign probabilities to the outcomes of projective measurements in Hilbert spaces of dimension three or larger.111By a consistent assignment of probabilities we mean one in which the probabilities for all outcomes of a given measurement sum to one.
More explicitly, let be the lattice of self-adjoint projections onto closed subspaces of a separable Hilbert space of dimension at least three. Consider functions , that are *finitely additive *for projections and onto *orthogonal *subspaces of , i.e.
[TABLE]
Gleason’s result shows that the solutions of Eq. (1) (in finite dimensional Hilbert spaces) 222Gleason proved this result in all separable Hilbert spaces if the condition of finite additivity is replaced with -additivity. These conditions are equivalent in finite dimensional Hilbert spaces. Later Christensen [8] showed that the weaker condition of finite additivity was also sufficient for the result to hold in infinite dimensions. necessarily admit an expression
[TABLE]
for some positive-semidefinite self-adjoint operator on .
The result does not hold, however, in Hilbert spaces of dimension two since the constraints (1) degenerate in this case: the projections lack the “intertwining” property [1] present in higher dimensions. In 2003, Busch [2] and then Caves et al. [3] extended Gleason’s theorem to dimension two by considering *generalised *quantum measurements described by positive operator-valued measures, or POMs. In analogy with Gleason’s original requirement, a state is now defined as an additive probability assignment not only on projections but on a larger set of operators, the space of effects333An effect on is a self-adjoint operator satisfying for all vectors . defined on a separable Hilbert space. Then, in finite dimensional Hilbert spaces, any function satisfying finite additivity,
[TABLE]
for effects such that
[TABLE]
is found to necessarily admit an expression of the form given in Eq. (2).444This result does not imply Gleason’s result since in dimensions greater than two the requirement (3) is stronger than the requirement (1). The effects and are said to coexist since the condition in Eq. (4) implies that they occur in the range of a single POM. More recently, it has been shown that this Gleason-type theorem555It is important to clearly distinguish Gleason*-type* theorems from Gleason’s original theorem. also follows from weaker assumptions: it is sufficient to require Eq. (3) hold only for effects and that coexist in *projective-simulable *measurements obtained by mixing projective measurements [4].
Finitely additive functions were first given serious consideration in 1821 when Cauchy [5] attempted to find all solutions of the equation
[TABLE]
for real variables . In addition to the obvious linear solutions, non-linear solutions to Cauchy’s functional equation are known to exist [6]. However, the non-linear functions satisfying Eq. (5) cannot be Lebesgue measurable [7], continuous at a single point [9] or bounded on any set of positive measure [10]. Similar results also hold for Cauchy’s functional equation with arguments more general than real numbers, reviewed in [11], for example.
Recalling that the Hermitian operators on form a real vector space, it becomes clear that the Gleason-type theorems described above can be viewed as results about the solutions of Cauchy’s functional equation for vector-valued arguments: additive functions on subsets of a real vector space, subject to some additional constraints, are necessarily linear. Taking advantage of this connection, we use results regarding Cauchy’s functional equation to present an alternative proof of known Gleason-type theorems.
In Sec. 2, we spell out four conditions that single out *linear *solutions to Cauchy’s functional equation defined on a finite interval of the real line. The main result of this paper—an alternative method to derive Busch’s Gleason-type theorem—is presented in Sec. 3. We conclude with a summary and a discussion of the results in Sec. 4.
2 Cauchy’s functional equation on a finite interval
In 1821 Cauchy [5] showed that a continuous function over the real numbers satisfying Eq. (5) is necessarily linear. It is important to note, however, that relaxing the continuity restriction does allow for non-linear solutions [6], as pathological as they may be.666The existence of non-linear solutions depends on the existence of Hamel bases and, thus, on the axiom of choice. Other conditions known to ensure linearity of a finitely additive function include Lebesgue measurability [7], positivity on small numbers [12] or continuity at a single point [9]. We begin by proving a related result, in which the domain of the function is restricted to an interval, as opposed to the entire real line.
Theorem 1**.**
Let and be a function that satisfies
[TABLE]
for all such that . The function is necessarily linear, i.e.
[TABLE]
if it satisfies any one of the following four conditions:
- (i)
* for some and all ;* 2. (ii)
* for some and all ;* 3. (iii)
* is continuous at zero;* 4. (iv)
* is Lebesgue-measurable.*
Theorem 1 says that non-linear solutions of Eq. (6) cannot be bounded from below or above, continuous at zero or Lebesgue measurable. We will now prove the linearity of for Case (i). The proofs for the remaining cases are given in Appendix A.
Proof.
We will extend to a finitely additive function on the entire real line. For any real number , Eq. (6) implies that
[TABLE]
where is a positive integer. If we choose an integer with , then we have
[TABLE]
In a first step, we extend the function to all non-negative real numbers by defining
[TABLE]
for real numbers and integers . This extension is well-defined since for any two sufficiently large integers, i.e. and with , we have
[TABLE]
according to Eq. (8), resulting in the identity
[TABLE]
Finite additivity on the positive half-line also holds since for any two non-negative numbers , we find
[TABLE]
for sufficiently large which ensures that .
In a second step, we extend the function to the entire real line by defining
[TABLE]
To show that the function is finitely additive on all of , three cases must be considered.
If both and , we have
[TABLE]
using that holds for non-negative real numbers and .
If , and , we have
[TABLE]
If , and , we have
[TABLE]
This property completes the proposed extension of the function to a finitely additive function on the real line that is bounded above on the interval . Ostrowski [13] and Kestelman [10] showed that finitely additive functions on the real line that are bounded above on a set of positive measure are necessarily linear. Therefore, the extended function is linear, and its restriction back to the interval is given by . ∎
3 When Cauchy meets Gleason: additive functions on effect spaces
The first Gleason-type theorem, published in 2003, assumes additivity of the frame function not only on projections that occur in the same projection-valued measure (PVM) but on the larger set of effects that coexist in the same POM.
Theorem 2** (Busch [2]).**
Let be the space of effects on and be the identity operator on . Any function satisfying
[TABLE]
and
[TABLE]
for all such that , admits an expression
[TABLE]
for some density operator , and all effects .
Theorem 2 rephrases the (finite-dimensional case of the) theorem proved by Busch [2] and the theorem due to Caves et al. [3]. Busch uses the positivity of the frame function to directly establish its homogeneity whereas Caves et al. derive homogeneity by showing that the frame function must be continuous at the zero operator. These arguments seem to run in parallel with Cases (ii) and (iii) of Theorem 1 presented in the previous section. In Sec. 3.2, we will give an alternative proof of Theorem 2 which can be based on any of the four cases of Theorem 1.
3.1 Preliminaries
To begin, let us introduce a number of useful concepts and establish a suitable notation. Throughout this section we will make use of the fact that the Hermitian operators on constitute a real vector space of dimension , which we will denote by . We may therefore employ the standard inner product , for Hermitian operators and , in our reasoning as well as the norm which it induces.
A discrete POM on is described by its range, i.e. by a sequence of effects \left\llbracket E_{1},E_{2},\dots\right\rrbracket that sum to the identity operator on . A minimal informationally-com-plete (MIC) POM on consists of exactly linearly independent effects, {\cal M}=\left\llbracket M_{1},\ldots,M_{d^{2}}\right\rrbracket. Hence, MIC-POMs constitute bases of the vector space of Hermitian operators, and it is known that they exist in all finite dimensions [14].
Positive linear combinations of effects will play an important role below, giving rise to the following definition.
Definition 1**.**
The positive cone of a set of Hermitian operators on is the set of non-negative linear combinations of the elements of , i.e. the set
[TABLE]
Note that the expression of an element of as a linear combination of elements of requires at most terms as a consequence of Caratheodory’s theorem.
Next, we introduce so-called “augmented” bases of the space which are built around sets of projections where the vectors form an orthonormal basis of .
Definition 2**.**
An *augmented basis *of the Hermitian operators on is a set of linearly independent rank-one effects satisfying
- (i)
for , with and an orthonormal basis of ; 2. (ii)
.
Given any orthonormal basis of , we can construct an augmented basis for the space of operators acting on it. First, complete the projectors
[TABLE]
into a basis of the Hermitian operators on , by adding further rank-one projections; this is always possible [14]. The sum
[TABLE]
is necessarily a positive operator. The relation implies that G must have at least one eigenvalue larger than . If is the largest eigenvalue of , then is an effect since it is a positive operator with eigenvalues less than or equal to one. Defining
[TABLE]
the set turns into an augmented basis. One can show that can never correspond to a POM. Nevertheless, the effects coexist in the sense that they can occur in one single POM, for example \left\llbracket B_{1},\ldots,B_{d^{2}},\operatorname{I}-G/\Gamma\right\rrbracket.
Given an effect, one can always represent it as a positive linear combination of elements in a suitable augmented basis.
Lemma 1**.**
For any effect there exists an augmented basis such that is in the positive cone of .
Proof.
By the spectral theorem we may write
[TABLE]
for an orthonormal basis of . Take to be an augmented basis with
[TABLE]
for and some . Then we may express as the linear combination
[TABLE]
with non-negative coefficients
[TABLE]
showing that the positive cone of the basis indeed contains the effect . ∎
Finally, we need to establish that the intersection of the positive cones associated with an augmented basis and a MIC-POM, respectively, has dimension .
Lemma 2**.**
Let be an augmented basis and \mathcal{M}=\left\llbracket M_{1},\ldots,M_{d^{2}}\right\rrbracket a MIC-POM on . The effects in the intersection of the positive cones of and span the real vector space of Hermitian operators on .
Proof.
Since the effects in a POM sum to the identity, we have
[TABLE]
With each of the coefficients in the unique decomposition on the right-hand side being finite and positive (as opposed to non-negative), the effect is seen to be an interior point of the positive cone . At the same time, the effect is located on the boundary of the cone since its expansion in an augmented basis has only non-zero terms. Let us define the operator
[TABLE]
which, for any positive , is an interior point of the cone : each of the positive coefficients in its unique decomposition in terms of the augmented basis is non-zero; we have used Property 1 of Def. 2 to express the identity in terms of the basis . For sufficiently small values of , the operator is also an interior point of the open ball with radius about the point since
[TABLE]
holds whenever
[TABLE]
Being an interior point of both the positive cones and , the operator is at the center of an open ball , located entirely in the intersection (cf. Fig. 1). Since the ball has dimension the effects contained in it must indeed span the real vector space of Hermitian operators. ∎
Combining Theorem 1 with Lemmata 1 and 2 will allow us to present a new proof of Busch’s Gleason-type theorem.
3.2 An alternative proof of Busch’s Gleason-type theorem
Recalling that the trace of the product of two Hermitian operators constitutes an inner product on the vector space of Hermitian operators, Theorem 2 essentially states that the frame function acting on an effect can be written as the inner product of that effect with a fixed density operator. To underline the connection with the inner product we adopt the following notation. Let be a basis for the Hermitian operators on . We describe the effect by the “effect vector” , given by its expansion coefficients in this basis,
[TABLE]
where is an operator-valued vector with components. Theorem 2 now states that the frame function is given by a scalar product,
[TABLE]
between the effect vector and a fixed vector . Let us determine the relation between the density matrix in (20) in the theorem and the vector in (34). Consider any orthonormal basis of the Hermitian operators on and let be the vector such that . Then we may write
[TABLE]
here is a fixed vector given by and is the inverse transpose of the change-of-basis matrix between the bases and , i.e. the matrix satisfying for all Hermitian operators . By the definition of a frame function the operator
[TABLE]
must be positive semi-definite (since is positive) and have unit trace (due to Eq. (18)) i.e. be a density operator.
We will now prove that a frame function always admits an expression as in Eq. (34).
Proof.
By Lemma 1, there exists an augmented basis for any such that
[TABLE]
with coefficients , as in Eq. (33).
For each value , we write the restriction of the frame function to the set of effects of the form , for , as
[TABLE]
where and . By Eq. (19) we have that satisfies Cauchy’s functional equation, i.e. . Due to the assumption in Theorem 2 that , each must satisfy Condition (i) of Theorem 1 which implies
[TABLE]
Thus we find
[TABLE]
where the -th component of is given by , by repeatedly using additivity and Eq. (39). Note that Eq. (40) is not yet in the desired form of Eq. (34) since the vector depends on the basis and thus the effect .
Let \mathcal{M}=\left\llbracket M_{1},\ldots,M_{d^{2}}\right\rrbracket be a MIC-POM on . Since the elements of are a basis for the space , the Hermitian operators on , we have for any
[TABLE]
for coefficients some of which may be negative. There exists a fixed change-of-basis matrix such that
[TABLE]
for all effects . Now we have
[TABLE]
Any effect in the intersection of the positive cones and can be expressed in two ways,
[TABLE]
where both effect vectors and have only non-negative components. Eqs. (40) and (43) imply that
[TABLE]
Since by Lemma 2 there are linearly independent effects in the intersection , we conclude that
[TABLE]
Combining this equality with Equation (43) we find, for a fixed MIC-POM \mathcal{M}=\left\llbracket M_{1},\ldots,M_{d^{2}}\right\rrbracket and any effect , that the frame function takes the form
[TABLE]
Here is a fixed vector since it does not depend on . ∎
Note that Eq. (39) may also be found using the other three cases of Theorem 1. For Case (ii), we observe that each of the functions , is non-negative by definition. Alternatively, each function can be shown to be continuous at zero (Case (iii)) using the following argument which is similar to the one given in [14]. Assume is not continuous at zero. Then there exists a number such that for all we have
[TABLE]
for some . For any given choose , there is a value of such that . However, we have the inequality , which leads to
[TABLE]
contradicting the the existence of an upper bound of one on values of . Finally, each of the functions is Lebesgue measurable (Case (iv)) which follows from the monotonicity of the function.
4 Summary and discussion
We are aware of two papers linking Gleason’s theorem and Cauchy’s functional equation. Cooke et al. [15] used Cauchy’s functional equation to demonstrate the necessity of the boundedness of frame functions in proving Gleason’s theorem. Dvurečenskij [16] introduced frame functions defined on effect algebras but did not proceed to derive a Gleason-type theorem in the context of quantum theory.
In this paper, we have exploited the fact that additive functions are central to both Gleason-type theorems and Cauchy’s functional equation. Gleason-type theorems are based on the assumption that states assign probabilities to measurement outcomes via additive functions, or frame functions, on the effect space. Linearity of the frame functions* *has been shown to follow from positivity and other assumptions which are well-known in the context of Cauchy’s functional equation. Altogether, the result obtained here amounts to an alternative proof of the extension of Gleason’s theorem to dimension two given by Busch [2] and Caves et al. [14].
Other* *Gleason-type theorems are known that are *stronger, *in the sense that they depend on assumptions weaker than those of Theorem 2. The smallest known set of assumptions requires Eq. (19) to only be valid for effects and that coexist in a projective-simulable POM [17], i.e. a POM that may be simulated using only classic mixtures of projective measurements, as opposed to any POM. Since the proof given in [4] relies on Theorem 2, the alternative proof presented in Sec. 3.2 also gives rise to a new proof of the strongest existing Gleason-type theorem.
We have not been able to exploit the structural similarity between the requirements on frame functions and on the solutions of Cauchy’s functional equation in order to yield a new proof of Gleason’s original theorem. Additivity of frame functions defined on projections instead of effects does not provide us with the type of continuous parameters that are necessary for the argument developed here. It remains an intriguing open question whether such a proof does exist.
Acknowledgement*.*
The authors thank Jonathan Barrett for pointing out a gap in the proof of Theorem 1 given in an earlier version of this paper. VJW gratefully acknowledges funding from the York Centre for Quantum Technologies and the WW Smith fund.
Appendix A Proofs of Cases (ii), (iii) and (iv) of Theorem 1
It is shown that each of the conditions given in Cases (ii) to (iv) imply Theorem 1 which states that an additive function on a particular interval must be linear.
Proof.
Case (ii): Suppose that there exists a non-linear function satisfying Eq. (6) and Case (ii) of Theorem 1. Then the function defined by is non-linear but satisfies Eq. (6) and and , with , contradicting Case (i). ∎
Proof.
Case (iii): Since is continuous at zero and , as follows from Eq. (6), we have that for any , there exists a such that for all satisfying . Let be such that . First consider the case . Using additivity,
[TABLE]
we find
[TABLE]
On the other hand, if we have
[TABLE]
and then
[TABLE]
It follows that is continuous on . As in the proof for Case (i), Eqs. (8) and (9) show that
[TABLE]
for rational . Therefore, if is a sequence of rational numbers converging to , the function must be linear in :
[TABLE]
∎
In Case (iv), where is Lebesgue measurable, the proof of the analogous result for functions on the full real line by Banach [7] is easily adapted to our setting. Given Case (iii), it suffices to prove that is continuous at [math], i.e. that for every there exists a number such that
[TABLE]
holds for all .
Proof.
Case (iv): Let . Lusin’s theorem [18] states that, for a Lebesgue measurable function on an interval of Lesbesgue measure , there exists a compact subset of any measure such that the restriction of to this subset is continuous. Thus we may find a compact set with on which is continuous. Let be given. Since is compact, is uniformly continuous on and there exists a such that
[TABLE]
is valid for two numbers such that . Let . Suppose and were disjoint. Then we would have
[TABLE]
which contradicts . Taking a point then a number can be found such that
[TABLE]
for . Hence, remembering that , the function is continuous at . ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. M. Gleason, J. Math. Mech. 6 (1957) 885
- 2[2] P. Busch, Phys. Rev. Lett. 91 (2003) 120403
- 3[3] C. M. Caves, C. A. Fuchs, K. K. Manne and J. M. Renes, Found. Phys. 34 (2004) 193
- 4[4] V. J. Wright and S. Weigert, J. Phys. A 52 (2019) 055301
- 5[5] A. Cauchy, Cours D’analyse De L’école Royale Polytechnique (Debure, 1821)
- 6[6] G. Hamel, Math. Ann. 60 (1905) 459
- 7[7] S. Banach, Fund. Math. 1 (1920) 123
- 8[8] E. Christensen, Commun. Math. Phys. 86 (1982) 529
