Gleason-type Theorems from Cauchy's Functional Equation

Victoria J Wright; Stefan Weigert

arXiv:1905.12751·quant-ph·December 10, 2019


TL;DR

This paper offers an alternative proof of Gleason-type theorems in quantum mechanics by leveraging solutions to Cauchy's functional equation, connecting probability additivity to functional equation techniques.

Contribution

It introduces a novel proof method for Gleason-type theorems using functional equations, expanding the mathematical tools available for foundational quantum theory.

Findings

01 Provides a new proof of Gleason-type theorems

02 Links additivity in quantum probabilities to Cauchy's functional equation

03 Enhances understanding of the mathematical structure underlying quantum measurement

Abstract

Gleason-type theorems derive the density operator and the Born rule formalism of quantum theory from the measurement postulate, by considering additive functions which assign probabilities to measurement outcomes. Additivity is also the defining property of solutions to Cauchy's functional equation. This observation suggests an alternative proof of the strongest known Gleason-type theorem, based on techniques used to solve functional equations.

Equations

$$f\left(P_{1}\right)+f\left(P_{2}\right)=f\left(P_{1}+P_{2}\right). \tag{1}$$

$$f\left(\cdot\right)=\operatorname{Tr}\left(\rho\,\cdot\right), \tag{2}$$

$$f\left(E_{1}\right)+f\left(E_{2}\right)=f\left(E_{1}+E_{2}\right), \tag{3}$$

$$E_{1}+E_{2}\in\mathcal{E}\left(\mathcal{H}\right), \tag{4}$$

$$f\left(x\right)+f\left(y\right)=f\left(x+y\right), \tag{5}$$

$$f\left(x\right)+f\left(y\right)=f\left(x+y\right), \tag{6}$$

$$f\left(x\right)=\frac{f\left(a\right)}{a}x, \tag{7}$$

$$f\left(x\right)=f\left(\frac{n}{n}x\right)=nf\left(\frac{x}{n}\right), \tag{8}$$

$$f\left(\frac{m}{n}x\right)=mf\left(\frac{x}{n}\right)=\frac{m}{n}f\left(x\right). \tag{9}$$

$$f_{+}\left(x\right)=nf\left(\frac{x}{n}\right), \tag{10}$$

$$f\left(\frac{x}{mn}\right)=\frac{1}{m}f\left(\frac{x}{n}\right)=\frac{1}{n}f\left(\frac{x}{m}\right), \tag{11}$$

$$mf\left(\frac{x}{m}\right)=nf\left(\frac{x}{n}\right). \tag{12}$$

$$\begin{aligned}f_{+}\left(x\right)+f_{+}\left(y\right)&=nf\left(\frac{x}{n}\right)+nf\left(\frac{y}{n}\right)\\&=nf\left(\frac{x+y}{n}\right)\\&=f_{+}\left(x+y\right),\end{aligned} \tag{13}$$

$$f_{\mathbb{R}}\left(x\right)=\begin{cases}f_{+}\left(x\right), & x\geq 0,\\ -f_{+}\left(-x\right), & x<0.\end{cases} \tag{14}$$

$$\begin{aligned}f_{\mathbb{R}}\left(x\right)+f_{\mathbb{R}}\left(y\right)&=-f_{+}\left(-x\right)-f_{+}\left(-y\right)\\&=-f_{+}\left(-x-y\right)\\&=f_{\mathbb{R}}\left(x+y\right),\end{aligned} \tag{15}$$

$$\begin{aligned}f_{\mathbb{R}}\left(x\right)+f_{\mathbb{R}}\left(y\right)&=f_{+}\left(x\right)-f_{+}\left(-y\right)\\&=f_{+}\left(x\right)-f_{+}\left(-y-x\right)-f_{+}\left(x\right)\\&=f_{\mathbb{R}}\left(x+y\right).\end{aligned} \tag{16}$$

$$\begin{aligned}f_{\mathbb{R}}\left(x\right)+f_{\mathbb{R}}\left(y\right)&=f_{+}\left(x\right)-f_{+}\left(-y\right)\\&=f_{+}\left(x+y\right)+f_{+}\left(-y\right)-f_{+}\left(-y\right)\\&=f_{\mathbb{R}}\left(x+y\right).\end{aligned} \tag{17}$$

$$f\left(\operatorname{I}_{d}\right)=1, \tag{18}$$

$$f\left(E_{1}\right)+f\left(E_{2}\right)=f\left(E_{1}+E_{2}\right), \tag{19}$$

$$f\left(E\right)=\operatorname{Tr}\left(E\rho\right), \tag{20}$$

$$\mathcal{C}\left(S\right)=\left\{H=\sum_{j=1}^{d^{2}}a_{j}H_{j},\ a_{j}\geq 0\ \text{for any}\ H_{j}\in S\right\}. \tag{21}$$

$$\Pi_{j}=|e_{j}\rangle\langle e_{j}|,\quad j=1\ldots d, \tag{22}$$

$$G=\sum_{j=1}^{d^{2}}\Pi_{j}, \tag{23}$$

$$B_{j}=\Pi_{j}/\Gamma,\quad j=1\ldots d^{2}, \tag{24}$$

$$E=\sum_{j=1}^{d}\lambda_{j}|e_{j}\rangle\langle e_{j}|,\quad\lambda_{j}\in\left[0,1\right], \tag{25}$$

$$B_{j}=c|e_{j}\rangle\langle e_{j}|, \tag{26}$$


Full text

Gleason-type Theorems from Cauchy’s Functional Equation

Victoria J Wright and Stefan Weigert

Department of Mathematics, University of York

York YO10 5DD, United Kingdom


(May 2019)


1 Introduction

Gleason’s theorem [1] is a fundamental result in the foundations of quantum theory, simplifying the axiomatic structure upon which the theory is based. The theorem shows that quantum states must correspond to density operators if they are to consistently assign probabilities to the outcomes of projective measurements in Hilbert spaces of dimension three or larger. [Footnote 1: By a consistent assignment of probabilities we mean one in which the probabilities for all outcomes of a given measurement sum to one.]

More explicitly, let $\mathcal{P}\left(\mathcal{H}\right)$ be the lattice of self-adjoint projections onto closed subspaces of a separable Hilbert space $\mathcal{H}$ of dimension at least three. Consider functions $f:\mathcal{P}\left(\mathcal{H}\right)\rightarrow\left[0,1\right]$ that are *finitely additive* for projections $P_{1}$ and $P_{2}$ onto *orthogonal* subspaces of $\mathcal{H}$, i.e.

$$f\left(P_{1}\right)+f\left(P_{2}\right)=f\left(P_{1}+P_{2}\right). \tag{1}$$

Gleason’s result shows that the solutions of Eq. (1) (in finite dimensional Hilbert spaces) [Footnote 2: Gleason proved this result in all separable Hilbert spaces if the condition of finite additivity is replaced with $\sigma$-additivity. These conditions are equivalent in finite dimensional Hilbert spaces. Later Christensen [8] showed that the weaker condition of finite additivity was also sufficient for the result to hold in infinite dimensions.] necessarily admit an expression

$$f\left(\cdot\right)=\operatorname{Tr}\left(\rho\,\cdot\right), \tag{2}$$

for some positive-semidefinite self-adjoint operator $\rho$ on $\mathcal{H}$ .

The result does not hold, however, in Hilbert spaces of dimension two since the constraints (1) degenerate in this case: the projections lack the “intertwining” property [1] present in higher dimensions. In 2003, Busch [2] and then Caves et al. [3] extended Gleason’s theorem to dimension two by considering *generalised* quantum measurements described by positive operator-valued measures, or POMs. In analogy with Gleason’s original requirement, a state is now defined as an additive probability assignment not only on projections but on a larger set of operators, the space $\mathcal{E}\left(\mathcal{H}\right)$ of effects [Footnote 3: An effect $E$ on $\mathcal{H}$ is a self-adjoint operator satisfying $0\leq\langle\psi|E\psi\rangle\leq\langle\psi|\psi\rangle$ for all vectors $|\psi\rangle\in\mathcal{H}$.] defined on a separable Hilbert space. Then, in finite dimensional Hilbert spaces, any function $f:\mathcal{E}\left(\mathcal{H}\right)\rightarrow\left[0,1\right]$ satisfying finite additivity,

$$f\left(E_{1}\right)+f\left(E_{2}\right)=f\left(E_{1}+E_{2}\right), \tag{3}$$

for effects $E_{1},E_{2}\in\mathcal{E\left(H\right)}$ such that

$$E_{1}+E_{2}\in\mathcal{E}\left(\mathcal{H}\right), \tag{4}$$

is found to necessarily admit an expression of the form given in Eq. (2). [Footnote 4: This result does not imply Gleason’s result since in dimensions greater than two the requirement (3) is stronger than the requirement (1).] The effects $E_{1}$ and $E_{2}$ are said to coexist since the condition in Eq. (4) implies that they occur in the range of a single POM. More recently, it has been shown that this Gleason-type theorem [Footnote 5: It is important to clearly distinguish Gleason-*type* theorems from Gleason’s original theorem.] also follows from weaker assumptions: it is sufficient to require that Eq. (3) hold only for effects $E_{1}$ and $E_{2}$ that coexist in *projective-simulable* measurements obtained by mixing projective measurements [4].

Finitely additive functions were first given serious consideration in 1821 when Cauchy [5] attempted to find all solutions of the equation

$$f\left(x\right)+f\left(y\right)=f\left(x+y\right), \tag{5}$$

for real variables $x,\,y\in\mathbb{R}$ . In addition to the obvious linear solutions, non-linear solutions to Cauchy’s functional equation are known to exist [6]. However, the non-linear functions $f$ satisfying Eq. (5) cannot be Lebesgue measurable [7], continuous at a single point [9] or bounded on any set of positive measure [10]. Similar results also hold for Cauchy’s functional equation with arguments more general than real numbers, reviewed in [11], for example.
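The gap between additivity and linearity can be illustrated without the axiom of choice if one restricts attention to a small subset of the reals. The sketch below is an illustration only and is not taken from the paper: it works on numbers of the form $a+b\sqrt{2}$ with rational $a,b$, and the class `QSqrt2` and the map `f` are hypothetical names. The map is additive on this set but has no single slope, so it is not of the form $f(x)=kx$. A non-linear solution on all of $\mathbb{R}$ would still require a Hamel basis.

```python
from fractions import Fraction
from math import sqrt


class QSqrt2:
    """Exact number a + b*sqrt(2) with rational a and b (hypothetical helper)."""

    def __init__(self, a, b=0):
        self.a = Fraction(a)
        self.b = Fraction(b)

    def __add__(self, other):
        return QSqrt2(self.a + other.a, self.b + other.b)

    def value(self):
        """Floating-point value of the number."""
        return float(self.a) + float(self.b) * sqrt(2)


def f(x):
    """Keep the rational part a and discard the sqrt(2) part."""
    return x.a


x = QSqrt2(1, 2)                   # 1 + 2*sqrt(2)
y = QSqrt2(Fraction(1, 3), -1)     # 1/3 - sqrt(2)

# Cauchy's equation holds exactly on this set.
additive = f(x + y) == f(x) + f(y)

# A linear function f(t) = k*t has the same slope f(t)/t at every t != 0.
one, root2 = QSqrt2(1), QSqrt2(0, 1)
slope_at_one = float(f(one)) / one.value()
slope_at_root2 = float(f(root2)) / root2.value()

print(additive, slope_at_one, slope_at_root2)
```

The two slopes differ (one at $t=1$, zero at $t=\sqrt{2}$), which is the behaviour that the regularity conditions discussed in this paper rule out.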

Recalling that the Hermitian operators on $\mathbb{C}^{d}$ form a real vector space, it becomes clear that the Gleason-type theorems described above can be viewed as results about the solutions of Cauchy’s functional equation for vector-valued arguments: additive functions on subsets of a real vector space, subject to some additional constraints, are necessarily linear. Taking advantage of this connection, we use results regarding Cauchy’s functional equation to present an alternative proof of known Gleason-type theorems.

In Sec. 2, we spell out four conditions that single out *linear *solutions to Cauchy’s functional equation defined on a finite interval of the real line. The main result of this paper—an alternative method to derive Busch’s Gleason-type theorem—is presented in Sec. 3. We conclude with a summary and a discussion of the results in Sec. 4.

2 Cauchy’s functional equation on a finite interval

In 1821 Cauchy [5] showed that a continuous function over the real numbers satisfying Eq. (5) is necessarily linear. It is important to note, however, that relaxing the continuity restriction does allow for non-linear solutions [6], as pathological as they may be. [Footnote 6: The existence of non-linear solutions depends on the existence of Hamel bases and, thus, on the axiom of choice.] Other conditions known to ensure linearity of a finitely additive function include Lebesgue measurability [7], positivity on small numbers [12] or continuity at a single point [9]. We begin by proving a related result, in which the domain of the function is restricted to an interval, as opposed to the entire real line.

Theorem 1.

Let $a>0$ and $f:\left[0,a\right]\rightarrow\mathbb{R}$ be a function that satisfies

$$f\left(x\right)+f\left(y\right)=f\left(x+y\right), \tag{6}$$

for all $x,y\in\left[0,a\right]$ such that $\left(x+y\right)\in\left[0,a\right]$ . The function $f$ is necessarily linear, i.e.

$$f\left(x\right)=\frac{f\left(a\right)}{a}x, \tag{7}$$

if it satisfies any one of the following four conditions:

(i) $f\left(x\right)\leq b$ for some $b\geq 0$ and all $x\in\left[0,a\right]$;

(ii) $f\left(x\right)\geq c$ for some $c\leq 0$ and all $x\in\left[0,a\right]$;

(iii) $f$ is continuous at zero;

(iv) $f$ is Lebesgue-measurable.

Theorem 1 says that non-linear solutions of Eq. (6) cannot be bounded from below or above, continuous at zero or Lebesgue measurable. We will now prove the linearity of $f$ for Case (i). The proofs for the remaining cases are given in Appendix A.

Proof.

We will extend $f$ to a finitely additive function on the entire real line. For any real number $x\in\left[0,a\right]$ , Eq. (6) implies that

$$f\left(x\right)=f\left(\frac{n}{n}x\right)=nf\left(\frac{x}{n}\right), \tag{8}$$

where $n$ is a positive integer. If we choose an integer $m\in\mathbb{N}$ with $mx/n\in\left[0,a\right]$, then we have

$$f\left(\frac{m}{n}x\right)=mf\left(\frac{x}{n}\right)=\frac{m}{n}f\left(x\right). \tag{9}$$

In a first step, we extend the function $f$ to all non-negative real numbers by defining

$$f_{+}\left(x\right)=nf\left(\frac{x}{n}\right), \tag{10}$$

for real numbers $x>a$ and integers $n>x/a$ . This extension is well-defined since for any two sufficiently large integers, i.e. $m$ and $n$ with $m,n>x/a$ , we have

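$$f\left(\frac{x}{mn}\right)=\frac{1}{m}f\left(\frac{x}{n}\right)=\frac{1}{n}f\left(\frac{x}{m}\right), \tag{11}$$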

according to Eq. (8), resulting in the identity

$$mf\left(\frac{x}{m}\right)=nf\left(\frac{x}{n}\right). \tag{12}$$

Finite additivity on the positive half-line also holds since for any two non-negative numbers $x,y\geq 0$ , we find

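$$\begin{aligned}f_{+}\left(x\right)+f_{+}\left(y\right)&=nf\left(\frac{x}{n}\right)+nf\left(\frac{y}{n}\right)\\&=nf\left(\frac{x+y}{n}\right)\\&=f_{+}\left(x+y\right),\end{aligned} \tag{13}$$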

for sufficiently large $n\in\mathbb{N}$ which ensures that $(x+y)/n\in[0,a]$ .

In a second step, we extend the function $f_{+}$ to the entire real line by defining

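$$f_{\mathbb{R}}\left(x\right)=\begin{cases}f_{+}\left(x\right), & x\geq 0,\\ -f_{+}\left(-x\right), & x<0.\end{cases} \tag{14}$$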

To show that the function $f_{\mathbb{R}}$ is finitely additive on all of $\mathbb{R}$ , three cases must be considered.

If both $x<0$ and $y<0$ , we have

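$$\begin{aligned}f_{\mathbb{R}}\left(x\right)+f_{\mathbb{R}}\left(y\right)&=-f_{+}\left(-x\right)-f_{+}\left(-y\right)\\&=-f_{+}\left(-x-y\right)\\&=f_{\mathbb{R}}\left(x+y\right),\end{aligned} \tag{15}$$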

using that $f_{+}(-x)+f_{+}(-y)=f_{+}(-x-y)$ holds for non-negative real numbers $-x$ and $-y$ .

If $x\geq 0$ , $y<0$ and $x+y<0$ , we have

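$$\begin{aligned}f_{\mathbb{R}}\left(x\right)+f_{\mathbb{R}}\left(y\right)&=f_{+}\left(x\right)-f_{+}\left(-y\right)\\&=f_{+}\left(x\right)-f_{+}\left(-y-x\right)-f_{+}\left(x\right)\\&=f_{\mathbb{R}}\left(x+y\right).\end{aligned} \tag{16}$$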

If $x\geq 0$ , $y<0$ and $x+y\geq 0$ , we have

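$$\begin{aligned}f_{\mathbb{R}}\left(x\right)+f_{\mathbb{R}}\left(y\right)&=f_{+}\left(x\right)-f_{+}\left(-y\right)\\&=f_{+}\left(x+y\right)+f_{+}\left(-y\right)-f_{+}\left(-y\right)\\&=f_{\mathbb{R}}\left(x+y\right).\end{aligned} \tag{17}$$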

This property completes the proposed extension of the function $f$ to a finitely additive function $f_{\mathbb{R}}$ on the real line that is bounded above on the interval $[0,a]$ . Ostrowski [13] and Kestelman [10] showed that finitely additive functions on the real line that are bounded above on a set of positive measure are necessarily linear. Therefore, the extended function $f_{\mathbb{R}}$ is linear, and its restriction back to the interval $[0,a]$ is given by $f(x)=f(a)x/a$ . ∎
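The two-step extension used in the proof can be sketched in a few lines. The code below is a minimal illustration under stated assumptions: the endpoint `A`, the sample function `f` (chosen linear so that it is certainly additive), and the names `f_plus` and `f_real` are all hypothetical. Because the sample function is already linear, the checks only illustrate the bookkeeping of the extension, namely that the value of $nf(x/n)$ does not depend on the admissible $n$ and that the extended function is additive across signs.

```python
from fractions import Fraction
from math import floor

A = Fraction(2)  # interval endpoint a (illustrative choice)


def f(x):
    """An additive function on [0, A]; here f(x) = 3x/4 (assumed for the demo)."""
    if not 0 <= x <= A:
        raise ValueError("f is only defined on [0, A]")
    return Fraction(3, 4) * x


def f_plus(x, n=None):
    """Extension to x >= 0: n * f(x / n) for any integer n > x / A."""
    if x < 0:
        raise ValueError("f_plus is only defined for x >= 0")
    if n is None:
        n = floor(x / A) + 1  # smallest integer strictly greater than x / A
    return n * f(x / n)


def f_real(x):
    """Odd extension to the whole real line."""
    return f_plus(x) if x >= 0 else -f_plus(-x)


x, y = Fraction(7), Fraction(-5, 2)

# x / A = 3.5, so both n = 4 and n = 9 are admissible and must agree.
well_defined = f_plus(x, n=4) == f_plus(x, n=9)

# Mixed-sign case with x + y >= 0.
additive = f_real(x) + f_real(y) == f_real(x + y)

print(well_defined, additive, f_real(x + y))
```

Exact rational arithmetic via `Fraction` is used so that the equalities can be tested without floating-point tolerance.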

3 When Cauchy meets Gleason: additive functions on effect spaces

The first Gleason-type theorem, published in 2003, assumes additivity of the frame function not only on projections that occur in the same projection-valued measure (PVM) but on the larger set of effects that coexist in the same POM.

Theorem 2 (Busch [2]).

Let $\mathcal{E}_{d}$ be the space of effects on $\mathbb{C}^{d}$ and $\operatorname{I}_{d}$ be the identity operator on $\mathbb{C}^{d}$ . Any function $f:\mathcal{E}_{d}\rightarrow\left[0,1\right]$ satisfying

$$f\left(\operatorname{I}_{d}\right)=1, \tag{18}$$

and

$$f\left(E_{1}\right)+f\left(E_{2}\right)=f\left(E_{1}+E_{2}\right), \tag{19}$$

for all $E_{1},E_{2}\in\mathcal{E}_{d}$ such that $\left(E_{1}+E_{2}\right)\in\mathcal{E}_{d}$ , admits an expression

$$f\left(E\right)=\operatorname{Tr}\left(E\rho\right), \tag{20}$$

for some density operator $\rho$ , and all effects $E\in\mathcal{E}_{d}$ .

Theorem 2 rephrases the (finite-dimensional case of the) theorem proved by Busch [2] and the theorem due to Caves et al. [3]. Busch uses the positivity of the frame function $f$ to directly establish its homogeneity whereas Caves et al. derive homogeneity by showing that the frame function $f$ must be continuous at the zero operator. These arguments seem to run in parallel with Cases (ii) and (iii) of Theorem 1 presented in the previous section. In Sec. 3.2, we will give an alternative proof of Theorem 2 which can be based on any of the four cases of Theorem 1.

3.1 Preliminaries

To begin, let us introduce a number of useful concepts and establish a suitable notation. Throughout this section we will make use of the fact that the Hermitian operators on $\mathbb{C}^{d}$ constitute a real vector space of dimension $d^{2}$ , which we will denote by $\mathbb{H}_{d}$ . We may therefore employ the standard inner product $\left\langle A,B\right\rangle=\operatorname{Tr}\left(AB\right)$ , for Hermitian operators $A$ and $B$ , in our reasoning as well as the norm $\left\|\cdot\right\|$ which it induces.

A discrete POM on $\mathbb{C}^{d}$ is described by its range, i.e. by a sequence of effects $\left\llbracket E_{1},E_{2},\dots\right\rrbracket$ that sum to the identity operator on $\mathbb{C}^{d}$. A minimal informationally-complete (MIC) POM $\mathcal{M}$ on $\mathbb{C}^{d}$ consists of exactly $d^{2}$ linearly independent effects, ${\cal M}=\left\llbracket M_{1},\ldots,M_{d^{2}}\right\rrbracket$. Hence, MIC-POMs constitute bases of the vector space of Hermitian operators, and it is known that they exist in all finite dimensions [14].

Positive linear combinations of effects will play an important role below, giving rise to the following definition.

Definition 1.

The positive cone of a set of Hermitian operators $S$ on $\mathbb{C}^{d}$ is the set of non-negative linear combinations of the elements of $S$ , i.e. the set

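$$\mathcal{C}\left(S\right)=\left\{H=\sum_{j=1}^{d^{2}}a_{j}H_{j},\ a_{j}\geq 0\ \text{for any}\ H_{j}\in S\right\}. \tag{21}$$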

Note that the expression of an element of $\mathcal{C}\left(S\right)$ as a linear combination of elements of $S$ requires at most $d^{2}$ terms as a consequence of Caratheodory’s theorem.

Next, we introduce so-called “augmented” bases of the space $\mathbb{H}_{d}$ which are built around sets of $d$ projections $\left\{|e_{1}\rangle\langle e_{1}|,\ldots,|e_{d}\rangle\langle e_{d}|\right\}$ where the vectors $\left\{|e_{1}\rangle,\ldots|e_{d}\rangle\right\}$ form an orthonormal basis of $\mathbb{C}^{d}$ .

Definition 2.

An *augmented basis* of the Hermitian operators on $\mathbb{C}^{d}$ is a set of $d^{2}$ linearly independent rank-one effects $\mathcal{B}=\left\{B_{1},\ldots,B_{d^{2}}\right\}$ satisfying

(i) $B_{j}=c|e_{j}\rangle\langle e_{j}|$ for $1\leq j\leq d$, with $0<c<1$ and an orthonormal basis $\left\{|e_{1}\rangle,\ldots,|e_{d}\rangle\right\}$ of $\mathbb{C}^{d}$;

(ii) $\sum_{j=1}^{d^{2}}B_{j}\in\mathcal{E}_{d}$.

Given any orthonormal basis $\left\{|e_{1}\rangle,\ldots,|e_{d}\rangle\right\}$ of $\mathbb{C}^{d}$ , we can construct an augmented basis for the space of operators acting on it. First, complete the $d$ projectors

$$\Pi_{j}=|e_{j}\rangle\langle e_{j}|,\quad j=1\ldots d, \tag{22}$$

into a basis $\left\{\Pi_{1},\ldots,\Pi_{d^{2}}\right\}$ of the Hermitian operators on $\mathbb{C}^{d}$ , by adding $d(d-1)$ further rank-one projections; this is always possible [14]. The sum

$$G=\sum_{j=1}^{d^{2}}\Pi_{j}, \tag{23}$$

is necessarily a positive operator. The relation $\operatorname{Tr}G=d^{2}$ implies that $G$ must have at least one eigenvalue larger than $1$. If $\Gamma>1$ is the largest eigenvalue of $G$, then $G/\Gamma$ is an effect since it is a positive operator with eigenvalues less than or equal to one. Defining

$$B_{j}=\Pi_{j}/\Gamma,\quad j=1\ldots d^{2}, \tag{24}$$

the set $\mathcal{B}=\left\{B_{1},\ldots,B_{d^{2}}\right\}$ turns into an augmented basis. One can show that $\mathcal{B}$ can never correspond to a POM. Nevertheless, the effects $B_{j}$ coexist in the sense that they can occur in one single POM, for example $\left\llbracket B_{1},\ldots,B_{d^{2}},\operatorname{I}-G/\Gamma\right\rrbracket$ .
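For a qubit ($d=2$) the construction can be carried out explicitly. The following sketch is illustrative and not part of the paper: it assumes the four rank-one projections onto $|e_{1}\rangle$, $|e_{2}\rangle$, $(|e_{1}\rangle+|e_{2}\rangle)/\sqrt{2}$ and $(|e_{1}\rangle+i|e_{2}\rangle)/\sqrt{2}$ as the completed basis (their linear independence is taken as known rather than checked in code), and the helper names `outer` and `eigenvalues_2x2` are hypothetical. It computes $G$, its largest eigenvalue $\Gamma$, and the rescaled operators $B_{j}=\Pi_{j}/\Gamma$.

```python
from math import sqrt


def outer(v):
    """Rank-one projection |v><v| for a normalised 2-component vector v."""
    return [[v[i] * v[j].conjugate() for j in range(2)] for i in range(2)]


def eigenvalues_2x2(h):
    """Eigenvalues of a 2x2 Hermitian matrix from its trace and determinant."""
    tr = (h[0][0] + h[1][1]).real
    det = (h[0][0] * h[1][1] - h[0][1] * h[1][0]).real
    disc = sqrt(max(tr * tr - 4 * det, 0.0))
    return (tr - disc) / 2, (tr + disc) / 2


s = 1 / sqrt(2)
vectors = [
    (1 + 0j, 0j),        # |e_1>
    (0j, 1 + 0j),        # |e_2>
    (s + 0j, s + 0j),    # (|e_1> + |e_2>) / sqrt(2)
    (s + 0j, 1j * s),    # (|e_1> + i|e_2>) / sqrt(2)
]
projections = [outer(v) for v in vectors]

# G is the sum of all d^2 = 4 projections, so Tr G = 4.
G = [[sum(p[i][j] for p in projections) for j in range(2)] for i in range(2)]
gamma = max(eigenvalues_2x2(G))

# Rescale every projection by the largest eigenvalue of G.
B = [[[entry / gamma for entry in row] for row in p] for p in projections]
B_sum = [[sum(b[i][j] for b in B) for j in range(2)] for i in range(2)]
low, high = eigenvalues_2x2(B_sum)
c = 1 / gamma

print(gamma, c, low, high)
```

In this example $\Gamma=2+1/\sqrt{2}$, so $c=1/\Gamma$ lies strictly between zero and one, and the sum of the $B_{j}$ has eigenvalues in $[0,1]$, as required by Definition 2.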

Given an effect, one can always represent it as a positive linear combination of elements in a suitable augmented basis.

Lemma 1.

For any effect $E\in\mathcal{E}_{d}$ there exists an augmented basis $\mathcal{B}$ such that $E$ is in the positive cone of $\mathcal{B}$ .

Proof.

By the spectral theorem we may write

$$E=\sum_{j=1}^{d}\lambda_{j}|e_{j}\rangle\langle e_{j}|,\quad\lambda_{j}\in\left[0,1\right], \tag{25}$$

for an orthonormal basis $\left\{|e_{j}\rangle,1\leq j\leq d\right\}$ of $\mathbb{C}^{d}$ . Take $\mathcal{B}$ to be an augmented basis with

$$B_{j}=c|e_{j}\rangle\langle e_{j}|, \tag{26}$$

for $1\leq j\leq d$ and some $c\in\left(0,1\right)$ . Then we may express $E$ as the linear combination

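$$E=\sum_{j=1}^{d^{2}}e_{j}B_{j} \tag{27}$$

with non-negative coefficients

$$e_{j}=\begin{cases}\lambda_{j}/c, & 1\leq j\leq d,\\ 0, & d<j\leq d^{2},\end{cases} \tag{28}$$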

showing that the positive cone of the basis ${\cal B}$ indeed contains the effect $E$ . ∎

Finally, we need to establish that the intersection of the positive cones associated with an augmented basis and a MIC-POM, respectively, has dimension $d^{2}$ .

Lemma 2.

Let $\mathcal{B}=\left\{B_{1},\ldots,B_{d^{2}}\right\}$ be an augmented basis and $\mathcal{M}=\left\llbracket M_{1},\ldots,M_{d^{2}}\right\rrbracket$ a MIC-POM on $\mathbb{C}^{d}$ . The effects in the intersection ${\cal C}({\cal B})\cap{\cal C}({\cal M})$ of the positive cones of $\mathcal{B}$ and $\mathcal{M}$ span the real vector space $\mathbb{H}_{d}$ of Hermitian operators on $\mathbb{C}^{d}$ .

Proof.

Since the effects in a POM sum to the identity, we have

$$\frac{1}{d}\operatorname{I}_{d}=\sum_{j=1}^{d^{2}}\frac{1}{d}M_{j}. \tag{29}$$

With each of the coefficients in the unique decomposition on the right-hand side being finite and positive (as opposed to non-negative), the effect $\operatorname{I}_{d}/d$ is seen to be an interior point of the positive cone $\mathcal{C}\left(\mathcal{M}\right)$ . At the same time, the effect $\operatorname{I}_{d}/d$ is located on the boundary of the cone ${\cal C}({\cal B})$ since its expansion in an augmented basis has only $d$ non-zero terms. Let us define the operator

[TABLE]

which, for any positive $\delta>0$ , is an interior point of the cone $\mathcal{C}\left(\mathcal{B}\right)$ : each of the positive coefficients in its unique decomposition in terms of the augmented basis ${\cal B}$ is non-zero; we have used Property 1 of Def. 2 to express the identity $\operatorname{I}_{d}$ in terms of the basis ${\cal B}$ . For sufficiently small values of $\delta$ , the operator $E_{\delta}$ is also an interior point of the open ball $\mathfrak{B}_{\varepsilon}\left(\operatorname{I}_{d}/d\right)$ with radius $\varepsilon$ about the point $I_{d}/d$ since

[TABLE]

holds whenever

[TABLE]

Being an interior point of both the positive cones $\mathcal{C}\left(\mathcal{B}\right)$ and $\mathcal{C}\left(\mathcal{M}\right)$ , the operator $E_{\delta}$ is at the center of an open ball $\mathfrak{B}_{\gamma}\left(E_{\delta}\right)$ , located entirely in the intersection ${\cal C}({\cal B})\cap{\cal C}({\cal M})$ (cf. Fig. 1). Since the ball $\mathfrak{B}_{\gamma}\left(E_{\delta}\right)$ has dimension $d^{2},$ the effects contained in it must indeed span the real vector space $\mathbb{H}_{d}$ of Hermitian operators. ∎

Combining Theorem 1 with Lemmata 1 and 2 will allow us to present a new proof of Busch’s Gleason-type theorem.

3.2 An alternative proof of Busch’s Gleason-type theorem

Recalling that the trace of the product of two Hermitian operators constitutes an inner product on the vector space of Hermitian operators, Theorem 2 essentially states that the frame function $f$ acting on an effect can be written as the inner product of that effect with a fixed density operator. To underline the connection with the inner product we adopt the following notation. Let $\mathcal{A}=\left\{A_{1},\ldots,A_{d^{2}}\right\}$ be a basis for the Hermitian operators $\mathbb{H}_{d}$ on $\mathbb{C}^{d}$ . We describe the effect $E$ by the “effect vector” $\mathbf{e}=\left(e_{1},\ldots,e_{d^{2}}\right)^{T}\in\mathbb{R}^{d^{2}}$ , given by its expansion coefficients in this basis,

$$E=\sum_{j=1}^{d^{2}}e_{j}A_{j}\equiv\mathbf{e}\cdot\mathbf{A}, \tag{33}$$

where $\mathbf{A}$ is an operator-valued vector with $d^{2}$ components. Theorem 2 now states that the frame function is given by a scalar product,

$$f\left(E\right)=\mathbf{e}\cdot\mathbf{c}, \tag{34}$$

between the effect vector $\mathbf{e}$ and a fixed vector $\mathbf{c}\in\mathbb{R}^{d^{2}}$ . Let us determine the relation between the density matrix $\rho$ in (20) in the theorem and the vector $\mathbf{c}$ in (34). Consider any orthonormal basis $\mathcal{W}=\left\{W_{1},\dots,W_{d^{2}}\right\}$ of the Hermitian operators on $\mathbb{C}^{d}$ and let $\mathbf{e}^{\prime}\in\mathbb{R}^{d^{2}}$ be the vector such that $E=\mathbf{e^{\prime}}\cdot\mathbf{W}$ . Then we may write

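$$f\left(E\right)=\mathbf{e}\cdot\mathbf{c}=\mathbf{e}^{\prime}\cdot\mathbf{c}^{\prime}=\operatorname{Tr}\left(E\,\mathbf{c}^{\prime}\cdot\mathbf{W}\right); \tag{35}$$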

here $\mathbf{c^{\prime}}\in\mathbb{R}^{d^{2}}$ is a fixed vector given by $\mathbf{c}^{\prime}=C^{-T}\mathbf{c}$ and $C^{-T}$ is the inverse transpose of the change-of-basis matrix $C$ between the bases $\mathcal{B}$ and $\mathcal{W}$ , i.e. the matrix satisfying $C\mathbf{h}=\mathbf{h}^{\prime}$ for all Hermitian operators $H=\mathbf{h}\cdot\mathbf{B}=\mathbf{h^{\prime}}\cdot\mathbf{W}$ . By the definition of a frame function the operator

$$\rho=\mathbf{c}^{\prime}\cdot\mathbf{W}=\sum_{j=1}^{d^{2}}c_{j}^{\prime}W_{j} \tag{36}$$

must be positive semi-definite (since $f$ is positive) and have unit trace (due to Eq. (18)) i.e. be a density operator.
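The correspondence between the vector $\mathbf{c}^{\prime}$ and the density operator can be checked numerically for a qubit. The sketch below is an illustration under assumptions that are not in the paper: the orthonormal basis is taken to be the normalised Pauli operators $\sigma_{\mu}/\sqrt{2}$, and the particular `rho` and `E` are arbitrary sample values. It computes the expansion coefficients via the trace inner product and compares $\mathbf{e}^{\prime}\cdot\mathbf{c}^{\prime}$ with $\operatorname{Tr}(E\rho)$.

```python
from math import sqrt


def trace_product(a, b):
    """Tr(ab) for 2x2 matrices; the result is real for Hermitian a and b."""
    return sum(a[i][j] * b[j][i] for i in range(2) for j in range(2)).real


r = 1 / sqrt(2)
W = [
    [[r, 0], [0, r]],              # identity / sqrt(2)
    [[0, r], [r, 0]],              # sigma_x / sqrt(2)
    [[0, -1j * r], [1j * r, 0]],   # sigma_y / sqrt(2)
    [[r, 0], [0, -r]],             # sigma_z / sqrt(2)
]


def coefficients(h):
    """Expansion coefficients of a Hermitian h in the orthonormal basis W."""
    return [trace_product(h, w) for w in W]


rho = [[0.7, 0.2 - 0.1j], [0.2 + 0.1j, 0.3]]   # sample density operator
E = [[0.5, 0.25j], [-0.25j, 0.4]]              # sample effect

c_prime = coefficients(rho)
e_prime = coefficients(E)

born = trace_product(E, rho)
dot = sum(e * c for e, c in zip(e_prime, c_prime))

# Rebuild rho from its coefficient vector: rho = sum_j c'_j W_j.
rho_rebuilt = [
    [sum(c_prime[k] * W[k][i][j] for k in range(4)) for j in range(2)]
    for i in range(2)
]

print(born, dot)
```

Both numbers agree because the basis is orthonormal with respect to $\langle A,B\rangle=\operatorname{Tr}(AB)$; for a non-orthonormal basis the change-of-basis matrix would enter, as described in the text.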

We will now prove that a frame function always admits an expression as in Eq. (34).

Proof.

By Lemma 1, there exists an augmented basis $\mathcal{B}=\left\{B_{1},\ldots,B_{d^{2}}\right\}$ for any $E\in\mathcal{E}_{d}$ such that

$$E=\sum_{j=1}^{d^{2}}e_{j}B_{j}=\mathbf{e}\cdot\mathbf{B}, \tag{37}$$

with coefficients $e_{j}\geq 0$ , as in Eq. (33).

For each value $j\in\left\{1,\dots,d^{2}\right\}$ , we write the restriction of the frame function $f$ to the set of effects of the form $xB_{j}$ , for $x\in\mathbb{R}$ , as

$$f\left(xB_{j}\right)=F_{j}\left(x\right), \tag{38}$$

where $F_{j}:\left[0,a_{j}\right]\rightarrow\left[0,1\right]$ and $a_{j}=\max\left\{x|xB_{j}\in\mathcal{E}_{d}\right\}$ . By Eq. (19) we have that $F_{j}$ satisfies Cauchy’s functional equation, i.e. $F_{j}\left(x+y\right)=F_{j}\left(x\right)+F_{j}\left(y\right)$ . Due to the assumption in Theorem 2 that $f:\mathcal{E}_{d}\rightarrow\left[0,1\right]$ , each $F_{j}$ must satisfy Condition (i) of Theorem 1 which implies

$$F_{j}\left(x\right)=xF_{j}\left(1\right)=xf\left(B_{j}\right). \tag{39}$$

Thus we find

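$$f\left(E\right)=\sum_{j=1}^{d^{2}}f\left(e_{j}B_{j}\right)=\sum_{j=1}^{d^{2}}e_{j}f\left(B_{j}\right)=\mathbf{e}\cdot\mathbf{f}_{\mathcal{B}}, \tag{40}$$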

where the $j$ -th component of $\mathbf{f}_{\mathcal{B}}\in\mathbb{R}^{d^{2}}$ is given by $f\left(B_{j}\right)$ , by repeatedly using additivity and Eq. (39). Note that Eq. (40) is not yet in the desired form of Eq. (34) since the vector $\mathbf{f}_{\mathcal{B}}$ depends on the basis $\mathcal{B}$ and thus the effect $E$ .

Let $\mathcal{M}=\left\llbracket M_{1},\ldots,M_{d^{2}}\right\rrbracket$ be a MIC-POM on $\mathbb{C}^{d}$ . Since the elements of $\mathcal{M}$ are a basis for the space $\mathbb{H}_{d}$ , the Hermitian operators on $\mathbb{C}^{d}$ , we have for any $E\in\mathcal{E}_{d}$

$$E=\sum_{j=1}^{d^{2}}e_{j}^{\prime\prime}M_{j}=\mathbf{e}^{\prime\prime}\cdot\mathbf{M}, \tag{41}$$

for coefficients $e_{j}^{\prime\prime}\in\mathbb{R}$ some of which may be negative. There exists a fixed change-of-basis matrix $D$ such that

[TABLE]

for all effects $E\in\mathcal{E}_{d}$ . Now we have

[TABLE]

Any effect $G$ in the intersection of the positive cones $\mathcal{C}\left(\mathcal{B}\right)$ and $\mathcal{C}\left(\mathcal{M}\right)$ can be expressed in two ways,

$$G=\mathbf{g}\cdot\mathbf{B}=\mathbf{g}^{\prime\prime}\cdot\mathbf{M}, \tag{44}$$

where both effect vectors $\mathbf{g}$ and $\mathbf{g}^{\prime\prime}$ have only non-negative components. Eqs. (40) and (43) imply that

[TABLE]

Since by Lemma 2 there are $d^{2}$ linearly independent effects $G$ in the intersection $\mathcal{C}\left(\mathcal{M}\right)\cap\mathcal{C}\left(\mathcal{B}\right)$ , we conclude that

[TABLE]

Combining this equality with Eq. (43) we find, for a fixed MIC-POM $\mathcal{M}=\left\llbracket M_{1},\ldots,M_{d^{2}}\right\rrbracket$ and any effect $E\in\mathcal{E}_{d}$, that the frame function $f$ takes the form

$$f\left(E\right)=\mathbf{e}^{\prime\prime}\cdot\mathbf{f}_{\mathcal{M}}. \tag{47}$$

Here $\mathbf{f}_{\mathcal{M}}\equiv\mathbf{c}$ is a fixed vector since it does not depend on $E$ . ∎
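The final form of the frame function can be illustrated for a qubit with a concrete MIC-POM. The sketch below is not from the paper: it assumes the tetrahedral POM $M_{j}=(\operatorname{I}+\mathbf{n}_{j}\cdot\boldsymbol{\sigma})/4$ and the Born-rule frame function for a sample `rho`. The closed-form coefficient formula $e_{j}^{\prime\prime}=6\operatorname{Tr}(EM_{j})-\operatorname{Tr}E$ holds only for this symmetric choice, where $\operatorname{Tr}(M_{j}M_{k})$ equals $1/4$ for $j=k$ and $1/12$ otherwise. The function names are hypothetical.

```python
from math import sqrt


def trace_product(a, b):
    """Tr(ab) for 2x2 matrices; the result is real for Hermitian a and b."""
    return sum(a[i][j] * b[j][i] for i in range(2) for j in range(2)).real


def qubit_operator(t, x, y, z):
    """The Hermitian matrix t*I + x*sigma_x + y*sigma_y + z*sigma_z."""
    return [[t + z, x - 1j * y], [x + 1j * y, t - z]]


r = 1 / sqrt(3)
directions = [(r, r, r), (r, -r, -r), (-r, r, -r), (-r, -r, r)]
M = [qubit_operator(0.25, x / 4, y / 4, z / 4) for x, y, z in directions]


def mic_coefficients(E):
    """Coefficients e''_j of E in the tetrahedral MIC-POM (formula specific to it)."""
    trace_E = (E[0][0] + E[1][1]).real
    return [6 * trace_product(E, m) - trace_E for m in M]


rho = [[0.7, 0.2 - 0.1j], [0.2 + 0.1j, 0.3]]   # sample density operator
f_M = [trace_product(m, rho) for m in M]       # f(M_j) under the Born rule

E = [[1.0, 0.0], [0.0, 0.0]]                   # the projection |0><0|
e = mic_coefficients(E)

# Check the expansion E = sum_j e''_j M_j entry by entry.
rebuilt = [
    [sum(e[k] * M[k][i][j] for k in range(4)) for j in range(2)]
    for i in range(2)
]

born = trace_product(E, rho)
via_mic = sum(ek * fk for ek, fk in zip(e, f_M))

print(e, born, via_mic)
```

Two of the four coefficients are negative for this effect, which matches the remark that the expansion of a general effect in a MIC-POM need not lie in its positive cone, while the scalar product with the fixed vector `f_M` still reproduces $\operatorname{Tr}(E\rho)$.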

Note that Eq. (39) may also be found using the other three cases of Theorem 1. For Case (ii), we observe that each of the functions $F_{j},j=1\ldots d^{2}$ , is non-negative by definition. Alternatively, each function $F_{j}$ can be shown to be continuous at zero (Case (iii)) using the following argument which is similar to the one given in [14]. Assume $F_{j}$ is not continuous at zero. Then there exists a number $\varepsilon>0$ such that for all $\delta>0$ we have

$$F_{j}\left(x_{0}\right)>\varepsilon \tag{48}$$

for some $0<x_{0}<\delta<1$. For any given $\varepsilon$, choose $\delta=1/n<\varepsilon$; then there is a value of $x_{0}<\delta$ such that $F_{j}\left(x_{0}\right)>\varepsilon$. However, we have the inequality $nx_{0}<1$, which leads to

$$F_{j}\left(nx_{0}\right)=nF_{j}\left(x_{0}\right)>n\varepsilon>1, \tag{49}$$

contradicting the existence of an upper bound of one on the values of $F_{j}$. Finally, each of the functions $F_{j}$ is Lebesgue measurable (Case (iv)), which follows from the monotonicity of the function.

4 Summary and discussion

We are aware of two papers linking Gleason’s theorem and Cauchy’s functional equation. Cooke et al. [15] used Cauchy’s functional equation to demonstrate the necessity of the boundedness of frame functions in proving Gleason’s theorem. Dvurečenskij [16] introduced frame functions defined on effect algebras but did not proceed to derive a Gleason-type theorem in the context of quantum theory.

In this paper, we have exploited the fact that additive functions are central to both Gleason-type theorems and Cauchy’s functional equation. Gleason-type theorems are based on the assumption that states assign probabilities to measurement outcomes via additive functions, or frame functions, on the effect space. Linearity of the frame functions has been shown to follow from positivity and other assumptions which are well-known in the context of Cauchy’s functional equation. Altogether, the result obtained here amounts to an alternative proof of the extension of Gleason’s theorem to dimension two given by Busch [2] and Caves et al. [3].

Other Gleason-type theorems are known that are *stronger*, in the sense that they depend on assumptions weaker than those of Theorem 2. The smallest known set of assumptions requires Eq. (19) to be valid only for effects $E_{1}$ and $E_{2}$ that coexist in a projective-simulable POM [17], i.e. a POM that may be simulated using only classical mixtures of projective measurements, as opposed to any POM. Since the proof given in [4] relies on Theorem 2, the alternative proof presented in Sec. 3.2 also gives rise to a new proof of the strongest existing Gleason-type theorem.

We have not been able to exploit the structural similarity between the requirements on frame functions and on the solutions of Cauchy’s functional equation in order to yield a new proof of Gleason’s original theorem. Additivity of frame functions defined on projections instead of effects does not provide us with the type of continuous parameters that are necessary for the argument developed here. It remains an intriguing open question whether such a proof does exist.

Acknowledgement.

The authors thank Jonathan Barrett for pointing out a gap in the proof of Theorem 1 given in an earlier version of this paper. VJW gratefully acknowledges funding from the York Centre for Quantum Technologies and the WW Smith fund.

Appendix A Proofs of Cases (ii), (iii) and (iv) of Theorem 1

We show that each of the conditions given in Cases (ii) to (iv) implies the conclusion of Theorem 1, namely that an additive function on an interval $\left[0,a\right]$ must be linear.

Proof.

Case (ii): Suppose that there exists a non-linear function $f$ satisfying Eq. (6) and Case (ii) of Theorem 1. Then the function $g:\left[0,a\right]\rightarrow\mathbb{R}$ defined by $g\left(x\right)=-f\left(x\right)$ is non-linear but satisfies Eq. (6) and $g\left(x\right)\leq b$ with $b=-c\geq 0$, contradicting Case (i). ∎

Proof.

Case (iii): Since $f$ is continuous at zero and $f\left(0\right)=0$ , as follows from Eq. (6), we have that for any $\varepsilon>0$ , there exists a $\delta>0$ such that $\left|f\left(x\right)\right|<\varepsilon$ for all $x$ satisfying $\left|x\right|<\delta$ . Let $x,x_{0}\in\left[0,a\right]$ be such that $\left|x-x_{0}\right|<\delta$ . First consider the case $x<x_{0}$ . Using additivity,

$$f\left(x_{0}\right)=f\left(x\right)+f\left(x_{0}-x\right), \tag{50}$$

we find

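$$\left|f\left(x_{0}\right)-f\left(x\right)\right|=\left|f\left(x_{0}-x\right)\right|<\varepsilon. \tag{51}$$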

On the other hand, if $x>x_{0}$ we have

$$f\left(x\right)=f\left(x_{0}\right)+f\left(x-x_{0}\right), \tag{52}$$

and then

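$$\left|f\left(x\right)-f\left(x_{0}\right)\right|=\left|f\left(x-x_{0}\right)\right|<\varepsilon. \tag{53}$$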

It follows that $f$ is continuous on $\left[0,a\right]$ . As in the proof for Case (i), Eqs. (8) and (9) show that

[TABLE]

for rational $q\in\left[0,a\right]$ . Therefore, if $\left(q_{1},q_{2},\ldots\right)$ is a sequence of rational numbers converging to $x$ , the function $f(x)$ must be linear in $x$ :

[TABLE]

∎

In Case (iv), where $f$ is Lebesgue measurable, the proof of the analogous result for functions on the full real line by Banach [7] is easily adapted to our setting. Given Case (iii), it suffices to prove that $f$ is continuous at zero, i.e. that for every $\varepsilon>0$ there exists a number $\delta>0$ such that

$$\left|f\left(h\right)\right|<\varepsilon \tag{56}$$

holds for all $0<h<\delta$ .

Proof.

Case (iv): Let $a/2<r<a$. Lusin’s theorem [18] states that, for a Lebesgue measurable function $g$ on an interval $J$ of Lebesgue measure $\mu\left(J\right)=m$, there exists a compact subset of any measure $m^{\prime}<m$ such that the restriction of $g$ to this subset is continuous. Thus we may find a compact set $F\subset\left[0,a\right]$ with $\mu\left(F\right)\geq r$ on which $f$ is continuous. Let $\varepsilon>0$ be given. Since $F$ is compact, $f$ is uniformly continuous on $F$ and there exists a $\delta\in\left(0,2r-a\right)$ such that

$$\left|f\left(x\right)-f\left(y\right)\right|<\varepsilon \tag{57}$$

is valid for two numbers $x,y\in F$ such that $\left|x-y\right|<\delta$ . Let $h\in\left(0,\delta\right)$ . Suppose $F$ and $F-h=\left\{x-h|x\in F\right\}$ were disjoint. Then we would have

[TABLE]

which contradicts $h<\delta<2r-a$ . Taking a point $x\in F\cap\left(F-h\right)$ then a number $\delta\in\left(0,2r-a\right)$ can be found such that

[TABLE]

for $h\in\left(0,\delta\right)$ . Hence, remembering that $f(0)=0$ , the function $f(x)$ is continuous at $x=0$ . ∎

Bibliography

The reference list from the paper itself. Each links out to its DOI / PubMed record.

[1] A. M. Gleason, J. Math. Mech. 6 (1957) 885
[2] P. Busch, Phys. Rev. Lett. 91 (2003) 120403
[3] C. M. Caves, C. A. Fuchs, K. K. Manne and J. M. Renes, Found. Phys. 34 (2004) 193
[4] V. J. Wright and S. Weigert, J. Phys. A 52 (2019) 055301
[5] A. Cauchy, Cours d’analyse de l’École Royale Polytechnique (Debure, 1821)
[6] G. Hamel, Math. Ann. 60 (1905) 459
[7] S. Banach, Fund. Math. 1 (1920) 123
[8] E. Christensen, Commun. Math. Phys. 86 (1982) 529
