A combined-probability space and (un)certainty relations for a finite-level quantum system

Arun Sehrawat

arXiv:1702.01680 · quant-ph · August 8, 2017


TL;DR

This paper introduces a new geometric framework called combined-probability space for finite-level quantum systems, deriving uncertainty relations using convex analysis and parametric curves, unifying and extending known results.

Contribution

It develops a novel combined-probability space for qudits, characterizes its extreme points, and derives uncertainty relations without exhaustive search by focusing on parametric curves.

Findings

01. The combined-probability space is a compact convex set with extreme points on parametric curves.

02. Uncertainty relations are obtained by minimizing concave functions on these curves.

03. Many known tight (un)certainty relations for qubits are recovered through triangle inequalities.

Abstract

The Born rule provides a probability vector (distribution) with a quantum state for a measurement setting. For two settings, we have a pair of vectors from the same quantum state. Each pair forms a combined-probability vector that obeys certain quantum constraints, which are triangle inequalities in our case. Such a restricted set of combined vectors, titled combined-probability space, is presented here for a $d$-level quantum system (qudit). The combined space turns out to be a compact convex subset of a Euclidean space, and all its extreme points come from a family of parametric curves. Considering a suitable concave function on the combined space to estimate the uncertainty, we deliver an uncertainty relation by finding its global minimum at the curves for a qudit. If one chooses an appropriate concave (or convex) function, then there is no need to search for the absolute minimum (maximum)…

Tables (12)

Table 1: A list of four points $\vec{P}=\big(p_{1},p_{2},\dot{\vec{p}}_{\mathrm{rest}},\dot{\vec{q}}\,\big)\in\mathbb{R}^{2d}$ that lie on the line characterized by (143). From the interior point $\big(\dot{\vec{p}},\dot{\vec{q}}\,\big)$, $\vec{P}_{1},\vec{P}_{2}$ are in the direction where $p_{1}$ increases, and $\vec{P}_{3},\vec{P}_{4}$ are in the direction where $p_{1}$ decreases. So, the value of $p_{1}$ for a point here is one of the four bounds [stated in (153)]. Once we have $p_{1}$ (in the center column), $p_{2}$ is retrieved with (143) and placed in the right column.

$\vec{P}$	$p_{1}$	$p_{2}$
${\vec{P}}_{1}$	$\cos {(θ_{1 J} - {\dot{β}}_{J})}^{2}$	${\dot{p}}_{1} + {\dot{p}}_{2} - \cos {(θ_{1 J} - {\dot{β}}_{J})}^{2}$
${\vec{P}}_{2}$	${\dot{p}}_{1} + {\dot{p}}_{2}$	0
${\vec{P}}_{3}$	0	${\dot{p}}_{1} + {\dot{p}}_{2}$
${\vec{P}}_{4}$	${\dot{p}}_{1} + {\dot{p}}_{2} - \cos {(θ_{2 K} - {\dot{β}}_{K})}^{2}$	$\cos {(θ_{2 K} - {\dot{β}}_{K})}^{2}$

Table 2: The conditions, which rely on the minimum and the maximum values in (153), that determine whether a point from Table 1 will be in or out of $\bm{\omega}$. A case in the right column occurs if and only if the related condition in the left column holds. At most two conditions can hold at a time.

If and only if	Then
$\cos {(θ_{1 J} - {\dot{β}}_{J})}^{2} < \dot{p_{1}} + \dot{p_{2}}$	${\vec{P}}_{1} \in 𝝎$ and ${\vec{P}}_{2} \notin 𝝎$
$\cos {(θ_{1 J} - {\dot{β}}_{J})}^{2} > \dot{p_{1}} + \dot{p_{2}}$	${\vec{P}}_{1} \notin 𝝎$ and ${\vec{P}}_{2} \in 𝝎$
$\cos {(θ_{1 J} - {\dot{β}}_{J})}^{2} = \dot{p_{1}} + \dot{p_{2}}$	${\vec{P}}_{1} = {\vec{P}}_{2} \in 𝝎$
$\cos {(θ_{2 K} - {\dot{β}}_{K})}^{2} < \dot{p_{1}} + \dot{p_{2}}$	${\vec{P}}_{3} \notin 𝝎$ and ${\vec{P}}_{4} \in 𝝎$
$\cos {(θ_{2 K} - {\dot{β}}_{K})}^{2} > \dot{p_{1}} + \dot{p_{2}}$	${\vec{P}}_{3} \in 𝝎$ and ${\vec{P}}_{4} \notin 𝝎$
$\cos {(θ_{2 K} - {\dot{β}}_{K})}^{2} = \dot{p_{1}} + \dot{p_{2}}$	${\vec{P}}_{3} = {\vec{P}}_{4} \in 𝝎$

Table 3: Duos $\vec{P}^{\prime},\vec{P}^{\prime\prime}$ of points from Table 1. Only one of these duos (unless two or more duos are the same) lies in $\bm{\omega}$ and expresses the interior point $\big(\dot{\vec{p}},\dot{\vec{q}}\,\big)$ through the convex combination (154) with a real number $\lambda$. The value of $\lambda$ corresponding to each duo is listed in the right column. One can confirm that $0<\lambda<1$ by noting that $0<\dot{p}_{1}<{\cos(\theta_{1J}-\dot{\beta}_{J})}^{2}$ and $0<\dot{p}_{2}<{\cos(\theta_{2K}-\dot{\beta}_{K})}^{2}$.

${\vec{P}}^{'}, {\vec{P}}^{''}$	$λ$
${\vec{P}}_{1}, {\vec{P}}_{3}$	$\frac{\dot{p_{1}}}{\cos {(θ_{1 J} - {\dot{β}}_{J})}^{2}}$
${\vec{P}}_{1}, {\vec{P}}_{4}$	$\frac{\cos {(θ_{2 K} - {\dot{β}}_{K})}^{2} - \dot{p_{2}}}{\cos {(θ_{1 J} - {\dot{β}}_{J})}^{2} + \cos {(θ_{2 K} - {\dot{β}}_{K})}^{2} - \dot{p_{1}} - \dot{p_{2}}}$
${\vec{P}}_{2}, {\vec{P}}_{3}$	$1 - \frac{\dot{p_{2}}}{\dot{p_{1}} + \dot{p_{2}}}$
${\vec{P}}_{2}, {\vec{P}}_{4}$	$1 - \frac{\dot{p_{2}}}{\cos {(θ_{2 K} - {\dot{β}}_{K})}^{2}}$

Table 4: A list of four points $\vec{P}=\big(0,p_{2},p_{3},\dot{\vec{p}}_{\mathrm{rest}},\dot{\vec{q}}\,\big)$, similar to Table 1. The upper bounds on $p_{2}$ [see (159)] specify the points $\vec{P}_{31}$ and $\vec{P}_{32}$, while the lower bounds determine $\vec{P}_{33}$ and $\vec{P}_{34}$. These bounds are stated in the middle column for $p_{2}$, and the corresponding $p_{3}$ is then obtained from (158) [see the right column].

$\vec{P}$	$p_{2}$	$p_{3}$
${\vec{P}}_{31}$	$\cos {(θ_{2 K} - {\dot{β}}_{K})}^{2}$	$\sum_{i = 1}^{3} {\dot{p}}_{i} - \cos {(θ_{2 K} - {\dot{β}}_{K})}^{2}$
${\vec{P}}_{32}$	$\sum_{i = 1}^{3} {\dot{p}}_{i}$	0
${\vec{P}}_{33}$	0	$\sum_{i = 1}^{3} {\dot{p}}_{i}$
${\vec{P}}_{34}$	$\sum_{i = 1}^{3} {\dot{p}}_{i} - \cos {(θ_{3 L} - {\dot{β}}_{L})}^{2}$	$\cos {(θ_{3 L} - {\dot{β}}_{L})}^{2}$

Table 5: The necessary and sufficient conditions, arising from the restraint (159), for a point of Table 4 to be in or out of the region $\mathbf{P}_{1}\subset\bm{\omega}$. The table is organized like Table 2.

If and only if	Then
$\cos {(θ_{2 K} - {\dot{β}}_{K})}^{2} < \sum_{i = 1}^{3} {\dot{p}}_{i}$	${\vec{P}}_{31} \in 𝐏_{1}$ and ${\vec{P}}_{32} \notin 𝐏_{1}$
$\cos {(θ_{2 K} - {\dot{β}}_{K})}^{2} > \sum_{i = 1}^{3} {\dot{p}}_{i}$	${\vec{P}}_{31} \notin 𝐏_{1}$ and ${\vec{P}}_{32} \in 𝐏_{1}$
$\cos {(θ_{2 K} - {\dot{β}}_{K})}^{2} = \sum_{i = 1}^{3} {\dot{p}}_{i}$	${\vec{P}}_{31} = {\vec{P}}_{32} \in 𝐏_{1}$
$\cos {(θ_{3 L} - {\dot{β}}_{L})}^{2} < \sum_{i = 1}^{3} {\dot{p}}_{i}$	${\vec{P}}_{33} \notin 𝐏_{1}$ and ${\vec{P}}_{34} \in 𝐏_{1}$
$\cos {(θ_{3 L} - {\dot{β}}_{L})}^{2} > \sum_{i = 1}^{3} {\dot{p}}_{i}$	${\vec{P}}_{33} \in 𝐏_{1}$ and ${\vec{P}}_{34} \notin 𝐏_{1}$
$\cos {(θ_{3 L} - {\dot{β}}_{L})}^{2} = \sum_{i = 1}^{3} {\dot{p}}_{i}$	${\vec{P}}_{33} = {\vec{P}}_{34} \in 𝐏_{1}$

Table 6: Depending on $\vec{P}_{3}$ and the conditions in Table 5, at most two separate points of Table 4 can belong to $\mathbf{P}_{1}$. The left column lists all such couples of points. To the right of each couple $\vec{P}^{\prime},\vec{P}^{\prime\prime}$ is the value of $\lambda$ that relates the couple (provided it is in $\mathbf{P}_{1}$) back to $\vec{P}_{3}=\lambda\vec{P}^{\prime}+(1-\lambda)\vec{P}^{\prime\prime}$. Taking $0<\dot{p}_{3}<{\cos(\theta_{3L}-\dot{\beta}_{L})}^{2}$ and $0<\dot{p}_{1}+\dot{p}_{2}\leq{\cos(\theta_{2K}-\dot{\beta}_{K})}^{2}$ (a condition that determines $\vec{P}_{3}\in\mathbf{P}_{1}$; see Table 2), one can check that each $\lambda$ lies in the interval $(0,1]$.

${\vec{P}}^{'}, {\vec{P}}^{''}$	$λ$
${\vec{P}}_{31}, {\vec{P}}_{33}$	$\frac{\dot{p_{1}} + \dot{p_{2}}}{\cos {(θ_{2 K} - {\dot{β}}_{K})}^{2}}$
${\vec{P}}_{31}, {\vec{P}}_{34}$	$\frac{\cos {(θ_{3 L} - {\dot{β}}_{L})}^{2} - \dot{p_{3}}}{\cos {(θ_{2 K} - {\dot{β}}_{K})}^{2} + \cos {(θ_{3 L} - {\dot{β}}_{L})}^{2} - \sum_{i = 1}^{3} {\dot{p}}_{i}}$
${\vec{P}}_{32}, {\vec{P}}_{33}$	$1 - \frac{\dot{p_{3}}}{\sum_{i = 1}^{3} {\dot{p}}_{i}}$
${\vec{P}}_{32}, {\vec{P}}_{34}$	$1 - \frac{\dot{p_{3}}}{\cos {(θ_{3 L} - {\dot{β}}_{L})}^{2}}$

Table 7: A set of four points $\vec{P}=\big({\cos(\theta_{1J}-\dot{\beta}_{J})}^{2},p_{2},p_{3},\dot{\vec{p}}_{\mathrm{rest}},\dot{\vec{q}}\,\big)$, like Tables 1 and 4. Here $\{\vec{P}_{11},\vec{P}_{12}\}$ and $\{\vec{P}_{13},\vec{P}_{14}\}$ are obtained with the upper and lower bounds in (D.2), respectively. These bounds are arranged in the center column, and $p_{3}$ is obtained from $p_{2}$ with (161).

$\vec{P}$	$p_{2}$	$p_{3}$
${\vec{P}}_{11}$	$\cos {(θ_{2 K} - {\dot{β}}_{K})}^{2}$	$\sum_{i = 1}^{3} {\dot{p}}_{i} - \cos {(θ_{1 J} - {\dot{β}}_{J})}^{2} - \cos {(θ_{2 K} - {\dot{β}}_{K})}^{2}$
${\vec{P}}_{12}$	$\sum_{i = 1}^{3} {\dot{p}}_{i} - \cos {(θ_{1 J} - {\dot{β}}_{J})}^{2}$	0
${\vec{P}}_{13}$	0	$\sum_{i = 1}^{3} {\dot{p}}_{i} - \cos {(θ_{1 J} - {\dot{β}}_{J})}^{2}$
${\vec{P}}_{14}$	$\sum_{i = 1}^{3} {\dot{p}}_{i} - \cos {(θ_{1 J} - {\dot{β}}_{J})}^{2} - \cos {(θ_{3 L} - {\dot{β}}_{L})}^{2}$	$\cos {(θ_{3 L} - {\dot{β}}_{L})}^{2}$

Table 8: If a case from the left column holds, then the corresponding consequence in the right column follows. All these cases are implications of (D.2)–(D.2). The table is built in the same way as Tables 2 and 5.

	If	Then
$K \neq J$		${\vec{P}}_{11} \notin 𝐑_{1 J}$ and ${\vec{P}}_{12} \in 𝐑_{1 J}$
$K = J$ and	$\cos {(θ_{1 J} - {\dot{β}}_{J})}^{2} + \cos {(θ_{2 K} - {\dot{β}}_{K})}^{2} < \sum_{i = 1}^{3} {\dot{p}}_{i}$	${\vec{P}}_{11} \in 𝐑_{1 J}$ and ${\vec{P}}_{12} \notin 𝐑_{1 J}$
	$\cos {(θ_{1 J} - {\dot{β}}_{J})}^{2} + \cos {(θ_{2 K} - {\dot{β}}_{K})}^{2} > \sum_{i = 1}^{3} {\dot{p}}_{i}$	${\vec{P}}_{11} \notin 𝐑_{1 J}$ and ${\vec{P}}_{12} \in 𝐑_{1 J}$
	$\cos {(θ_{1 J} - {\dot{β}}_{J})}^{2} + \cos {(θ_{2 K} - {\dot{β}}_{K})}^{2} = \sum_{i = 1}^{3} {\dot{p}}_{i}$	${\vec{P}}_{11} = {\vec{P}}_{12} \in 𝐑_{1 J}$
$L \neq J$		${\vec{P}}_{13} \in 𝐑_{1 J}$ and ${\vec{P}}_{14} \notin 𝐑_{1 J}$
$L = J$ and	$\cos {(θ_{1 J} - {\dot{β}}_{J})}^{2} + \cos {(θ_{3 L} - {\dot{β}}_{L})}^{2} < \sum_{i = 1}^{3} {\dot{p}}_{i}$	${\vec{P}}_{13} \notin 𝐑_{1 J}$ and ${\vec{P}}_{14} \in 𝐑_{1 J}$
	$\cos {(θ_{1 J} - {\dot{β}}_{J})}^{2} + \cos {(θ_{3 L} - {\dot{β}}_{L})}^{2} > \sum_{i = 1}^{3} {\dot{p}}_{i}$	${\vec{P}}_{13} \in 𝐑_{1 J}$ and ${\vec{P}}_{14} \notin 𝐑_{1 J}$
	$\cos {(θ_{1 J} - {\dot{β}}_{J})}^{2} + \cos {(θ_{3 L} - {\dot{β}}_{L})}^{2} = \sum_{i = 1}^{3} {\dot{p}}_{i}$	${\vec{P}}_{13} = {\vec{P}}_{14} \in 𝐑_{1 J}$

Table 9: Taking the case $K=J$ and $L=J$, we have four duos of points, and the table is arranged in the same manner as Tables 3 and 6. To the right of each duo, we place the $\lambda$ that relates the duo (when it is in $\mathbf{R}_{1J}$) to the point $\vec{P}_{1}=\lambda\vec{P}^{\prime}+(1-\lambda)\vec{P}^{\prime\prime}$. Given $0<\dot{p}_{i}<{\cos(\theta_{iJ}-\dot{\beta}_{J})}^{2}$ for $i=1,2,3$ and the condition ${\cos(\theta_{1J}-\dot{\beta}_{J})}^{2}\leq\dot{p}_{1}+\dot{p}_{2}$ that certifies $\vec{P}_{1}\in\mathbf{R}_{1J}$ [see Table 2], one can show that $0\leq\lambda<1$ in every case.

${\vec{P}}^{'}, {\vec{P}}^{''}$	$λ$
${\vec{P}}_{11}, {\vec{P}}_{13}$	$\frac{{\dot{p}}_{1} + {\dot{p}}_{2} - \cos {(θ_{1 J} - {\dot{β}}_{J})}^{2}}{\cos {(θ_{2 J} - {\dot{β}}_{J})}^{2}}$
${\vec{P}}_{11}, {\vec{P}}_{14}$	$\frac{\cos {(θ_{3 J} - {\dot{β}}_{J})}^{2} - {\dot{p}}_{3}}{\sum_{i = 1}^{3} (\cos {(θ_{i J} - {\dot{β}}_{J})}^{2} - {\dot{p}}_{i})}$
${\vec{P}}_{12}, {\vec{P}}_{13}$	$1 - \frac{{\dot{p}}_{3}}{\sum_{i = 1}^{3} {\dot{p}}_{i} - \cos {(θ_{1 J} - {\dot{β}}_{J})}^{2}}$
${\vec{P}}_{12}, {\vec{P}}_{14}$	$1 - \frac{{\dot{p}}_{3}}{\cos {(θ_{3 J} - {\dot{β}}_{J})}^{2}}$

Table 10: Four points $\vec{Q}=\big(\mathring{\vec{p}}\,,\dot{q}_{1},q_{2},q_{3},\dot{\vec{q}}_{\mathrm{rest}}\,\big)\in\mathbb{R}^{2d}$ that rest on the line specified by (176). From the point $\big(\mathring{\vec{p}}\,,\dot{\vec{q}}\,\big)$, the coordinate $q_{2}$ increases towards $\{\vec{Q}_{1},\vec{Q}_{2}\}$, while it decreases towards $\{\vec{Q}_{3},\vec{Q}_{4}\}$. The middle column carries the four bounds given in (177), and $q_{3}$ is then obtained with (176). The table is prepared in the same fashion as Tables 1, 4, and 7.

$\vec{Q}$	$q_{2}$	$q_{3}$
$\vec{Q}_{1}$	${\cos(\theta_{K2}-\mathring{\alpha}_{K})}^{2}$	$\dot{q}_{2}+\dot{q}_{3}-{\cos(\theta_{K2}-\mathring{\alpha}_{K})}^{2}$
$\vec{Q}_{2}$	$\dot{q}_{2}+\dot{q}_{3}$	0
$\vec{Q}_{3}$	0	$\dot{q}_{2}+\dot{q}_{3}$
$\vec{Q}_{4}$	$\dot{q}_{2}+\dot{q}_{3}-{\cos(\theta_{L3}-\mathring{\alpha}_{L})}^{2}$	${\cos(\theta_{L3}-\mathring{\alpha}_{L})}^{2}$

Table 11: Group of conditions for the case (180), where $\mathring{\alpha}_{1}=\theta_{11}-\dot{\beta}_{1}$. A condition in the left column implies what is on its right. These conditions originate from (177) and the discussion around (190). At most two conditions can hold simultaneously, so no more than two distinct points of Table 10 can be a part of $\bm{\omega}$. The table is laid out like Table 8.

	If	Then
$K = s$		$\vec{Q}_{2} \in \bm{\omega}$
$K = 1$,	${\cos(\theta_{K2}-\mathring{\alpha}_{K})}^{2} < \dot{q}_{2}+\dot{q}_{3}$	$\vec{Q}_{1} \in \bm{\omega}$ and $\vec{Q}_{2} \notin \bm{\omega}$
	${\cos(\theta_{K2}-\mathring{\alpha}_{K})}^{2} > \dot{q}_{2}+\dot{q}_{3}$	$\vec{Q}_{1} \notin \bm{\omega}$ and $\vec{Q}_{2} \in \bm{\omega}$
	${\cos(\theta_{K2}-\mathring{\alpha}_{K})}^{2} = \dot{q}_{2}+\dot{q}_{3}$	$\vec{Q}_{1} = \vec{Q}_{2} \in \bm{\omega}$
$L = s$		$\vec{Q}_{3} \in \bm{\omega}$
$L = 1$,	${\cos(\theta_{L3}-\mathring{\alpha}_{L})}^{2} < \dot{q}_{2}+\dot{q}_{3}$	$\vec{Q}_{3} \notin \bm{\omega}$ and $\vec{Q}_{4} \in \bm{\omega}$
	${\cos(\theta_{L3}-\mathring{\alpha}_{L})}^{2} > \dot{q}_{2}+\dot{q}_{3}$	$\vec{Q}_{3} \in \bm{\omega}$ and $\vec{Q}_{4} \notin \bm{\omega}$
	${\cos(\theta_{L3}-\mathring{\alpha}_{L})}^{2} = \dot{q}_{2}+\dot{q}_{3}$	$\vec{Q}_{3} = \vec{Q}_{4} \in \bm{\omega}$

Table 12: Collection of duplets $\vec{Q}^{\prime},\vec{Q}^{\prime\prime}$ of points from Table 10. Only one of these duplets (except if two or more are the same) belongs to $\bm{\omega}$ and represents the point $\big(\mathring{\vec{p}}\,,\dot{\vec{q}}\,\big)$ with the convex combination $\lambda\,\vec{Q}^{\prime}+(1-\lambda)\,\vec{Q}^{\prime\prime}$. Here we assume $K=1$ and $L=1$; otherwise $\vec{Q}_{1}$ and $\vec{Q}_{4}$ cannot belong to $\bm{\omega}$ without being equal to $\vec{Q}_{2}$ and $\vec{Q}_{3}$, respectively [see Table 11]. The right column has the value of $\lambda$ for each duplet, provided the duplet lies in $\bm{\omega}$. One can check that $\lambda\in[0,1]$ with $0<\dot{q}_{2}\leq{\cos(\theta_{K2}-\mathring{\alpha}_{K})}^{2}$ and $0<\dot{q}_{3}\leq{\cos(\theta_{L3}-\mathring{\alpha}_{L})}^{2}$ [see (177)].

$\vec{Q}^{\prime},\vec{Q}^{\prime\prime}$	$\lambda$
$\vec{Q}_{1},\vec{Q}_{3}$	$\frac{\dot{q}_{2}}{{\cos(\theta_{12}-\mathring{\alpha}_{1})}^{2}}$
$\vec{Q}_{1},\vec{Q}_{4}$	$\frac{{\cos(\theta_{13}-\mathring{\alpha}_{1})}^{2}-\dot{q}_{3}}{{\cos(\theta_{12}-\mathring{\alpha}_{1})}^{2}+{\cos(\theta_{13}-\mathring{\alpha}_{1})}^{2}-\dot{q}_{2}-\dot{q}_{3}}$
$\vec{Q}_{2},\vec{Q}_{3}$	$1-\frac{\dot{q}_{3}}{\dot{q}_{2}+\dot{q}_{3}}$
$\vec{Q}_{2},\vec{Q}_{4}$	$1-\frac{\dot{q}_{3}}{{\cos(\theta_{13}-\mathring{\alpha}_{1})}^{2}}$

Equations (383)

$$\mathcal{B}_{a}:=\big\{|a_{i}\rangle\big\}_{i=1}^{d}\quad\text{and}\quad\mathcal{B}_{b}:=\big\{|b_{j}\rangle\big\}_{j=1}^{d}$$

$$N$$

$$p_{i}=|\langle a_{i}|\psi\rangle|^{2}\quad\text{and}\quad q_{j}=|\langle b_{j}|\psi\rangle|^{2}$$

$$\alpha_{i}=\arccos|\langle a_{i}|\psi\rangle|\quad\text{and}\quad\beta_{j}=\arccos|\langle b_{j}|\psi\rangle|$$

$$\sum_{i=1}^{d}p_{i}=1$$

$$0\leq p_{i}$$

$$\sum_{j=1}^{d}q_{j}=1$$

$$0\leq q_{j}$$

$$r_{ij}=|\langle a_{i}|b_{j}\rangle|^{2}\qquad\big(1\leq i,j\leq d\big)$$

$$\theta_{ij}=\arccos|\langle a_{i}|b_{j}\rangle|$$

$$R:=\begin{pmatrix}r_{11}&\cdots&r_{1d}\\ \vdots&\ddots&\vdots\\ r_{d1}&\cdots&r_{dd}\end{pmatrix}\quad\text{and}\quad\varTheta:=\begin{pmatrix}\theta_{11}&\cdots&\theta_{1d}\\ \vdots&\ddots&\vdots\\ \theta_{d1}&\cdots&\theta_{dd}\end{pmatrix}$$

$$|\theta_{ij}-\beta_{j}|\leq\alpha_{i}\leq\theta_{ij}+\beta_{j}$$

$$\theta_{ij}\leq\alpha_{i}+\beta_{j}\quad\text{for every}\ 1\leq i,j\leq d.$$

$$\sqrt{p_{i}\,q_{j}}\leq\sqrt{r_{ij}}+\sqrt{(1-p_{i})(1-q_{j})}$$

$$p_{i}+q_{j}\leq r_{ij}+1+2\sqrt{r_{ij}\,(1-p_{i})(1-q_{j})}$$

$$\alpha_{i}=\theta_{i1}-\beta_{1}\quad\text{for all}\ i=1,\dots,m$$

$$p(\beta_{1})$$

$$q(\beta_{1})$$

$$p_{s}$$

$$q_{t}$$

$$0$$

$$p_{s}(\beta^{\prime})={\cos(\theta_{s1}-\beta^{\prime})}^{2}$$

$$\beta^{\prime\prime}=\frac{\theta_{11}-\theta_{1t}}{2}+\frac{\pi}{4}$$

$$p_{s}(\beta^{\prime\prime})=0$$

$$\beta_{j}$$

$$p(\alpha_{1})$$

$$q(\alpha_{1})$$

$$p_{s}$$

$$q_{t}$$

$$q_{t}(\alpha^{\prime})={\cos(\theta_{1t}-\alpha^{\prime})}^{2}$$

$$\alpha^{\prime\prime}=\frac{\theta_{11}-\theta_{s1}}{2}+\frac{\pi}{4}$$

$$q_{t}(\alpha^{\prime\prime})=0$$

$$d\sum_{m=1}^{d-1}\frac{d!}{m!\,(d-m)!}\,(d-1)(d-m)$$

$$d^{2}(d-1)^{2}$$

$$=d^{2}(d-1)\big[2^{d}-(d+1)\big]$$

$$u(p):=\sum_{i=1}^{d}\sqrt{p_{i}}$$

$$\frac{\partial^{2}u}{\partial p_{k}\,\partial p_{l}}=-\frac{1}{4}\left(\frac{1}{p_{l}^{3/2}}\,\delta_{lk}+\frac{1}{p_{d}^{3/2}}\right)=\frac{\partial^{2}u}{\partial p_{l}\,\partial p_{k}},$$

$$u(q)=\sum_{j=1}^{d}\sqrt{q_{j}},$$


Full text

Arun Sehrawat

Department of Physical Sciences, Indian Institute of Science Education & Research (IISER) Mohali, Sector 81 SAS Nagar, Manauli PO 140306, Punjab, India

Abstract

The Born rule provides a probability vector (distribution) with a quantum state for a measurement setting. For two settings, we have a pair of vectors from the same quantum state. Each pair forms a combined-probability vector that obeys certain quantum constraints, which are triangle inequalities in our case. Such a restricted set of combined vectors, titled combined-probability space, is presented here for a $d$-level quantum system (qudit). The combined space turns out to be a compact convex subset of a Euclidean space, and all its extreme points come from a family of parametric curves. Considering a suitable concave function on the combined space to estimate the uncertainty, we deliver an uncertainty relation by finding its global minimum at the curves for a qudit. If one chooses an appropriate concave (or convex) function, then there is no need to search for the absolute minimum (maximum) on the whole space; it will be at the parametric curves. So these curves are quite useful for establishing an uncertainty (or a certainty) relation for a general pair of settings. In the paper, we also demonstrate that many known tight (un)certainty relations for a qubit can be obtained with the triangle inequalities.

I Introduction

Every setting for a measurement on a quantum system can be completely specified by an orthonormal basis of the system’s Hilbert space. Identical systems can be independently prepared in a (pure) state $\rho$ such that, every time, we get a definite outcome when a system is measured in a setting $a$. If we change $a$ to a physically distinct setting $b$, then we observe multiple outcomes, sometimes one and sometimes another. In other words, the probability is one for an outcome in the $a$-setting, whereas none of the probabilities is one in the $b$-setting. Of course, in any setting, all the probabilities are nonnegative numbers that sum up to one. Apart from that, the probability vectors (distributions) $\vec{p}$ and $\vec{q}$, associated with the two settings $a$ and $b$, respectively, must together obey certain constraints, called quantum constraints (QCs).

Historically, such QCs are expressed in terms of uncertainty relations (URs) by taking Hermitian operators rather than orthonormal bases. A UR is an inequality, $\mathsf{c}(a,b,\rho)\leq\mathsf{u}(a,b,\rho)$, between two real-valued functions: an uncertainty measure $\mathsf{u}$ and its lower bound $\mathsf{c}$. In 1927, Heisenberg introduced the first UR [Heisenberg27; Wheeler83] (derived by Weyl in [Weyl32]) for the position and momentum operators. Different aspects of his seminal work are reviewed in [Busch07]. Robertson [Robertson29] generalized Heisenberg’s relation to an arbitrary pair of operators by employing the standard deviation as a measure of uncertainty. In Robertson’s UR, the lower bound $\mathsf{c}$ is a function of the state $\rho$. Deutsch criticized it and introduced a new UR [Deutsch83] for a finite-dimensional state space by taking entropy as a measure of uncertainty. He achieved a state-independent $\mathsf{c}(a,b)$. Later, a better lower bound was conjectured by Kraus [Kraus87] and then proved by Maassen and Uffink [Maassen88]. Such URs, known as entropy URs, are reviewed in [Wehner10; Bialynicki11; Coles17].

Throughout the article, we consider $d$-level quantum systems (qudits) and projective measurements. Our primary objective is to study a set of combined-probability vectors $(\vec{p},\vec{q}\,)$, called combined-probability space, where every vector respects certain, if not all, QCs. Here the elemental QCs are the triangle inequalities (TIs) between quantum angles, and the (un)certainty relations emerge from them. Since the angle between a pair of kets, called the quantum angle, is a metric on the set of all pure states [Wootters81], the TIs hold. Landau and Pollak obtained a single TI of this kind [Landau61] for continuous-time signals and provided a classical UR (see also Sec. 8 in [Folland97]).

In Sec. II, we present the combined space, which is a compact convex subset of the $2d$-dimensional real vector space $\mathbb{R}^{2d}$. Thanks to the Krein-Milman theorem (see Theorem 3.3.5 and Appendix A.3 in [Niculescu93]), every compact convex subset of $\mathbb{R}^{2d}$ can be generated by the convex combinations of its extreme points. As a principal result, we provide a family of parametric curves in Sec. II, which represents all the extreme points of the combined space. In the case of $d=2$, all the parametric curves form an ellipse, and the same ellipse also appears in [Lenard72; Larsen90; Kaniewski14] as a special case.

An uncertainty measure $\mathsf{u}(a,b,\rho)\equiv\mathsf{u}(\vec{p},\vec{q}\,)$ should be a concave function on the combined-probability space, as argued at the beginning of Sec. III. The concavity of $\mathsf{u}$ ensures that its global minimum $\mathsf{c}$ occurs at the parametric curves (extreme points) of the space (see Theorem 3.4.7 and Appendix A.3 in [Niculescu93]). Hence, one can exploit these curves to obtain a UR, rather easily, for any $\mathsf{u}$ of one's liking and for general measurement settings $a$ and $b$.

In Sec. III, we choose a concave function $\mathfrak{u}(\vec{p},\vec{q}\,)$ as the uncertainty measure. The significance of our choice lies in the fact that $\mathfrak{u}$ is again a concave function on every parametric curve (that is, as a function of the parameter). Therefore its absolute minimum $\mathfrak{c}$ occurs nowhere but at the endpoint(s) of these curves. A simple three-step procedure is given to find the lower bound $\mathfrak{c}\leq\mathfrak{u}$ for an arbitrary pair $\{a,b\}$ of settings and for a finite $d$. One can run the procedure on an ordinary computer. Besides, $\mathfrak{c}$ is presented in analytic form for $d=2,3$ and in the case of mutually unbiased bases (MUBs) [Durt10]. References [Kraus87; Larsen90; Ivanovic92; Sanchez-Ruiz95; Ballester07; Wu09; Mandayam10] contain URs specifically for MUBs. At the end of Sec. III, we provide another uncertainty measure that is also concave on all the parametric curves, so the whole analysis given for $\mathfrak{u}$ can be applied straightforwardly to this measure.

If a suitable concave function can be a measure of uncertainty, then an appropriate convex function will be a measure of certainty. In Sec. IV, we pick some other concave and convex functions and show that the tight (un)certainty relations given in [Rastegin12; Larsen90; Busch14; Garrett90; Sanchez-Ruiz98; Ghirardi03; Bosyk12; Vicente05; Zozor13; Deutsch83; Maassen88] for a qubit can be obtained from the TIs that specify the ellipse. We conclude the article with Sec. V.

The appendices are kept for certain technical details and proofs. The TIs are derived in Appendix A. Appendix B shows that the combined space is a compact convex set. The parametric curves are explicitly obtained in Appendix D with the help of Appendix C.

II Quantum constraints and combined-probability space

In quantum theory, observables are represented by Hermitian operators. If such an operator is degenerate, then it possesses more than one eigenbasis, and some of these eigenbases can represent physically different measurement setups. Hence, ‘measurement in an orthonormal basis’ of the underlying Hilbert space is better defined than ‘a measurement of an operator’ (see Chapter 7 in [Peres93]). In fact, a measurement in a basis $\mathcal{B}_{a}$ measures all the operators whose eigenbasis is $\mathcal{B}_{a}$. Moreover, Deutsch pointed out that a measure of uncertainty for a discrete observable must not depend on its eigenvalues, but on its eigenbasis [Deutsch83]. With all these considerations, we choose orthonormal bases instead of Hermitian operators to specify different projective measurements for a qudit.

We begin with two orthonormal bases

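$$\mathcal{B}_{a}:=\big\{|a_{i}\rangle\big\}_{i=1}^{d}\quad\text{and}\quad\mathcal{B}_{b}:=\big\{|b_{j}\rangle\big\}_{j=1}^{d}\tag{1}$$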

of a $d$-dimensional Hilbert space $\mathscr{H}_{d}$ to depict the two measurement settings $a$ and $b$, respectively. In this paper, all (un)certainty relations are preparation (un)certainty relations that are applicable in the following experimental scheme.

[TABLE]

Peres used a similar scenario in his book [Peres93] (page 93) to interpret the position-momentum UR. In scheme (2), clearly, the two measurements have no influence whatsoever on each other.

Throughout the text, we assume that $\rho$ is a pure quantum state $|\psi\rangle\langle\psi|$ so that we can associate angles (4) and TIs (12) with the state vector $|\psi\rangle$. Nevertheless, every (un)certainty relation presented in this paper is applicable, as it is, to every state of a qudit [see the text around (39)].

The state ${\rho=|\psi\rangle\langle\psi|}$ provides two probability distributions for the two measurement settings [given in (1)] by the Born rule:

$$p_{i}=|\langle a_{i}|\psi\rangle|^{2}\quad\text{and}\quad q_{j}=|\langle b_{j}|\psi\rangle|^{2}\tag{3}$$

are the probabilities of getting outcome $a_{i}$ in the $a$-setting and outcome $b_{j}$ in the $b$-setting, respectively. Next, we present quantum angles:

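$$\alpha_{i}=\arccos|\langle a_{i}|\psi\rangle|\quad\text{and}\quad\beta_{j}=\arccos|\langle b_{j}|\psi\rangle|\tag{4}$$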

are the angles between $|\psi\rangle$ and $|a_{i}\rangle$ and between $|\psi\rangle$ and $|b_{j}\rangle$, respectively. In the entire article, we consider only the principal values $[0,\pi]$ of the (multivalued) $\arccos$ function. With (3) and (4), one can recognize that the absolute value of the inner product establishes a one-to-one correspondence between the angles, which belong to $[0,\tfrac{\pi}{2}]$, and the probabilities, which lie in $[0,1]$.
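As a minimal numerical sketch of the two definitions above, the following Python snippet evaluates the Born probabilities $p_{i}=|\langle a_{i}|\psi\rangle|^{2}$ and the quantum angles $\alpha_{i}=\arccos|\langle a_{i}|\psi\rangle|$ for one setting. The helper names `inner`, `born_probabilities`, and `quantum_angles`, as well as the example qubit state, are illustrative assumptions and do not come from the paper.

```python
import math

def inner(u, v):
    """Inner product <u|v>, conjugate-linear in the first argument."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

def born_probabilities(basis, psi):
    """p_i = |<a_i|psi>|^2 for every ket a_i of the basis."""
    return [abs(inner(ket, psi)) ** 2 for ket in basis]

def quantum_angles(basis, psi):
    """alpha_i = arccos|<a_i|psi>|; min() guards against rounding above 1."""
    return [math.acos(min(1.0, abs(inner(ket, psi)))) for ket in basis]

# Illustrative qubit example (an assumption, not from the paper):
# computational basis and the state (|0> + |1>)/sqrt(2).
s = 1 / math.sqrt(2)
basis_a = [[1 + 0j, 0j], [0j, 1 + 0j]]
psi = [s + 0j, s + 0j]

p = born_probabilities(basis_a, psi)
alpha = quantum_angles(basis_a, psi)
print(p, alpha)
```

Because each angle lies in $[0,\tfrac{\pi}{2}]$, the probabilities can be recovered from the angles as $p_{i}=\cos^{2}\alpha_{i}$, which is the one-to-one correspondence mentioned in the text.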

Related to the $a$-setting, every probability vector $\vec{p}:=(p_{1},\ldots,p_{d})$ satisfies

$$\sum_{i=1}^{d}p_{i}=1,\tag{5}$$

$$0\leq p_{i}\quad\text{for every}\ 1\leq i\leq d,\tag{6}$$

and the collection of all such vectors constitutes a probability space $\Omega_{a}$. Similarly, $\Omega_{b}$, related to the basis $\mathcal{B}_{b}$, is defined by the constraints

$$\sum_{j=1}^{d}q_{j}=1,\tag{7}$$

$$0\leq q_{j}\quad\text{for every}\ 1\leq j\leq d.\tag{8}$$

Equations (5) and (7) state that all the probabilities add up to one, and inequalities (6) and (8) state that probabilities are nonnegative numbers. Both $\Omega_{a}$ and $\Omega_{b}$ are standard $(d-1)$-simplices, hence compact convex subsets of the $d$-dimensional real vector space $\mathbb{R}^{d}$, and their Cartesian product $\mathbf{\Omega}:=\Omega_{a}\times\Omega_{b}$ is a compact convex subset of $\mathbb{R}^{2d}$ [see Appendix B]. Basically, $\mathbf{\Omega}$ is determined by the conditions (5)–(8).

Performing a measurement on every qudit using a single setting, say $a$, is like throwing a $d$-sided die every time. The vector $\vec{p}$ alone is limited by (5) and (6), which specify $\Omega_{a}$, and $\Omega_{a}$ is also the probability space of a $d$-sided die. The experimental scheme (2), however, is not similar to throwing one of two $d$-sided dice at a time, although $\mathbf{\Omega}$ is the probability space of two dice: every pure or mixed state of a qudit gives a unique pair $(\vec{p},\vec{q}\,)\in\mathbf{\Omega}$ by the Born rule [see (3) and (39)], but not every pair $(\vec{p},\vec{q}\,)\in\mathbf{\Omega}$ has a quantum state. For example, if $|\langle a_{i}|b_{j}\rangle|\neq 1$ for some $i,j$, then one cannot always get the same outcomes: $a_{i}$ in the $a$-setting and $b_{j}$ in the $b$-setting. In other words, it is impossible to prepare a quantum system in a state that provides $(\vec{p},\vec{q}\,)$ with $p_{i}=1=q_{j}$, which identifies an extreme point of $\mathbf{\Omega}$; in this case, no such quantum state exists.
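The claim that some pairs in $\mathbf{\Omega}$ have no quantum state can be illustrated with a small numerical scan. The sketch below assumes a qubit, takes the computational basis for the $a$-setting and the Hadamard basis for the $b$-setting (so that $|\langle a_{1}|b_{1}\rangle|^{2}=1/2$), and scans only real-amplitude states $\cos\varphi\,|0\rangle+\sin\varphi\,|1\rangle$ on a grid. These choices and the function name `probabilities` are assumptions made for illustration; the scan is not a proof.

```python
import math

def probabilities(phi):
    """Return (p_1, q_1) for the real qubit state cos(phi)|0> + sin(phi)|1>.

    Setting a is the computational basis and setting b is the Hadamard
    basis, an illustrative choice that is not taken from the paper.
    """
    c, s = math.cos(phi), math.sin(phi)
    p1 = c ** 2                         # |<0|psi>|^2
    q1 = ((c + s) / math.sqrt(2)) ** 2  # |<+|psi>|^2
    return p1, q1

# Scan phi over [0, pi] and record the largest value of p_1 + q_1.
steps = 3600
best_sum = max(sum(probabilities(math.pi * k / steps)) for k in range(steps + 1))
print(best_sum)
```

On this grid the largest value of $p_{1}+q_{1}$ stays strictly below 2, so the point with $p_{1}=1=q_{1}$ is never reached, in line with the argument in the text.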

So, other than (5)–(8), there are certain constraints that are purely quantum mechanical in nature and must be obeyed by $\vec{p}$ and $\vec{q}$ together. In our case, QCs are the TIs given in (12), which arise naturally from the structure of Hilbert space on which quantum theory is based. To write the TIs, we need

$$r_{ij}:=|\langle a_{i}|b_{j}\rangle|^{2}\,,\qquad(9)$$

that is the probability of getting outcome ${a_{i}}$ if ${|b_{j}\rangle\langle b_{j}|}$ (or ${b_{j}}$ if ${|a_{i}\rangle\langle a_{i}|}$ ) is our state for the system. Like $\alpha_{i}$ and $\beta_{j}$ in (4),

$$\theta_{ij}:=\arccos\sqrt{r_{ij}}\qquad(10)$$

is the angle between the pure states ${|a_{i}\rangle\langle a_{i}|}$ and ${|b_{j}\rangle\langle b_{j}|}$ . In the subscripts of $r_{ij}$ and $\theta_{ij}$ , from left, the first and second indices are reserved for $\mathcal{B}_{a}$ and $\mathcal{B}_{b}$ , respectively. Therefore, note that ${r_{ji}=|\langle a_{j}|b_{i}\rangle|^{2}}$ is different from $r_{ij}$ , and likewise for $\theta$ .

After choosing the measurement settings, $\mathcal{B}_{a}$ and $\mathcal{B}_{b}$ in (1), the entries in

[TABLE]

get fixed by (9) and (10). Each entry in $R$ and in $\varTheta$ belongs to ${[0,1]}$ and ${[0,\tfrac{\pi}{2}]}$ , respectively. The sum of all the entries in each row and in each column of $R$ is one, thus it is a doubly stochastic matrix. If the two measurement settings described by (1) are physically the same, then $R$ will be a permutation matrix. For every state vector ${|\psi\rangle\in\mathscr{H}_{d}}$ , there are three TIs

[TABLE]

attached to each entry in $\varTheta$ . These TIs [see (114)] are derived in Appendix A.
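As a minimal sketch of how the matrices $R$ and $\varTheta$ are fixed by the two bases, the following pure-Python snippet evaluates ${r_{ij}=|\langle a_{i}|b_{j}\rangle|^{2}}$ and ${\theta_{ij}=\arccos\sqrt{r_{ij}}}$ and checks that $R$ is doubly stochastic. The helper names (`inner`, `overlap_matrices`, `fourier_basis`) and the choice of the computational and discrete Fourier bases for ${d=3}$ are illustrative assumptions, not taken from the paper.

```python
import cmath
import math

def inner(u, v):
    """Inner product <u|v> for kets stored as lists of complex numbers."""
    return sum(x.conjugate() * y for x, y in zip(u, v))

def overlap_matrices(basis_a, basis_b):
    """Return (R, Theta) with r_ij = |<a_i|b_j>|^2 and theta_ij = arccos(sqrt(r_ij))."""
    R = [[abs(inner(a, b)) ** 2 for b in basis_b] for a in basis_a]
    # min(1.0, ...) guards against rounding slightly above 1 before arccos
    Theta = [[math.acos(min(1.0, math.sqrt(r))) for r in row] for row in R]
    return R, Theta

def fourier_basis(d):
    """Discrete Fourier basis in dimension d (illustrative choice of second basis)."""
    w = cmath.exp(2j * math.pi / d)
    return [[w ** (j * k) / math.sqrt(d) for k in range(d)] for j in range(d)]

d = 3
computational = [[1.0 + 0j if k == i else 0j for k in range(d)] for i in range(d)]
R, Theta = overlap_matrices(computational, fourier_basis(d))

# Row and column sums of R; each should equal one for a doubly stochastic matrix.
row_sums = [sum(row) for row in R]
col_sums = [sum(R[i][j] for i in range(d)) for j in range(d)]
print(row_sums, col_sums)
```

For this particular pair every entry of $R$ equals $\tfrac{1}{d}$ , so every entry of $\varTheta$ is the same angle.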

For simplicity, out of the three TIs (12), here we choose only one

$$\theta_{ij}\leq\alpha_{i}+\beta_{j}\,.\qquad(13)$$

Angles $\alpha_{i}$ and $\beta_{j}$ vary, whereas $\theta_{ij}$ is fixed, as we change the state vector ${|\psi\rangle}$ . The kets that saturate TI (13) for certain $i,j$ lie in the linear span of ${\{|a_{i}\rangle,|b_{j}\rangle\}}$ [consider (108) and (109) with ${0\leq\beta\leq\theta}$ from Appendix A]. In the triangle equality (TE) ${\theta_{ij}=\alpha_{i}+\beta_{j}}$ , $\alpha_{i}$ and $\beta_{j}$ are reminiscent of complementary angles from planar geometry, and ${0\leq\alpha_{i},\beta_{j}\leq\theta_{ij}}$ . Identifying $f$ , $D$ , and $B$ in Landau61 with our ${|\psi\rangle}$ , ${|a\rangle\langle a|}$ , and ${|b\rangle\langle b|}$ , respectively, one can see that the TI ${\theta\leq\alpha+\beta}$ was obtained by Landau and Pollak for continuous-time signals (see also Sec. 8 in Folland97 ). They also plotted elliptic curves (for different values of $\theta$ ); one curve of this kind is shown in Fig. 1 between the points $E_{1}$ and $E_{2}$ (see also Lenard72 ). The results in Landau61 ; Lenard72 are more general than those here, but they are only for a pair of projectors. In contrast, we take every possible pair ${|a_{i}\rangle\langle a_{i}|}$ and ${|b_{j}\rangle\langle b_{j}|}$ and present three TIs [see (12)], not just one, for each pair.

The cosine function is strictly decreasing on ${[0,\pi]}$ , so applying it on both sides of TI (13) and using (3), (4), (9), and (10), we attain

[TABLE]

after a rearrangement of terms. As both sides in (14) are nonnegative functions of the probabilities, squaring and further simplification lead to

[TABLE]

for every ${1\leq i,j\leq d}$ .

All those pairs ${(\vec{p},\vec{q}\,)\in\mathbf{\Omega}}$ that obey QC (15) for every ${1\leq i,j\leq d}$ build the combined-probability space $\bm{\omega}$ for the two measurement bases in (1). In the case of ${d>2}$ , even if we consider all TIs given in (12) for each ${1\leq i,j\leq d}$ , they do not capture the full QCs for a general pair of settings. Therefore, one can still find some ${(\vec{p},\vec{q}\,)\in\bm{\omega}}$ that corresponds to no quantum state. Nevertheless, our analysis relies on the following fact: every ${(\vec{p},\vec{q}\,)}$ that does not belong to $\bm{\omega}$ cannot be obtained from a quantum state, thus it is discarded. Investigating the space $\bm{\omega}_{{\textsc{q}}}$ , which contains all those, and only those, pairs ${(\vec{p},\vec{q}\,)}$ that originate from quantum states, is not the aim of this paper. However, it is not hard to see that ${\bm{\omega}_{{\textsc{q}}}=\bm{\omega}}$ for ${d=2}$ ; in general, ${\bm{\omega}_{{\textsc{q}}}\subseteq\bm{\omega}}$ .

Note that $\bm{\omega}$ is a proper subset of $\mathbf{\Omega}$ . To prove this one can show: only one out of the two extreme points—specified by ${p_{i}=1=q_{j}}$ and ${p_{i}=1=q_{l}}$ , where ${j\neq l}$ —of $\mathbf{\Omega}$ can belong to $\bm{\omega}$ . Recall that if and only if ${r_{ij}=1}$ then the point described by ${p_{i}=1=q_{j}}$ belongs to $\bm{\omega}$ , otherwise ${\theta_{ij}\leq\alpha_{i}+\beta_{j}}$ will be violated. Secondly, if ${r_{ij}=1}$ then ${r_{il}=0}$ , and ${\theta_{il}\leq\alpha_{i}+\beta_{l}}$ cannot be obeyed by the other point; hence that stays outside of $\bm{\omega}$ .
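The role of the TI ${\theta_{ij}\leq\alpha_{i}+\beta_{j}}$ can be illustrated numerically. The sketch below, under stated assumptions, draws random pure qubit states, computes their Born probabilities, and checks the TI in angle form; it then tests the extreme point ${p_{1}=1=q_{1}}$ of $\mathbf{\Omega}$ , which should fail when ${r_{11}<1}$ . The function names, the rotated real basis for the $b$ -setting, and the numerical tolerance are hypothetical choices made only for illustration.

```python
import cmath
import math
import random

def angle(prob):
    """Angle in [0, pi/2] tied to a probability through prob = cos(angle)**2."""
    return math.acos(math.sqrt(min(1.0, max(0.0, prob))))

def satisfies_ti(p, q, R, tol=1e-6):
    """Check theta_ij <= alpha_i + beta_j for all i, j (only this one TI is tested)."""
    d = len(p)
    return all(
        angle(R[i][j]) <= angle(p[i]) + angle(q[j]) + tol
        for i in range(d)
        for j in range(d)
    )

def qubit_pair(t, phi, theta):
    """Born probabilities of |psi> = (cos t, e^{i phi} sin t) in two qubit bases.

    Basis a is the computational basis; basis b is rotated by theta
    (an illustrative choice), so that r_11 = cos(theta)**2.
    """
    psi = (math.cos(t), cmath.exp(1j * phi) * math.sin(t))
    b1 = (math.cos(theta), math.sin(theta))
    p1 = min(1.0, abs(psi[0]) ** 2)
    q1 = min(1.0, abs(b1[0] * psi[0] + b1[1] * psi[1]) ** 2)
    return (p1, 1.0 - p1), (q1, 1.0 - q1)

theta = math.pi / 6  # r_11 = 3/4
c2, s2 = math.cos(theta) ** 2, math.sin(theta) ** 2
R = [[c2, s2], [s2, c2]]

rng = random.Random(0)
quantum_ok = all(
    satisfies_ti(*qubit_pair(rng.uniform(0, math.pi), rng.uniform(0, 2 * math.pi), theta), R)
    for _ in range(1000)
)
# Extreme point of Omega with p_1 = 1 = q_1: alpha_1 = beta_1 = 0 but theta_11 > 0.
vertex_ok = satisfies_ti((1.0, 0.0), (1.0, 0.0), R)
print(quantum_ok, vertex_ok)
```

The tolerance only absorbs floating-point error near saturation of the TI; it is not part of the constraint itself.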

The space $\bm{\omega}$ , determined by the conditions (5)–(8) and (15), is a compact and convex subset of $\mathbb{R}^{2d}$ [for a proof, see Appendix B]. Every point of such a set can be written as a convex combination of its extreme points due to the Krein-Milman theorem (see Theorem ${3.3.5}$ and Appendix A.3 in Niculescu93 ). We begin our journey from an interior point of $\bm{\omega}$ in Appendix D.1 and arrive at its extreme points at the end of Appendix D.3. There it is concluded that the set of all extreme points of $\bm{\omega}$ comes from a family of parametric curves.

One can skip all those technical details and start constructing the parametric curves straight from the conclusion (197): the first step is to pick a set of $m$ angles from a single column or row of the matrix $\varTheta$ given in (11). Such a set is called $m$ -set, and ${1\leq m\leq d-1}$ . For instance, we pick the top $m$ angles ${\{\theta_{i1}\}_{i=1}^{m}}$ from the first column. Then we associate $m$ TEs with the $m$ -set as

[TABLE]

by taking $\beta_{1}$ , where the subscript 1 reflects the selected column.

Next, with (3) and (4), we assign ${m+1}$ probabilities to the angles: ${p_{i}={\cos\alpha_{i}}^{2}}$ and ${q_{1}={\cos\beta_{1}}^{2}}$ . They create the probability vectors

[TABLE]

One can observe that ${\big{(}\vec{p}{\scriptstyle(\beta_{1})}\,,\,\vec{q}{\scriptstyle(\beta_{1})}\big{)}}$ serves as a vector-valued function of a single real parameter $\beta_{1}$ , thus it exhibits a parametric curve. Since the curve is associated with an $m$ -set and all its points obey $m$ TEs (16), we call it an $m$ -parametric curve.

A part of the curve, identified by the upper and lower limits ${\beta^{\prime}\leq\beta_{1}\leq\beta^{\prime\prime}}$ , lies in $\bm{\omega}$ and represents its extreme points because ${\big{(}\vec{p}{\scriptstyle(\beta_{1})}\,,\,\vec{q}{\scriptstyle(\beta_{1})}\big{)}}$ cannot be written as a convex combination of other points of $\bm{\omega}$ . In Appendix D.4, we realize that the two limits are fixed by

[TABLE]

[see (215)]. Equations (22) and (24) are like Eq. (214), whose roots are stated in (221). The root with the + sign always delivers the correct limit [for justifications, see the last paragraph in Appendix D.4].

If one chooses an $m$ -set from a row of $\varTheta$ , say ${\{\theta_{1j}\}_{j=1}^{m}}$ , then the $m$ -parametric curve is constructed as

[TABLE]

Now the parameter is ${\alpha_{1}\in[\alpha^{\prime},\alpha^{\prime\prime}]}$ , and the limits are determined by

[TABLE]

One can check that, for ${m=1}$ , both (16)–(23) and (25)–(31) describe the same thing, provided $s$ and $t$ are identical in both the cases. So an $m$ -parametric curve is identified by an $m$ -set and the positions of $p_{s}$ and $q_{t}$ (that is, $s$ and $t$ ) in $\vec{p}$ and $\vec{q}$ , respectively.

Let us count the total number of curves such as those described by (16)–(20). One can harvest $\tfrac{d!}{m!(d-m)!}$ distinct $m$ -sets from a single column of $\varTheta$ , and there are $d$ columns in total. The probability $p_{s}$ can take ${d-m}$ separate places in $\vec{p}$ of (17) for distinct $s$ , and $q_{t}$ can take ${d-1}$ separate places in $\vec{q}$ of (18) for distinct $t$ . Thus we have $(d-m)(d-1)$ individual $m$ -parametric curves with a single $m$ -set. Since ${1\leq m\leq d-1}$ , we collect

$$\sum_{m=1}^{d-1}d\,\frac{d!}{m!(d-m)!}\,(d-m)(d-1)=d^{2}(d-1)\big(2^{d-1}-1\big)\qquad(33)$$

number of curves, where each $m$ -set is made of angles from a column of $\varTheta$ .

We secure the same number if we consider rows, rather than columns, to build an $m$ -set and then a curve such as given by (25)–(29). For ${m=1}$ , every $m$ -set is a part of a row as well as a part of a column. So, to avoid double counting errors, we take the cases ${m=1}$ and ${m>1}$ separately. In total, there are

$$d^{2}(d-1)\big[2^{d}-(d+1)\big]\qquad(34)$$

number of parametric curves for a qudit.
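The counting argument above can be checked by direct enumeration. The following sketch assumes only the rules stated in the text: $\tfrac{d!}{m!(d-m)!}$ $m$ -sets per column (or row), $d$ columns (or rows), ${(d-m)(d-1)}$ curves per $m$ -set, and the ${m=1}$ sets counted once because a single angle belongs to both a row and a column. The function names are hypothetical.

```python
from itertools import combinations

def curves_from_columns(d, m):
    """Number of m-parametric curves whose m-sets are taken from columns of Theta."""
    m_sets_per_column = sum(1 for _ in combinations(range(d), m))  # d! / (m! (d-m)!)
    return d * m_sets_per_column * (d - m) * (d - 1)

def total_curves(d):
    """Total number of parametric curves, with the m = 1 case counted only once."""
    total = curves_from_columns(d, 1)  # a 1-set is part of a row and of a column
    for m in range(2, d):
        total += 2 * curves_from_columns(d, m)  # columns and rows give the same count
    return total

def closed_form(d):
    """Closed-form total quoted in the text: d^2 (d-1) [2^d - (d+1)]."""
    return d * d * (d - 1) * (2 ** d - (d + 1))

for d in range(2, 8):
    print(d, total_curves(d), closed_form(d))
```

For a qubit the enumeration returns four curves, in agreement with the qubit discussion later in the text.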

If one adopts a suitable concave function ${\mathsf{u}(\vec{p},\vec{q}\,)}$ on the combined space $\bm{\omega}$ to estimate the uncertainty, then its absolute minimum will occur only at the parametric curves (see Theorem ${3.4.7}$ and Appendix A.3 in Niculescu93 ). So ultimately one needs to find absolute minima of, at most, $d^{2}(d-1)[2^{d}-(d+1)]$ functions, each of a single variable [for example, see (42)]. Then the smallest minimum will be the lower bound ${\mathsf{c}\leq\mathsf{u}}$ in an UR. This task can be easily completed with a regular computer. In the next two sections, we discuss certain concave as well as convex functions on $\bm{\omega}$ .

III Uncertainty measures and relations

If $u$ quantifies the uncertainty—about the outcomes $a_{i}$ when a qudit is measured in the basis $\mathcal{B}_{a}$ of (1)—then $u$ should be a concave function of ${\vec{p}\in\Omega_{a}}$ . It is because mixing probability distributions, $\vec{p}\,^{\prime}$ and $\vec{p}\,^{\prime\prime}$ as ${\lambda\,\vec{p}\,^{\prime}+(1-\lambda)\vec{p}\,^{\prime\prime}=\vec{p}}$ with ${\lambda\in[0,1]}$ , can only increase uncertainty ${\lambda\,u(\vec{p}\,^{\prime})+(1-\lambda)u(\vec{p}\,^{\prime\prime})\leq u(\vec{p}\,)}$ (see Chapter 9 in Peres93 ). In this regard, every mixed state, say ${\lambda|\psi^{\prime}\rangle\langle\psi^{\prime}|+(1-\lambda)|\psi^{\prime\prime}\rangle\langle\psi^{\prime\prime}|=\rho_{\text{mix}}}$ , has more uncertainty.

So, here, we adopt a real-valued smooth concave function

$$u(\vec{p}\,):=\sum_{i=1}^{d}\sqrt{p_{i}}\qquad(35)$$

as an uncertainty measure. It is associated with the Tsallis entropy Tsallis88 ${S_{\nicefrac{{1}}{{2}}}(\vec{p}\,)=2K(u(\vec{p}\,)-1)}$ , where $K$ is the Boltzmann constant. To prove that ${u(\vec{p}\,)}$ is a concave function on $\Omega_{a}$ , it is sufficient to demonstrate that the ${(d-1)\times(d-1)}$ Hessian matrix, the symmetric matrix of second-order partial derivatives of $u$ , is negative semidefinite at every point in $\Omega_{a}$ (see Theorem ${4.5}$ in Rockafellar70 ). At an interior point (where all ${p_{i}>0}$ ) of $\Omega_{a}$ , the entry in the $k$ th row and $l$ th column ${(1\leq l,k\leq d-1)}$ of the Hessian matrix is

[TABLE]

where ${p_{d}=1-\sum\nolimits_{i=1}^{d-1}p_{i}}$ and $\delta_{lk}$ is the Kronecker delta function. These entries indeed provide a negative definite matrix, thus ${u(\vec{p}\,)}$ is strictly concave in the interior of $\Omega_{a}$ . At a boundary point (where one or more ${p_{i}=0}$ ), all the partial derivatives in a certain row(s) and column(s) of the Hessian matrix become zero, thus the matrix turns out to be negative semidefinite and ${u(\vec{p}\,)}$ to be a concave function. By the way, ${u(\vec{p}\,)}$ can be employed for entanglement detection (see Remark 2 in Sehrawat16 ).

If the state vector ${|\psi\rangle}$ is an equal superposition of all the kets in $\mathcal{B}_{a}$ or the state is completely mixed, then all the outcomes $a_{i}$ will be equally probable: ${p_{i}=\tfrac{1}{d}}$ for every ${1\leq i\leq d}$ is the center of $\Omega_{a}$ , where ${u(\vec{p}\,)}$ reaches its maximum value $\sqrt{d}$ . Whereas, only in the case of a definite outcome—that is when ${|\psi\rangle\langle\psi|=|a_{i}\rangle\langle a_{i}|}$ , and then ${p_{i}=1}$ for a particular $i$ —we have the minimum uncertainty ${u(\vec{p}\,)=1}$ as it should be. Note that ${p_{i}=1}$ characterizes an extreme point of $\Omega_{a}$ .
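A short numerical illustration of these properties follows, taking ${u(\vec{p}\,)=\sum_{i}\sqrt{p_{i}}}$ (the form consistent with the stated range ${[1,\sqrt{d}\,]}$ and with ${u=u_{\gamma}}$ at ${\gamma=\tfrac{1}{2}}$ ). The test vectors and the mixing weight are arbitrary choices made for illustration.

```python
import math

def u(p):
    """Uncertainty measure u(p) = sum_i sqrt(p_i) on a probability vector p."""
    return sum(math.sqrt(x) for x in p)

d = 4
uniform = [1.0 / d] * d            # center of the simplex
vertex = [1.0] + [0.0] * (d - 1)   # definite outcome, an extreme point

u_max = u(uniform)                 # expected to equal sqrt(d)
u_min = u(vertex)                  # expected to equal 1

# Concavity: mixing two distributions cannot decrease the uncertainty.
p1 = [0.7, 0.1, 0.1, 0.1]
p2 = [0.1, 0.2, 0.3, 0.4]
lam = 0.3
mixed = [lam * x + (1 - lam) * y for x, y in zip(p1, p2)]
concavity_gap = u(mixed) - (lam * u(p1) + (1 - lam) * u(p2))

print(u_max, u_min, concavity_gap)
```

A nonnegative `concavity_gap` is what the inequality ${\lambda\,u(\vec{p}\,^{\prime})+(1-\lambda)u(\vec{p}\,^{\prime\prime})\leq u(\vec{p}\,)}$ requires for this particular pair of distributions; it is a spot check, not a proof.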

To establish a measure of combined uncertainty for the experimental proposal (2), we take the same function,

$$u(\vec{q}\,):=\sum_{j=1}^{d}\sqrt{q_{j}}\qquad(37)$$

for the $b$ -setting. Like ${u(\vec{p}\,)}$ of (35), ${u(\vec{q}\,)}$ is a concave function on $\Omega_{b}$ with the range ${[1,\sqrt{d}\,]}$ . Now we define our combined uncertainty measure

$$\mathfrak{u}(\vec{p},\vec{q}\,):=u(\vec{p}\,)+u(\vec{q}\,)\qquad(38)$$

on the convex set $\bm{\omega}$ , rather than $\mathbf{\Omega}$ . The sum of two concave functions is concave, so ${\mathfrak{u}}$ is also a concave function.

A mixed quantum state is a convex combination of pure states, the probabilities

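$$p_{i}=\langle a_{i}|\varrho|a_{i}\rangle\quad\text{and}\quad q_{j}=\langle b_{j}|\varrho|b_{j}\rangle\qquad(39)$$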

are linear functions of the state $\varrho$ ( ${0\leq\varrho}$ , ${\text{tr}(\varrho)=1}$ ), and $\bm{\omega}$ is a compact and convex set. As a result, every $(\vec{p},\vec{q}\,)$ associated with any (pure or mixed) quantum state lies in $\bm{\omega}$ . And, because ${\mathfrak{u}}$ is a concave function on $\bm{\omega}$ , our UR given in (40) applies to every state of a qudit. This is also true for the other (un)certainty relations presented in Sec. IV, because in most of them we again have either a concave or a convex function. In (93) and (94), the functions are neither concave nor convex on $\bm{\omega}$ , but the relations are obeyed by every qubit state. By the way, one can check that if ${\varrho=|\psi\rangle\langle\psi|}$ then the Born rule (39) reduces to (3).

The range of ${\mathfrak{u}(\vec{p},\vec{q}\,)}$ and our UR are presented as

[TABLE]

is the global minimum that will occur at the $m$ -parametric curves [given in Sec. II]. In contrast, $\mathfrak{u}$ attains its absolute maximum ${2\sqrt{d}}$ only at the point identified by ${p_{i}=\tfrac{1}{d}=q_{j}}$ for all ${1\leq i,j\leq d}$ . It is called the center of $\bm{\omega}$ , which represents the uniform distribution for both the settings. Now recall from Sec. II that an extreme point of $\mathbf{\Omega}$ , described by ${p_{i}=1=q_{j}}$ , belongs to $\bm{\omega}$ if and only if ${|a_{i}\rangle\langle a_{i}|=|b_{j}\rangle\langle b_{j}|}$ . Only in such a situation, which does not require the bases $\mathcal{B}_{a}$ and $\mathcal{B}_{b}$ to be the same, do we have the trivial lower bound ${\mathfrak{c}=2}$ and thus the UR ${2\leq\mathfrak{u}}$ . A similar statement is made by Deutsch in Deutsch83 . For ${d=2}$ , the trivial case is possible if and only if the two measurement settings are (physically) the same. A nontrivial lower bound ${\mathfrak{c}>2}$ materializes when the settings are completely different, that is when ${r_{ij}<1}$ for every ${1\leq i,j\leq d}$ . So the following analysis is for the nontrivial cases.

To find the lower bound (41) and to establish the UR ${\mathfrak{c}\leq\mathfrak{u}}$ , we write the functional form

[TABLE]

which ${\mathfrak{u}(\vec{p},\vec{q}\,)}$ of (38) acquires on an $m$ -parametric curve specified by (16)–(21). To show that ${\mathfrak{u}}$ of (42) is a concave function of $\beta_{1}$ , we present

[TABLE]

With these derivatives, one can clearly see ${\frac{\partial^{2}\,\mathfrak{u}}{{\partial\beta_{1}}^{2}}<0}$ for ${1<m\leq d-1}$ . Whereas, for ${m=1}$ , one can directly realize ${\frac{\partial^{2}\,\mathfrak{u}}{{\partial\beta_{1}}^{2}}=-\mathfrak{u}<0}$ . This proves that $\mathfrak{u}$ is a (strictly) concave function on every parametric curve. Therefore, its global minimum $\mathfrak{c}$ will always be at the endpoints of the curves. Endpoints of an $m$ -parametric curve are identified by the two limits on a parameter [see (22)–(24) as well as (30)–(32)].

It is shown in Appendix D.4 that, to compute a limit, we always have to solve an equation such as (214), which carries $\textsc{m}$ angles from a column or a row of $\varTheta$ [given in (11)]. Note that we use the small letter ‘ $m$ ’ ${(1\leq m\leq d-1)}$ when we construct a parametric curve with an $m$ -set [see Sec. II] and the capital letter ‘ $\textsc{m}$ ’ ${(2\leq\textsc{m}\leq d)}$ when we compute a limit with an $\textsc{m}$ -set. Essentially, one needs to follow a three-step procedure to compute a limit and then the value of $\mathfrak{u}$ [defined in (38), see also (42)] at the corresponding endpoint of a curve:

[TABLE]

The equation in Step 2 is like Eq. (214) that is solved in Appendix D.4, and every time we take the solution (221) with the + sign. One can observe that $\chi$ and therefore $c_{\textsc{m}}$ are solely determined by the $\textsc{m}$ -set picked in Step 1.

After repeating the three-step procedure for every $\textsc{m}$ -set and for each ${2\leq\textsc{m}\leq d}$ , we collect a set of values ${\{c_{\textsc{m}}\}}$ for all the endpoints. Then, the smallest value in this set will be $\mathfrak{c}$ [defined by (41)], and thus we obtain our UR $\mathfrak{c}\leq\mathfrak{u}$ [presented in (40)]. Since every $c_{\textsc{m}}$ is determined by the entries in the $\varTheta$ -matrix, the lower bound $\mathfrak{c}$ depends only on the measurement bases in (1) and is independent of the quantum state. Besides, to compute $\mathfrak{c}$ , we can employ an ordinary computer, which repeats the three steps of (46) by taking

$$2d\sum_{\textsc{m}=2}^{d}\frac{d!}{\textsc{m}!\,(d-\textsc{m})!}=2d\,\big[2^{d}-(d+1)\big]\qquad(47)$$

number of $\textsc{m}$ -sets one by one. In fact, ${2d\,[\,2^{d}-(d+1)]}$ is the total number of endpoints for a qudit.
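The number of sets over which the three-step procedure must be repeated can be enumerated directly. The sketch below assumes, as in the text, that an M-set consists of M angles taken from a single column or a single row of $\varTheta$ with ${2\leq M\leq d}$ ; the function names are hypothetical.

```python
from itertools import combinations

def count_M_sets(d, M):
    """Number of M-sets drawn from a single column or a single row of Theta."""
    per_line = sum(1 for _ in combinations(range(d), M))  # choices within one column or row
    return 2 * d * per_line                               # d columns plus d rows

def count_endpoints(d):
    """Total number of M-sets for 2 <= M <= d, one per endpoint."""
    return sum(count_M_sets(d, M) for M in range(2, d + 1))

def closed_form(d):
    """Closed form quoted in the text: 2d [2^d - (d+1)]."""
    return 2 * d * (2 ** d - (d + 1))

for d in range(2, 7):
    print(d, count_endpoints(d), closed_form(d))
```

For a qutrit this gives 18 two-sets and 6 three-sets, matching the ${18+6=24}$ values mentioned for ${d=3}$ .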

Although we have the solution (221) for Step 2, it is easy to calculate $\chi$ and $c_{\textsc{m}}$ for ${\textsc{m}=2,d}$ . For a 2-set ${\{\theta_{1},\theta_{2}\}}$ , one can directly realize

[TABLE]

Every endpoint of an ${m=1}$ parametric curve is determined by a set of ${\textsc{m}=2}$ angles [see (22), (23), (30), and (31)]. For a $d$ -set ${\{\theta_{1},\cdots,\theta_{d}\}}$ , that is an entire column or row of $\varTheta$ , we have the total probability ${{\textstyle\sum\nolimits_{l=1}^{d}{\cos\theta_{l}}^{2}}=1}$ . Therefore, we obtain the solution

[TABLE]

For general measurement settings, $\mathfrak{c}$ is easy to compute but difficult to express in an analytic form. Nevertheless, we present it for ${d=2,3,}$ and when the measurement bases in (1) are MUBs Durt10 .

In the case of a qubit, ${d=2}$ , a (un)certainty relation can be stated with the three probabilities $p_{1}$ , $q_{1}$ , and $r_{11}$ , hence we drop their subscripts here and in the next section. Furthermore, all the TIs (13) can now be put together as

[TABLE]

where $\alpha$ , $\beta$ , and $\theta$ are associated with $p$ , $q$ , and $r$ , respectively [through (3), (4), (9), and (10)]. Here only ${m=1}$ parametric curves exist, which are four in total [see with (II)]. To draw an endpoint of a curve, we can use either (48) or (51); both are equal (because ${\theta_{1}+\theta_{2}=\tfrac{\pi}{2}}$ ). There are only four [see (47)] endpoints ${E_{1},\cdots,E_{4}}$ . Next, one can realize that (49) and (52) are also the same for a qubit. Furthermore, $c_{d}$ is even identical for every ${\textsc{m}=2}$ set. It implies that our combined uncertainty function (38) takes the same value at all the four endpoints, thus $\mathfrak{c}=c_{d}=c_{2}$ and

[TABLE]

is an UR for ${d=2}$ . It is also given in Rastegin12 .

Together all the parametric curves—that represent all the extreme points of the combined-probability space $\bm{\omega}$ —can be expressed by an ellipse

[TABLE]

in the case of a qubit. As a special case, the same ellipse also appears in Lenard72 ; Larsen90 ; Kaniewski14 through different routes diff-routes , although our approach is closer to Lenard72 . One can observe that the ellipse turns into a circle for ${\theta=\tfrac{\pi}{4}}$ and into certain line segments for ${\theta=0,\tfrac{\pi}{2}}$ . In Fig. 1, we present a contour plot of ${\mathfrak{u}(p,q)}$ on $\bm{\omega}$ by taking ${r=\tfrac{3}{4}}$ . So ${\theta=\tfrac{\pi}{6}}$ , and one can see that $\bm{\omega}$ is bounded by the ellipse (55). Furthermore, by putting ${\vartheta=0,\theta,\tfrac{\pi}{2},\tfrac{\pi}{2}+\theta}$ in ${(p{\scriptstyle(\vartheta)},q{\scriptstyle(\vartheta)})}$ , we can have the four endpoints ${E_{1},\cdots,E_{4}}$ , respectively.

In the case of ${d=2}$ , there always exists a quantum state for each point in $\bm{\omega}$ , thus ${\bm{\omega}=\bm{\omega}_{\textsc{q}}}$ . For instance, the kets such as (108) and (109) correspond to points on the ellipse (55) by the Born rule (3). In particular, the kets of basis $\mathcal{B}_{a}$ correspond to the points ${\{E_{2},E_{4}\}}$ , and the kets of $\mathcal{B}_{b}$ are related to ${\{E_{1},E_{3}\}}$ . So the lower bound $\mathfrak{c}(r)$ in the UR (54) is achieved (hence, it is a tight UR) only by those state vectors $|\psi\rangle$ that (up to a phase factor) belong to one of the bases in (1). The lower bound takes its largest value ${\sqrt{2}+1}$ when ${r=\tfrac{1}{2}}$ , that is, when the measurement bases are MUBs [see also (58)].
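The qubit relation can be probed numerically. The sketch below assumes that the lower bound takes the value ${1+\sqrt{r}+\sqrt{1-r}}$ , which is the value of the combined uncertainty at a basis ket according to Remark 1 and agrees with the maximum ${\sqrt{2}+1}$ at ${r=\tfrac{1}{2}}$ . The state parametrization, the rotated real $b$ -basis, and the function names are illustrative assumptions.

```python
import cmath
import math
import random

def combined_uncertainty(p1, q1):
    """u(p) + u(q) for a qubit, with p = (p1, 1 - p1) and q = (q1, 1 - q1)."""
    return math.sqrt(p1) + math.sqrt(1.0 - p1) + math.sqrt(q1) + math.sqrt(1.0 - q1)

def qubit_probs(t, phi, theta):
    """Born probabilities p1, q1 for |psi> = (cos t, e^{i phi} sin t).

    Basis a is the computational basis and |b_1> = (cos theta, sin theta),
    an illustrative choice giving r = cos(theta)**2.
    """
    psi = (math.cos(t), cmath.exp(1j * phi) * math.sin(t))
    p1 = min(1.0, abs(psi[0]) ** 2)
    q1 = min(1.0, abs(math.cos(theta) * psi[0] + math.sin(theta) * psi[1]) ** 2)
    return p1, q1

r = 0.75
theta = math.acos(math.sqrt(r))
bound = 1.0 + math.sqrt(r) + math.sqrt(1.0 - r)  # assumed form of the qubit bound

rng = random.Random(1)
samples = [
    combined_uncertainty(*qubit_probs(rng.uniform(0, math.pi), rng.uniform(0, 2 * math.pi), theta))
    for _ in range(2000)
]
at_basis_ket = combined_uncertainty(*qubit_probs(0.0, 0.0, theta))  # |psi> = |a_1>

print(min(samples), at_basis_ket, bound)
```

Random sampling can only support the inequality, not prove it; the basis ket is included to show where the bound is attained.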

An UR is called tight if there exists a quantum state that saturates the UR. In the case of a qubit, all the relations mentioned in this and the next section are tight because ${\bm{\omega}=\bm{\omega}_{\textsc{q}}}$ . For ${d\geq 3}$ , ${\bm{\omega}_{\textsc{q}}\subseteq\bm{\omega}}$ , hence our UR ${\mathfrak{c}\leq\mathfrak{u}}$ is not tight in general.

In the case of ${d=3}$ (qutrit), there are only two kinds of parametric curves (for ${m=1,2}$ ), and two types of endpoints (for ${\textsc{m}=2,3}$ ). So (48) and (51) can specify any endpoint for a qutrit. To compute the lower bound $\mathfrak{c}$ , we have to evaluate the function $c_{2}$ of (49) for every 2-set and $c_{d}$ of (52) for every $d$ -set drawn from the $\varTheta$ -matrix. For ${d=3}$ , there are 18 2-sets and 6 $d$ -sets [see the total in (47)]. Then, the smallest out of the ${18+6=24}$ values will be our $\mathfrak{c}$ . Now let us consider a pair of MUBs Durt10 for a finite dimension $d$ .

If the two bases given in (1) are such that ${r_{ij}=\tfrac{1}{d}}$ for every ${1\leq i,j\leq d}$ [for $r_{ij}$ , see (9)], then they are called MUBs and the measurement settings $a$ and $b$ are designated as complementary Kraus87 . In the case of MUBs, ${\theta_{ij}=\arccos\tfrac{1}{\sqrt{d}}}$ for every $i,j$ , so one can straightforwardly realize

[TABLE]

in Steps 2 and 3 of the three-step procedure (46). One can see that here $\chi$ and $c_{\textsc{m}}$ depend on ${\textsc{m}=2,\cdots,d}$ , not on a particular $\textsc{m}$ -set, because every $\theta$ is the same. Furthermore, $\chi$ decreases, whereas $c_{\textsc{m}}$ increases, with $\textsc{m}$ . Hence the lower bound is

[TABLE]

which does not deliver a tight UR when ${d>2}$ , whereas tight URs Kraus87 ; Maassen88 ; Ballester07 are known for MUBs in a finite $d$ . We close this section with the following remarks.

Remark 1: By the Born rule (3), ${|\psi\rangle=|a_{i}\rangle}$ provides an extreme point, given by ${p_{i}=1}$ and ${\vec{q}=(r_{i1},\cdots,r_{id})}$ , of $\bm{\omega}$ [see (174) and (173) in Appendix D.3]. At this point the combined uncertainty function (38) has the value ${1+\textstyle\sum\nolimits_{j=1}^{d}\sqrt{r_{ij}}\,}$ [see also (52)]. Likewise, ${|\psi\rangle=|b_{j}\rangle}$ gives the combined uncertainty ${1+\textstyle\sum\nolimits_{i=1}^{d}\sqrt{r_{ij}}\,}$ . Now we take the minimum value

[TABLE]

Next, one can easily establish

[TABLE]

The first inequality in (62) comes from (40). The last inequality is due to ${\textstyle\sum\nolimits_{i=1}^{d}\sqrt{r_{ij}}\leq\sqrt{d}}$ and the similar relation where the summation is over index $j$ instead of $i$ . $\mathfrak{c}_{\textsc{q}}$ is the largest lower bound that defines the tight UR ${\mathfrak{c}_{\textsc{q}}\leq\mathfrak{u}(\vec{p},\vec{q}\,)}$ . For $d=2$ , our lower bound ${\mathfrak{c}=\mathfrak{c}_{\textsc{q}}=\mathfrak{c}_{\text{bases}}}$ , and the UR (54) is tight. Whereas, if the two bases in (1) share a ket then $\mathfrak{c}$ turns out to be the trivial bound: ${2=\mathfrak{c}=\mathfrak{c}_{\textsc{q}}=\mathfrak{c}_{\text{bases}}}$ . One can use (62) to avoid errors while calculating $\mathfrak{c}$ .
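The quantity $\mathfrak{c}_{\text{bases}}$ is straightforward to evaluate from the matrix $R$ . The sketch below reads it as the smallest of the values ${1+\sum_{j}\sqrt{r_{ij}}}$ and ${1+\sum_{i}\sqrt{r_{ij}}}$ quoted in Remark 1, which is an assumption about the exact form of the definition; the function name and the two example matrices are illustrative.

```python
import math

def c_bases(R):
    """Smallest combined uncertainty attained by the kets of the two bases.

    Row i gives the value for |psi> = |a_i>, column j the value for |psi> = |b_j>.
    """
    d = len(R)
    row_values = [1.0 + sum(math.sqrt(R[i][j]) for j in range(d)) for i in range(d)]
    col_values = [1.0 + sum(math.sqrt(R[i][j]) for i in range(d)) for j in range(d)]
    return min(row_values + col_values)

# A pair of MUBs for d = 3: every r_ij equals 1/3.
R_mub = [[1.0 / 3.0] * 3 for _ in range(3)]

# Two bases sharing one ket (r_11 = 1), with the rest chosen for illustration.
R_shared = [[1.0, 0.0, 0.0], [0.0, 0.5, 0.5], [0.0, 0.5, 0.5]]

print(c_bases(R_mub), c_bases(R_shared))
```

In the MUB example the value equals the upper limit ${1+\sqrt{d}}$ of the chain (62), and in the shared-ket example it drops to the trivial value 2.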

Remark 2: The function ${H_{\nicefrac{{1}}{{2}}}(\vec{p}\,)=2\log u(\vec{p}\,)}$ is the Rényi entropy Renyi61 of order $\tfrac{1}{2}$ . Using (36), one can realize that ${H_{\nicefrac{{1}}{{2}}}(\vec{p}\,)}$ is a concave function on $\Omega_{a}$ , hence the sum

[TABLE]

is concave on $\bm{\omega}$ . Taking (43)–(45), one can confirm that the sum is also concave on each of the parametric curves, therefore its absolute minimum will be at the endpoints. We repeat the three-step procedure (46) for every $\textsc{m}$ -set, where in the third step we now need to compute

[TABLE]

instead of $c_{\textsc{m}}$ . In this way we can obtain a UR based on the combined entropy (64) for any pair of measurement settings. Analogous to (49), (52), and (57), here we have

[TABLE]

respectively. With these, one can directly get URs for a qubit, a qutrit, and a pair of MUBs, just as above. For a qubit, we express the corresponding tight UR (also obtained in Rastegin12 )

[TABLE]

in terms of the product ${u(p)u(q)}$ . In this case, the product turns out to be a concave function not only on $\bm{\omega}$ but also on each of the four parametric curves. Its absolute minimum, given on the left-hand side of (69), occurs at all the four endpoints ${E_{1},\cdots,E_{4}}$ , and the absolute maximum $2$ at the center [denoted by ${\star}$ in Fig. 1] of $\bm{\omega}$ .

IV Other (un)certainty measures and relations

The negative of a concave function is a convex function, hence a suitable convex function can be taken as a measure of certainty, rather than uncertainty. Here we present other popular measures of (un)certainty and obtain the associated (un)certainty relations for ${d=2}$ by finding the absolute minimum (for concave) and maximum (for convex) on the ellipse (55). We want to emphasize that all the relations given in this paper for a qubit are already known, thanks to Larsen90 ; Busch14 ; Garrett90 ; Sanchez-Ruiz98 ; Ghirardi03 ; Bosyk12 ; Vicente05 ; Zozor13 ; Deutsch83 ; Maassen88 ; Rastegin12 , through different methods. The following analysis merely shows that they all can be obtained from the TIs (53) that characterize the ellipse. Recall that one can have the same ellipse from Lenard72 ; Larsen90 ; Kaniewski14 .

One can always construct Hermitian operators, for example

[TABLE]

by assigning real numbers to the measurement outcomes $a_{i}$ and $b_{j}$ for the two settings specified by (1). Then ${\textbf{a}:=\{a_{i}\}_{i=1}^{d}}$ and ${\textbf{b}:=\{b_{j}\}_{j=1}^{d}}$ are the sets of eigenvalues of $A$ and $B$ , respectively. With (3) and (70), one can perceive that the squared standard deviations

[TABLE]

are functions of the probabilities as well as the eigenvalues.

Taking ${p_{d}=1-\textstyle\sum\nolimits_{i=1}^{d-1}p_{i}}$ , like the derivatives (36) of ${u(\vec{p}\,)}$ , we get the second-order partial derivatives

[TABLE]

of the function (71) for ${1\leq k,l\leq d-1}$ . One can validate that the Hessian matrix, made of the derivatives (73), is a negative semidefinite matrix for any set $\textbf{a}$ of eigenvalues. Thus, ${{\Delta(\textbf{a},\vec{p}\,)}^{2}}$ is a concave function on $\Omega_{a}$ (see Theorem ${4.5}$ in Rockafellar70 ). Likewise, ${{\Delta(\textbf{b},\vec{q}\,)}^{2}}$ is a concave function on $\Omega_{b}$ . Hence, analogous to ${\mathfrak{u}(\vec{p},\vec{q}\,)}$ of (38), the sum

[TABLE]

establishes a concave, thus uncertainty, measure on the combined space $\bm{\omega}$ . In Maccone14 , URs are presented by taking a sum such as (74), however, here the approach is different.

In the case of a qubit ( ${d=2}$ ), every measurement setting can also be described by a three-component real vector. So, we designate the two settings [see (1)] by certain unit vectors $\widehat{a}$ and $\widehat{b}$ and then construct the Hermitian operators ${A=\widehat{a}\cdot\vec{\sigma}}$ and ${B=\widehat{b}\cdot\vec{\sigma}}$ with the dot product, where $\vec{\sigma}$ is the Pauli vector operator. One can verify that ${A^{2}=I=B^{2}}$ , therefore the eigenvalues are: ${\textbf{a}=\{\pm 1\}=\textbf{b}}$ . Suppose the kets ${|a_{1}\rangle}$ and ${|b_{1}\rangle}$ of the two bases [in (1)] are associated with the eigenvalue ${+1}$ of $A$ and $B$ , respectively. Now one can easily derive the relation

[TABLE]

between the three kinds of inner products. From Sec. III, let us recall that we only require three probabilities $p_{1}$ , $q_{1}$ , and $r_{11}$ to express a (un)certainty relation for ${d=2}$ . So, there is no further need for the subscripts. With all the above considerations, $\bm{\Delta}^{\text{sq}}$ of (74) turns out to be the function

[TABLE]

of $p$ and $q$ .

We plot ${\bm{\Delta}^{\text{sq}}}$ of (76) on ${\bm{\omega}}$ in Fig. 2 by taking ${r=\tfrac{1}{4}}$ . Since ${\bm{\Delta}^{\text{sq}}}$ is a concave function on ${\bm{\omega}}$ , its absolute minimum will be at the four parametric curves, which are jointly described by the ellipse (55) and by their endpoints ${E_{1},\cdots,E_{4}}$ . To compute the minimum, first, we need to represent ${\bm{\Delta}^{\text{sq}}}$ as a function of a parameter, like $\mathfrak{u}$ in (42), on each curve. Then, we have to find the critical points of ${\bm{\Delta}^{\text{sq}}}$ . Here we obtain four critical points ${F_{1},\cdots,F_{4}}$ , one on each curve, that are depicted by the bullets ${(\bullet)}$ in Fig. 2. By putting $\vartheta=\tfrac{\theta}{2},\tfrac{\theta}{2}+\tfrac{\pi}{4},\tfrac{\theta}{2}+\tfrac{2\pi}{4},\tfrac{\theta}{2}+\tfrac{3\pi}{4}$ in ${(p{\scriptstyle(\vartheta)},q{\scriptstyle(\vartheta)})}$ of (55), one can have ${F_{1},\cdots,F_{4}}$ , in that order. Note that the $F$ -points are not the endpoints ${E_{1},\cdots,E_{4}}$ , which are shown only in Fig. 1, not in Fig. 2.

The function ${\bm{\Delta}^{\text{sq}}}$ of (76) takes the value $2r$ at both the points ${\{F_{2},F_{4}\}}$ and takes the value ${2(1-r)}$ at ${\{F_{1},F_{3}\}}$ . So the global minimum is

[TABLE]

and thus we obtain a tight UR, like (54). One can confirm that the lower bound is

[TABLE]
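The minimization on the ellipse can be reproduced with a simple scan. The sketch below rests on two assumptions that are not spelled out in the surviving text: the variance of a ${\pm 1}$ -valued observable is ${4p(1-p)}$ , so that ${\bm{\Delta}^{\text{sq}}=4p(1-p)+4q(1-q)}$ , and the ellipse is parametrized as ${p=\cos^{2}(\vartheta-\theta)}$ , ${q=\cos^{2}\vartheta}$ , which is one choice consistent with the endpoint values ${\vartheta=0,\theta,\tfrac{\pi}{2},\tfrac{\pi}{2}+\theta}$ quoted for ${E_{1},\cdots,E_{4}}$ . Function names and the grid size are illustrative.

```python
import math

def delta_sq(p, q):
    """Sum of the two variances for +/-1-valued observables (assumed form)."""
    return 4.0 * p * (1.0 - p) + 4.0 * q * (1.0 - q)

def ellipse_point(vartheta, theta):
    """One parametrization of the boundary ellipse (an assumption, see lead-in)."""
    return math.cos(vartheta - theta) ** 2, math.cos(vartheta) ** 2

r = 0.25
theta = math.acos(math.sqrt(r))

# Scan the whole ellipse on a fine grid and take the smallest value.
n = 20000
scan_min = min(delta_sq(*ellipse_point(2.0 * math.pi * k / n, theta)) for k in range(n))

# Values at the four critical points F_1, ..., F_4.
F_values = [delta_sq(*ellipse_point(theta / 2.0 + k * math.pi / 4.0, theta)) for k in range(4)]

print(scan_min, F_values, 2.0 * r, 2.0 * (1.0 - r))
```

With these assumptions the values at ${\{F_{1},F_{3}\}}$ and ${\{F_{2},F_{4}\}}$ come out as ${2(1-r)}$ and $2r$ , and the scanned minimum agrees with the smaller of the two.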

Remark 3: The standard deviation $\Delta\bm{(}\pm 1,p\bm{)}$ is a concave function of $p$ , hence the sum $\Delta\bm{(}\pm 1,p\bm{)}+\Delta\bm{(}\pm 1,q\bm{)}$ is a concave function on $\bm{\omega}$ . As a result, we have another tight uncertainty relation

[TABLE]

One can check that the sum reaches its absolute minimum value at all the endpoints ${E_{1},\cdots,E_{4}}$ , and has its maximum value $2$ at the center of $\bm{\omega}$ . Both the tight URs (77) and (79) are known due to Busch14 . A quantum state that saturates a tight UR is called its minimum uncertainty state. Since the $E$ -points and the $F$ -points are not the same, in general, the set of minimum uncertainty states is different for the two URs (77) and (79) based on the standard deviation. Note that we always get the trivial lower bound ${0\leq\Delta(\textbf{a},\vec{p}\,)\Delta(\textbf{b},\vec{q}\,)}$ for the product of standard deviations, and this bound can be reached by any ket that belongs to either of the bases given in (1).

Next, the Shannon entropy Shannon48

[TABLE]

is arguably the most famous measure of uncertainty at present. It is superior to the standard deviation ${\Delta(\textbf{a},\vec{p}\,)}$ Bialynicki11 ; Coles17 because it only depends on $\vec{p}$ , not on the eigenvalues. One can show that ${H(\vec{p}\,)\in[0,\log d]}$ , and it is a concave function on $\Omega_{a}$ with the Hessian matrix composed of the second-order derivatives

[TABLE]

where ${p_{d}=1-\textstyle\sum\nolimits_{i=1}^{d-1}p_{i}}$ . Considering the same function for the $b$ -setting, that is $H(\vec{q}\,)$ , one can formulate a combined uncertainty measure by the sum ${H(\vec{p}\,)+H(\vec{q}\,)}$ and then produce an entropy UR Deutsch83 ; Kraus87 ; Maassen88 . Such URs are reviewed in Wehner10 ; Bialynicki11 ; Coles17 . For ${d=2}$ , the tight entropy UR is achieved in Garrett90 ; Ghirardi03 (see also Sanchez-Ruiz98 ), and we can directly import all their results here. In fact, Eq. (7) in Garrett90 and Eq. (2.4) in Ghirardi03 are ${H(p)+H(q)}$ on the ellipse (55), and they found the absolute minimum of ${H(p)+H(q)}$ on the ellipse. In Ghirardi03 , all the results are given in terms of angles between the real unit vectors, which are related to the angles between kets through (75).
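The minimization of ${H(p)+H(q)}$ on the ellipse can be sketched with a grid scan. As an assumption, the ellipse is parametrized by ${p=\cos^{2}(\vartheta-\theta)}$ , ${q=\cos^{2}\vartheta}$ , a choice consistent with the endpoint parameters quoted for ${E_{1},\cdots,E_{4}}$ ; natural logarithms are used, and the function names are hypothetical. For complementary settings ( ${r=\tfrac{1}{2}}$ ) the scan should return $\log 2$ , the known Maassen and Uffink value for a qubit.

```python
import math

def binary_entropy(p):
    """Shannon entropy (natural log) of the distribution (p, 1 - p)."""
    return -sum(x * math.log(x) for x in (p, 1.0 - p) if x > 0.0)

def entropy_sum_on_ellipse(vartheta, theta):
    """H(p) + H(q) at the ellipse point with parameter vartheta (assumed parametrization)."""
    p = math.cos(vartheta - theta) ** 2
    q = math.cos(vartheta) ** 2
    return binary_entropy(p) + binary_entropy(q)

def minimum_entropy_sum(r, n=20000):
    """Grid estimate of the minimum of H(p) + H(q) over the ellipse for a given r."""
    theta = math.acos(math.sqrt(r))
    return min(entropy_sum_on_ellipse(2.0 * math.pi * k / n, theta) for k in range(n))

min_mub = minimum_entropy_sum(0.5)
print(min_mub, math.log(2.0))
```

A grid scan is a crude substitute for the analytic treatment in the cited works; it is meant only to show how the problem reduces to a single-variable minimization.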

We can choose

[TABLE]

as another (un)certainty measure, which is closely related to the Tsallis Tsallis88 and Rényi Renyi61 entropies of order $\gamma$. One can prove that the Hessian matrix with entries

[TABLE]

${1\leq k,l\leq d-1}$ , is a negative and positive semidefinite matrix for ${0<\gamma\leq 1}$ and ${1\leq\gamma<\infty}$ , respectively. It confirms that ${u_{\gamma}(\vec{p}\,)}$ is a concave (uncertainty) and convex (certainty) measure when ${0<\gamma\leq 1}$ and ${1\leq\gamma<\infty}$ , respectively. A similar observation is made in Luis11 ; Rastegin12 . In fact, our uncertainty measure ${u(\vec{p}\,)}$ of (35) is ${u_{\gamma}(\vec{p}\,)}$ with the exponent ${\gamma=\tfrac{1}{2}}$ . Furthermore, the range of ${u_{\gamma}(\vec{p}\,)}$ is ${[1,d^{1-\gamma}]}$ if ${\gamma\leq 1}$ and is ${[d^{1-\gamma},1]}$ if ${1\leq\gamma}$ . When ${\gamma=1}$ , ${u_{\gamma}(\vec{p}\,)=1}$ for every $\vec{p}\in\Omega_{a}$ due to Eq. (5), thus ${u_{1}}$ is not a genuine (un)certainty measure.
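The concavity, convexity, and range statements can be spot-checked numerically. The sketch below assumes that the measure has the form ${u_{\gamma}(\vec{p}\,)=\sum_{i}p_{i}^{\gamma}}$, which is consistent with the stated range and with ${u_{1}=1}$ but is not copied from the displayed definition; the helper names are hypothetical. It tests midpoint concavity or convexity on random probability vectors, which is a sanity check and not a proof.

```python
import random


def u_gamma(p, gamma):
    # Assumed form of the measure: sum of p_i ** gamma over the vector.
    return sum(x ** gamma for x in p)


def random_probability_vector(d, rng):
    """Draw a random point of the probability simplex (not uniformly)."""
    weights = [rng.random() for _ in range(d)]
    total = sum(weights)
    return [w / total for w in weights]


def midpoint_gap(p, q, gamma):
    """u(midpoint) minus the mean of u(p) and u(q).

    Nonnegative for a concave function, nonpositive for a convex one.
    """
    mid = [(a + b) / 2 for a, b in zip(p, q)]
    return u_gamma(mid, gamma) - (u_gamma(p, gamma) + u_gamma(q, gamma)) / 2
```

With ${\gamma=\tfrac{1}{2}}$ the gap should never be negative, and with ${\gamma=2}$ it should never be positive, up to rounding.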

Like before, one can establish a (un)certainty relation with the sum $u_{\gamma}(\vec{p}\,)+u_{\gamma}(\vec{q}\,)$ . For $\gamma=2$ , in the case of $d=2$ , we obtain

[TABLE]

as a tight certainty relation, which is also given in Larsen90 for ${\tfrac{1}{2}\leq r}$. Owing to (84), one can immediately derive (85) from the UR (77). Wherever $\bm{\Delta}^{\text{sq}}$ of (76) reaches its absolute minimum (uncertainty) on $\bm{\omega}$, the function (84) achieves its global maximum (certainty)

[TABLE]

The certainty measure (84) hits its absolute minimum 1 at the center of $\bm{\omega}$ [depicted by the star ${(\star)}$ in Figs. 1 and 2].

Remark 4: One can have another tight certainty relation

[TABLE]

where the product of certainty measures is used. The relation (87) is presented in Larsen90 for ${\tfrac{1}{2}\leq r}$. One can verify that ${u_{2}(p)\,u_{2}(q)}$ is a convex function on $\bm{\omega}$. Therefore, its absolute maximum [given in (87)] will be on the ellipse [specified by (55)], and the global minimum $\tfrac{1}{4}$ will be at the center of $\bm{\omega}$. The product-function reaches its upper bound on the $F$-points. By applying the negative of the logarithm on both sides of the inequality (87), we get the corresponding tight UR, achieved in Bosyk12, in terms of the collision entropy (that is, the Rényi entropy Renyi61 of order $2$).

Lastly, we pick the function

[TABLE]

that defines a norm on $\mathbb{R}^{d}$ if we replace $p_{i}$ with ${|p_{i}|}$ . Since every $p_{i}$ follows (6), the modulus sign is not shown in (88). Every norm is a convex function, so ${u_{\textrm{max}}}$ can be considered as a certainty measure on $\Omega_{a}$ ; ${u_{\textrm{max}}(\vec{p}\,)\in\big{[}\tfrac{1}{d}\,,1\big{]}}$ for every ${\vec{p}\in\Omega_{a}}$ . Note that ${u_{\textrm{max}}(\vec{p}\,)}$ is not differentiable everywhere in $\Omega_{a}$ . Nevertheless, we can assemble a combined certainty measure with the sum ${u_{\textrm{max}}(\vec{p}\,)+u_{\textrm{max}}(\vec{q}\,)}$ on $\bm{\omega}$ .

In the case of ${d=2}$ , the function ${u_{\textrm{max}}(p)+u_{\textrm{max}}(q)}$ is equal to

[TABLE]

The limits on ${p,q}$ stated in (89) divide $\bm{\omega}$ —that is an elliptical region [see Figs. 1 and 2]—into four quadrants. The function ${u_{\textrm{max}}(p)+u_{\textrm{max}}(q)}$ is differentiable in each of the quadrants. Furthermore, since it is a convex function on $\bm{\omega}$ , its global maximum will be at the ellipse (55). Here we discover four critical points, one in each quadrant on the ellipse, where the combined function takes a maximum value. In fact, these four points are the same ${F_{1},\cdots,F_{4}}$ exhibited in Fig. 2.

The combined measure acquires the value ${1+\sqrt{1-r}}$ at both ${F_{2},F_{4}}$ and reaches the value $1+\sqrt{r}$ at both ${F_{1},F_{3}}$ . Thus, like (85), we get the tight certainty relation

[TABLE]

for a qubit. And, the absolute maximum (upper bound) is given by

[TABLE]

analogous to (86). Besides, ${u_{\textrm{max}}(p)+u_{\textrm{max}}(q)}$ has its global minimum 1 at the center of $\bm{\omega}$ [exhibited by the star ${(\star)}$ in Figs. 1 and 2].
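The values quoted above for the $F$-points suggest the upper bound ${1+\max\{\sqrt{r},\sqrt{1-r}\}}$, which can be compared against a direct scan of the ellipse. The following sketch again assumes the parametrization ${p=\cos^{2}\phi}$, ${q=\cos^{2}(\phi-\theta)}$ with ${r=\cos^{2}\theta}$ for the ellipse (55); the function names are hypothetical and the grid search is only a numerical check.

```python
import math


def u_max(p):
    """Largest entry of the qubit probability vector (p, 1 - p)."""
    return max(p, 1.0 - p)


def max_certainty_sum(theta, steps=20000):
    """Grid search for the maximum of u_max(p) + u_max(q) on the ellipse."""
    best = 0.0
    for k in range(steps + 1):
        phi = math.pi * k / steps
        # Assumed ellipse parametrization, with r = cos^2(theta).
        p = math.cos(phi) ** 2
        q = math.cos(phi - theta) ** 2
        best = max(best, u_max(p) + u_max(q))
    return best


def closed_form(theta):
    """1 + max(sqrt(r), sqrt(1 - r)), read off from the values at the F-points."""
    r = math.cos(theta) ** 2
    return 1.0 + max(math.sqrt(r), math.sqrt(max(0.0, 1.0 - r)))
```

For several values of $\theta$ in ${[0,\tfrac{\pi}{2}]}$ the two functions agree to within the grid resolution.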

The certainty relation (90) is captured in Vicente05 using the inequality

[TABLE]

Instead of TIs (53), for a qubit, all the tight relations (54), (69), (77), (79), (85), (87), (90), (93), (94), and the entropy UR given in Garrett90 ; Sanchez-Ruiz98 ; Ghirardi03 can be obtained with (92). In fact, inequality (92), that is ${\min_{ij}\theta_{ij}\leq\min_{i}\alpha_{i}+\min_{j}\beta_{j}}$, can be produced from $d^{2}$ TIs (13), and it is weaker than the TIs: all those ${(\vec{p},\vec{q}\,)\in\mathbf{\Omega}}$ that are bounded by (92) rather than (13) constitute a bigger combined-probability space.

Remark 5: One can confirm that the product ${u_{\textrm{max}}(p)\,u_{\textrm{max}}(q)}$ is neither a concave nor a convex function on $\bm{\omega}$ (for a similar observation, see Maassen88 ), so it is not clear to us whether or not we can take it as a good combined-(un)certainty measure for every qubit’s state. It also shows that the product of two convex (concave) functions is not necessarily a convex (concave) function. By computing the gradient of ${u_{\textrm{max}}(p)\,u_{\textrm{max}}(q)}$ in each of the four quadrants, one can see that the function reaches its global minimum $\tfrac{1}{4}$ at the center of $\bm{\omega}$ and reaches its global maximum (on the ellipse) at the $F$-points. Hence, we have the tight relation

[TABLE]

which is reported in Maassen88 (and implicitly appears in Deutsch83 ). In fact, for ${d=2}$, the ket given by Eq. (11) in Deutsch83 is the ket (108) with ${\beta=\tfrac{\theta}{2}}$ and ${\nu=0}$, and the ket corresponds to the point $F_{1}$. By applying the negative of the logarithm on both sides of the inequality (93), one can express this relation in terms of the min-entropy Mandayam10. The min-entropy ${H_{\text{min}}(q):=-\log\bm{(}u_{\textrm{max}}(q)\bm{)}}$ is the smallest in the family of Rényi entropies Renyi61, and it is neither a concave nor a convex function on the interval ${[0,1]}$. As above, using the min-entropy, one can have another tight relation

[TABLE]

that is also given in Maassen88; recall that ${H_{\nicefrac{{1}}{{2}}}(p)=2\log\bm{(}u(p)\bm{)}}$. The function ${H_{\nicefrac{{1}}{{2}}}(p)+H_{\text{min}}(q)}$ always takes its global minimum at the endpoints $E_{2}$ and $E_{4}$ and takes its absolute maximum ${2\log 2}$ at the center [shown in Fig. 1] of $\bm{\omega}$. In Zozor13, a general expression for the tight lower bound of a sum of Rényi entropies is given, which is basically the minimization of the sum on the ellipse.

V Conclusion and outlook

Taking a pure quantum state for a qudit, we present TIs (13) and then the combined-probability space $\bm{\omega}$ for a general pair of measurement settings. The combined space is a compact and convex set in $\mathbb{R}^{2d}$ , and all its extreme points are represented by the $m$ -parametric curves, ${1\leq m\leq d-1}$ . These curves are determined by the two settings ( $\varTheta$ -matrix) and are sufficient to generate the whole $\bm{\omega}$ as well as to provide a (un)certainty relation.

One can pick some suitable concave and convex functions on $\bm{\omega}$ to quantify the uncertainty and certainty, respectively. Subsequently, one can establish an uncertainty (a certainty) relation by finding the absolute minimum (maximum) of a function at the parametric curves. Due to the parametric curves, the formulation of a (un)certainty relation becomes a single-parameter optimization problem.

Particularly for the uncertainty measures (38) and (64), the absolute minima can always be easily computed by repeating the three-step procedure given in Sec. III with every m-set, ${2\leq\textsc{m}\leq d}$ , built with entries in the $\varTheta$ -matrix. And, thus, one can enjoy the corresponding URs for any pair of measurement settings. For the other functions, one needs to find all the critical points on the curves first and then the absolute extremum at those points. That is, still, much easier than searching the extremum on the whole space. In each case, the extremum—that is a lower (upper) bound on an uncertainty (certainty) measure—only depends on the measurement settings, not on a quantum state. Every (pure or mixed) state of a qudit provides a point in $\bm{\omega}$ by the Born rule and respects every (un)certainty relation presented in this write-up.

In the case of a qubit, ${d=2}$, we show that many known tight (un)certainty relations, owing to Larsen90 ; Busch14 ; Garrett90 ; Sanchez-Ruiz98 ; Ghirardi03 ; Bosyk12 ; Vicente05 ; Zozor13 ; Deutsch83 ; Maassen88 ; Rastegin12 , can be derived from the TIs (53). These TIs define an ellipse that represents all the parametric curves, and each point on the ellipse (and in $\bm{\omega}$ ) corresponds to a qubit’s state, thus we have tight relations. The same ellipse also emerges in Lenard72 ; Larsen90 ; Kaniewski14 as a special case. For a pair of measurement settings on a qubit, it seems that the TIs (13) and the results in Lenard72 ; Larsen90 ; Kaniewski14 ; Landau61 provide more fundamental QCs than the tight (un)certainty relations.

TIs (13) do not provide all possible QCs when the dimension ${d>2}$ , hence there are still some points in $\bm{\omega}$ that correspond to no quantum state, and our URs given in Sec. III are not tight in general. However, all our (un)certainty relations are built on the fact that ‘every point outside of $\bm{\omega}$ is, surely, not associated with any quantum state’. One can include other QCs, namely TIs (12), then the domain $\bm{\omega}$ of a (un)certainty function will be smaller. Consequently, better bounds and finer (un)certainty relations can be achieved. To get a tight bound, in the case of general settings and ${d>2}$ , is a challenging task. Tight URs are only known in some special cases: position-momentum Weyl32 , MUBs Kraus87 ; Maassen88 ; Larsen90 ; Sanchez-Ruiz95 ; Ballester07 ; Mandayam10 , and a qubit Larsen90 ; Busch14 ; Garrett90 ; Sanchez-Ruiz98 ; Ghirardi03 ; Bosyk12 ; Vicente05 ; Zozor13 ; Deutsch83 ; Maassen88 ; Rastegin12 .

URs have numerous applications in different strands of physics. Recently, these are employed for certain quantum information processing tasks such as the cryptography Mandayam10 and the entanglement detection Vicente05 ; Hofmann03 ; Guhne04 ; Giovannetti04 ; Guhne04b . As our (un)certainty relations arise solely from TIs, one can directly appoint TIs (12) as genuine QCs for such a job. Furthermore, in quantum state estimation Paris04 , one collects data by applying different measurement settings, thus realizes scheme (2) in a laboratory. Then, $\rho_{\text{est}}$ is constructed with the data. There one needs to confirm that the estimated $\rho_{\text{est}}$ represents a legitimate quantum state. Again TIs (12) could be utilized for such a test, for instance, one can firstly check whether the estimated ${(\vec{p}_{\text{est}},\vec{q}_{\text{est}})}$ follows all the TIs or not.

Acknowledgements.

I am very grateful to Arvind for stimulating discussions and helpful comments on the manuscript. I thank Arun Kumar Pati for bringing Ref. Landau61 to my attention and Jędrzej Kaniewski for explaining their work Kaniewski14 and making me aware of it.

Appendix A Derivation of the triangle inequalities

Landau and Pollak obtained a single TI of the kind given in (13) for continuous-time signals. One can spot several similarities between their work Landau61 and the following derivation. In this paper, the primary QCs are the TIs (12). To derive such TIs, we consider three kets ${|\psi\rangle}$ , ${|a\rangle}$ , and ${|b\rangle}$ of a $d$ -dimensional Hilbert space $\mathscr{H}_{d}$ . Their inner products are expressed in the polar form as

[TABLE]

where the phases ${\mu,\nu,\delta\in[0,2\pi)}$. In the main text, ${|\psi\rangle}$ is associated with a quantum state, and ${|a\rangle}$ and ${|b\rangle}$ are associated with the two measurement settings [see (1)]. Through the inner products, the quantum angles $\alpha$, $\beta$, and $\theta$ are related to the probabilities $p$, $q$, and $r$ [see also (3), (4), (9), and (10)], and ${\text{i}=\sqrt{-1}}$. Recall that the angles lie in ${[0,\tfrac{\pi}{2}]}$, and the probabilities belong to the interval ${[0,1]}$.

It is always feasible to write one ket, say ${|\psi\rangle}$, as a sum of its component in the linear span of the other two ${\{|a\rangle,|b\rangle\}}$ and its component in the orthogonal complement of the span [see (100)]. In general, $|a\rangle$ and $|b\rangle$ are not orthogonal to each other. In the case of ${0<|\langle a|b\rangle|<1}$, employing the Gram-Schmidt orthogonalization process, one can convert the linearly independent set ${\{|a\rangle,|b\rangle\}}$ into an orthonormal set ${\{|b\rangle,|b^{\perp}\rangle\}}$ or ${\{|a\rangle,|a^{\perp}\rangle\}}$, where

[TABLE]

The two sets are related by a unitary transformation:

[TABLE]

Now we can resolve

[TABLE]

with a suitable ket $|x\rangle$ that follows ${\langle b|x\rangle=0=\langle b^{\perp}|x\rangle}$ . If and only if ${|\psi\rangle}$ lies in the span of ${\{|a\rangle,|b\rangle\}}$ , the last term in the expansion (100) vanishes, otherwise not. With the normalization of ${|\psi\rangle}$ , one can recognize ${|\langle b^{\perp}|\psi\rangle|^{2}+|\langle x|\psi\rangle|^{2}={\sin\beta}^{2}}$ , and subsequently

[TABLE]

Taking the transformation (99) and the polar form (97), we realize another representation of the ket

[TABLE]

from (100). With the new representation (102) and the polar form

[TABLE]

we attain

[TABLE]

Remember that ${\langle a|x\rangle=0=\langle a^{\perp}|x\rangle}$ because ${|x\rangle}$ lies in the orthogonal complement of ${\{|a\rangle,|b\rangle\}}$ . Owing to

[TABLE]

first, we obtain the left-hand side inequality in

[TABLE]

and afterwards the right-hand side inequality with the aid of (101). Eventually, from above, we have

[TABLE]

[using the polar form (95)].

If there are equalities in (105) as well as in (101), then we reach an equality—at the place of inequality—in (107): ${\xi=\nu+\delta\;(\text{mod}\,{2\pi})}$ are the solutions of equation ${\cos(\xi-(\nu+\delta))=1}$ . And, ${|\langle x|\psi\rangle|=0}$ implies that ${|\psi\rangle}$ is contained in the subspace generated by ${\{|a\rangle,|b\rangle\}}$ , thus ${|\langle b^{\perp}|\psi\rangle|=\sin\beta}$ . These two conditions turn (100) and (102) into

[TABLE]

These $|\psi\rangle$ kets are the only kets that saturate the inequality (107); here $\delta$ is specified by the polar form (97), provided ${\langle a|b\rangle\neq 0}$, and the global phase $\nu$ can be any real number. We cannot directly use the above analysis for the two remaining cases ${|\langle a|b\rangle|=0,1}$, hence these are studied individually.

In the case of ${\langle a|b\rangle=0}$ , ${|b^{\perp}\rangle=|a\rangle}$ and ${|a^{\perp}\rangle=|b\rangle}$ ; in fact, there is no need for the orthogonalization process, and both the representations (100) and (102) of ${|\psi\rangle}$ become the same. Furthermore, $\delta$ is not determined by the polar form (97), whereas ${\theta=\tfrac{\pi}{2}}$ . Now the inequality (107) becomes ${{\cos\alpha\,}^{2}+{\cos\beta\,}^{2}\leq 1}$ , which is—directly realized from (100) due to (101)—saturated by the ket (108) with an arbitrary real phase $\delta$ [remember ${\cos\alpha=|\langle a|\psi\rangle|}$ due to (95)].

In the case of ${|\langle a|b\rangle|=1}$, ${\theta=0}$ and ${|b\rangle=e^{\text{i}\delta}|a\rangle}$ according to (97), and the above orthogonalization process cannot be carried out, so ${|b^{\perp}\rangle}$ and ${|a^{\perp}\rangle}$ do not exist. Consequently, the term ${\langle b^{\perp}|\psi\rangle|b^{\perp}\rangle}$ will not appear in the decomposition (100) of ${|\psi\rangle}$. In place of (101), (107), and (108) we have ${0\leq|\langle x|\psi\rangle|\Rightarrow{\cos\beta\,}^{2}\leq 1}$, ${{\cos\alpha\,}^{2}={\cos\beta\,}^{2}}$, and ${|\psi\rangle=e^{\text{i}\nu}|b\rangle}$, respectively. In this case, there is no genuine QC; nevertheless, ${{\cos\beta\,}^{2}\leq 1}$ is saturated by the ket(s) ${|\psi\rangle=e^{\text{i}\nu}|b\rangle}$ [remember ${\cos\beta=|\langle b|\psi\rangle|}$, see (96)].

One can appreciate that inequality (107) is a legitimate QC, and $\alpha$ and $\beta$ must respect that for every ${\theta\in[0,\tfrac{\pi}{2}]}$ . Applying square root to both sides of the inequality, we gain

[TABLE]

Since ${\alpha\in[0,\tfrac{\pi}{2}]}$ and ${(\theta-\beta)\in[-\tfrac{\pi}{2},\tfrac{\pi}{2}]}$ , both ${\cos\alpha}$ and ${\cos(\theta-\beta)}$ are nonnegative numbers, hence there is no need to use the modulus on either side of the above inequality. As the $\arccos$ function is a strictly decreasing function and ${\arccos(\cos\varsigma)=|\varsigma|}$ for ${\varsigma\in[-\tfrac{\pi}{2},\tfrac{\pi}{2}]}$ , from (110), we own an equivalent form

[TABLE]

of (107). In fact, (111) carries two TIs: ${\theta\leq\alpha+\beta}$ and ${\beta\leq\alpha+\theta}$ . ${|\psi\rangle}$ of (108) with ${0\leq\beta\leq\theta}$ saturates the TI ${\theta\leq\alpha+\beta}$ and with ${\theta\leq\beta\leq\tfrac{\pi}{2}}$ saturates the other TI ${\beta\leq\alpha+\theta}$ . TIs such as ${\theta\leq\alpha+\beta}$ [see (13)] are used to define the combined-probability space $\bm{\omega}$ in Sec. II.

Replacing the ordered set $\{b,\beta,\nu\}$ by $\{a,\alpha,\mu\}$ in (100) and repeating the above analysis, one will discover

[TABLE]

at the places of (107) and (111), respectively. Jointly (111) and (113) can be written as

[TABLE]

which displays three TIs associated with the three angles. A TI says: the sum of two quantum angles must be greater than or equal to the remaining quantum angle.
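The three TIs can be spot-checked on randomly drawn kets. The sketch below draws complex Gaussian amplitudes, normalizes them, computes the quantum angles as $\arccos$ of the overlap moduli, and tests all three inequalities within a small numerical tolerance. The helper names and the tolerance are illustrative choices, and a random test of this kind complements the derivation without replacing it.

```python
import math
import random


def random_ket(d, rng):
    """Normalized ket in C^d with complex Gaussian amplitudes."""
    amps = [complex(rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)) for _ in range(d)]
    norm = math.sqrt(sum(abs(z) ** 2 for z in amps))
    return [z / norm for z in amps]


def quantum_angle(u, v):
    """arccos of the modulus of the inner product <u|v>."""
    overlap = abs(sum(x.conjugate() * y for x, y in zip(u, v)))
    return math.acos(min(1.0, overlap))  # clip rounding above 1


def triangle_inequalities_hold(a, b, psi, tol=1e-9):
    """Check the three TIs among the angles alpha, beta, and theta."""
    alpha = quantum_angle(a, psi)
    beta = quantum_angle(b, psi)
    theta = quantum_angle(a, b)
    return (
        theta <= alpha + beta + tol
        and alpha <= beta + theta + tol
        and beta <= alpha + theta + tol
    )
```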

In fact, the quantum angle “${\arccos|\langle\ |\ \rangle|}$” is a metric (and a distinguishability measure Wootters81 ) on the set $\mathcal{S}_{\text{pure}}$ of all pure states ( ${\rho=\rho^{2}}$ ). It is because the four conditions,

1. ${\arccos|\langle a|b\rangle|\geq 0}$

2. $\arccos|\langle a|b\rangle|=0$ if and only if ${|a\rangle\langle a|=|b\rangle\langle b|}$

3. ${\arccos|\langle a|b\rangle|=\arccos|\langle b|a\rangle|}$

4. ${\arccos|\langle a|b\rangle|\leq\arccos|\langle a|\psi\rangle|+\arccos|\langle\psi|b\rangle|}$ ,

are satisfied for every ${|a\rangle\langle a|}$, ${|b\rangle\langle b|}$, and ${|\psi\rangle\langle\psi|}$ in $\mathcal{S}_{\text{pure}}$, where ${|\langle a|b\rangle|=\sqrt{\text{tr}\bm{(}|a\rangle\langle a|\,|b\rangle\langle b|\bm{)}}}$. Note that every pure state on $\mathscr{H}_{d}$ is made of a ket in $\mathscr{H}_{d}$, and two kets that are equal up to a global phase provide the same pure state. As the $\arccos$ function is nonnegative, the first condition is valid. The second and third are true by virtue of ${|\langle a|b\rangle|=1\Leftrightarrow|a\rangle\langle a|=|b\rangle\langle b|}$ and ${|\langle a|b\rangle|=|\langle b|a\rangle|}$, respectively. The last condition is the TI ${\theta\leq\alpha+\beta}$, which is already derived above.

Returning to the TIs (114), as ${\alpha\in[0,\tfrac{\pi}{2}]}$ , $\theta+\beta$ will be a true upper bound on $\alpha$ only if it is smaller than or equal to $\tfrac{\pi}{2}$ . Hence, we can further improve (114) as

[TABLE]

Taking the right-hand side inequality and applying the cosine function—that decreases monotonically on ${[0,\pi]}$ —to both the terms, we get

[TABLE]

Now, considering the Heaviside’s unit step function

[TABLE]

one can rewrite (116) as

[TABLE]

Since the terms on either side of the above inequality are nonnegative, squaring both sides delivers

[TABLE]

Putting (107) and (119) side by side, we accomplish

[TABLE]

Furthermore, due to (95)–(97), (120) becomes

[TABLE]

In essence, we obtain QCs (115) and (121) that are equivalent to each other, one is in terms of the quantum angles and the other is in terms of the probabilities.

Appendix B Compactness and convexity of ${\bm{\omega}\subset\mathbf{\Omega}}$

The real vector space $\mathbb{R}^{2d}$ is also a metric space with the Euclidean distance, and both its subsets $\mathbf{\Omega}$ and $\bm{\omega}$ are closed as well as bounded, hence they are compact sets (thanks to the Heine-Borel theorem, see in Rudin76 ). Since a convex combination of probability vectors is again a probability vector, both $\Omega_{a}$ and $\Omega_{b}$ are convex subsets of $\mathbb{R}^{d}$ . Moreover, ${\mathbf{\Omega}=\Omega_{a}\times\Omega_{b}}$ is a convex set because it is a Cartesian product of two such sets.

To prove the convexity of $\bm{\omega}$ , we consider two combined vectors ${\big{(}\vec{p}\,^{\prime},\vec{q}\,^{\prime}\big{)}}$ and ${\big{(}\vec{p}\,^{\prime\prime},\vec{q}\,^{\prime\prime}\big{)}}$ that belong to $\bm{\omega}$ . It means that their components follow the constraints (5)–(8) and (15) that is

[TABLE]

for every ${1\leq i,j\leq d}$ . For the proof, we need to show that a convex combination

[TABLE]

fulfills all the requirements (5)–(8) and (15)—therefore, lies in $\bm{\omega}$ —for every $\lambda\in[0,1]$ . Thanks to the convexity of $\mathbf{\Omega}$ , the combination (126) belongs to $\mathbf{\Omega}$ and $\big{(}\vec{p},\vec{q}\,\big{)}$ meets all the demands (5)–(8).

Now we demonstrate that the components $p_{i}$ and $q_{j}$ of $\big{(}\vec{p},\vec{q}\,\big{)}$ respect inequality (15):

[TABLE]

We have equality (129) due to the convex combination (126), and then we acquire inequality (129) by employing (124) and (125). The next inequality (129) is attributed to the concavity of a real-valued function

[TABLE]

defined on ${[0,1]\times[0,1]}$, and the last equality is again because of the combination (126). In conclusion, the combined-probability space $\bm{\omega}$ is a convex set in $\mathbb{R}^{2d}$. Besides, to recognize that $f(p,q)$ is a concave function, we present the Hessian matrix

[TABLE]

that is a negative semidefinite matrix for every $p$ and $q$ in the interval $[0,1)$ . For ${p=1}$ or ${q=1}$ or both, ${f(p,q)=0}$ , and the Hessian matrix is the ${2\times 2}$ zero matrix.
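The convexity of $\bm{\omega}$ can also be probed numerically in the simplest case ${d=2}$. The sketch below assumes that the TIs (13) read ${\theta_{ij}\leq\alpha_{i}+\beta_{j}}$ with $\text{probability}=\cos^{2}(\text{angle})$, so that for a qubit they reduce to ${|\alpha-\beta|\leq\theta\leq\alpha+\beta\leq\pi-\theta}$; this reduction is our reading of the constraints, not a formula quoted from the text. It samples pairs of points inside the region by rejection sampling and counts how many random convex combinations fall outside. All names are hypothetical.

```python
import math
import random


def in_omega_qubit(p, q, theta, tol=0.0):
    """Membership test for the qubit combined space under the assumed TIs."""
    if not (0.0 <= p <= 1.0 and 0.0 <= q <= 1.0):
        return False
    alpha = math.acos(math.sqrt(p))
    beta = math.acos(math.sqrt(q))
    return (
        abs(alpha - beta) <= theta + tol
        and theta <= alpha + beta + tol
        and alpha + beta <= math.pi - theta + tol
    )


def sample_point(theta, rng):
    """Rejection sampling of a point (p, q) inside the region."""
    while True:
        p, q = rng.random(), rng.random()
        if in_omega_qubit(p, q, theta):
            return p, q


def count_convexity_violations(theta, trials, rng):
    """Number of random convex combinations that leave the region."""
    violations = 0
    for _ in range(trials):
        p1, q1 = sample_point(theta, rng)
        p2, q2 = sample_point(theta, rng)
        lam = rng.random()
        # Clip to [0, 1] to guard against rounding at the edges.
        p = min(1.0, max(0.0, lam * p1 + (1.0 - lam) * p2))
        q = min(1.0, max(0.0, lam * q1 + (1.0 - lam) * q2))
        if not in_omega_qubit(p, q, theta, tol=1e-9):
            violations += 1
    return violations
```

A count of zero is what the convexity proof predicts; the endpoints are accepted with zero tolerance and the combination is tested with a small tolerance to absorb rounding.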

Appendix C Preliminary calculations for the next appendix

With (3), (4), (9), and (10), let us again acknowledge that $\text{probability}=\text{cos\,(angle)}^{2}$ , and the quantum angles belong to the interval ${[0,\tfrac{\pi}{2}]}$ . Now we consider ${j\neq l}$ and

[TABLE]

Since the difference between angles ${\beta_{j}-\beta_{l}\in[-\tfrac{\pi}{2},\tfrac{\pi}{2}]}$ , we have ${0\leq\cos(\beta_{j}-\beta_{l})}$ . Hence, with (133), one can establish

[TABLE]

and then

[TABLE]

due to the $\arccos$ function; note that ${\arccos(\cos\varsigma)=\varsigma}$ for ${\varsigma\in[0,\pi]}$ . One can also perceive ${\tfrac{\pi}{2}\leq\beta_{j}+\beta_{l}}$ as a TI.
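The bound ${\tfrac{\pi}{2}\leq\beta_{j}+\beta_{l}}$ for ${j\neq l}$ can be checked on random probability vectors. The sketch below also verifies the standard identity ${\cos^{2}x+\cos^{2}y=1+\cos(x+y)\cos(x-y)}$, which we assume is the content of (133); the function names are hypothetical.

```python
import math
import random


def quantum_angle(prob):
    """Angle in [0, pi/2] with cos(angle)^2 equal to the probability."""
    return math.acos(math.sqrt(min(1.0, max(0.0, prob))))


def identity_sides(x, y):
    """Both sides of cos^2 x + cos^2 y = 1 + cos(x + y) cos(x - y)."""
    lhs = math.cos(x) ** 2 + math.cos(y) ** 2
    rhs = 1.0 + math.cos(x + y) * math.cos(x - y)
    return lhs, rhs


def min_pair_angle_sum(q):
    """Smallest beta_j + beta_l over all pairs j != l of a probability vector."""
    betas = [quantum_angle(x) for x in q]
    return min(
        betas[j] + betas[l]
        for j in range(len(q))
        for l in range(j + 1, len(q))
    )
```

Because ${q_{j}+q_{l}\leq 1}$, the smallest pair sum should never drop below $\tfrac{\pi}{2}$, up to rounding.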

Next we are going to validate a result that is applied in Appendix D.

[TABLE]

Let us designate ${\theta_{ij}-\beta_{j}}$ and ${\theta_{kl}-\beta_{l}}$ by $\varphi_{ij}$ and $\varphi_{kl}$ , respectively, and write

[TABLE]

just like (133). One can show that the sum

[TABLE]

due to ${\theta_{ij}+\theta_{kl}\leq\pi}$ and (135). Clearly ${\varphi_{ij},\varphi_{kl}\leq\tfrac{\pi}{2}}$ because ${\theta,\beta\in[0,\tfrac{\pi}{2}]}$ , and if ${0\leq\varphi_{ij},\varphi_{kl}}$ [see the requirements in (136)] then we have ${0\leq\varphi_{ij}+\varphi_{kl}}$ and ${\varphi_{ij}-\varphi_{kl}\in[-\tfrac{\pi}{2},\tfrac{\pi}{2}]}$ . As a net result, ${0\leq\cos(\varphi_{ij}\pm\varphi_{kl})}$ , the last term in (137) turns out to be a nonnegative function, and thus we achieve ${1\leq{\cos\varphi_{ij}}^{2}+{\cos\varphi_{kl}}^{2}}$ . It completes a proof of (136).

[TABLE]

If ${\theta_{ij}=\tfrac{\pi}{2}=\theta_{kl}}$ and ${\beta_{j}+\beta_{l}=\tfrac{\pi}{2}}$ then evidently we have the equality of (139). Now let us prove the converse under the requirements ${0\leq\varphi_{ij},\varphi_{kl}}$ of (136). If ${{\cos\varphi_{ij}}^{2}+{\cos\varphi_{kl}}^{2}=1}$ then the last term in (137) must vanish, which occurs—provided ${0\leq\varphi_{ij},\varphi_{kl}}$ —when the sum in (138) attains its upper bound $\tfrac{\pi}{2}$ or ${\varphi_{ij}-\varphi_{kl}=\pm\tfrac{\pi}{2}}$ . The case ${\varphi_{ij}-\varphi_{kl}=\tfrac{\pi}{2}}$ arises when ${\varphi_{ij}=\tfrac{\pi}{2}}$ and ${\varphi_{kl}=0}$ , and ${\varphi_{ij}-\varphi_{kl}=-\tfrac{\pi}{2}}$ happens when ${\varphi_{ij}=0}$ and ${\varphi_{kl}=\tfrac{\pi}{2}}$ . Both these cases come under ${\varphi_{ij}+\varphi_{kl}=\tfrac{\pi}{2}}$ —that is when the sum in (138) reaches its upper bound—which materialize if and only if ${\theta_{ij}=\tfrac{\pi}{2}=\theta_{kl}}$ and ${\beta_{j}+\beta_{l}=\tfrac{\pi}{2}}$ ; it validates (139).

Similar to (135) we have

[TABLE]

and to (136) plus (139) we have

[TABLE]

Appendix D Extreme points of $\bm{\omega}$

In Appendix B, we demonstrate that the combined-probability space $\bm{\omega}$ is a compact convex set in $\mathbb{R}^{2d}$ . According to the Krein-Milman theorem (see Theorem ${3.3.5}$ and Appendix A.3 in Niculescu93 ), every point of such a set can be decomposed into a convex combination of its extreme points. In this appendix, starting from an arbitrary interior point of $\bm{\omega}$ , we move toward its extreme points.

D.1 Interior of $\bm{\omega}$

A point ${\big{(}\dot{\vec{p}},\dot{\vec{q}}\,\big{)}\in\bm{\omega}}$ that obeys each of the constraints (6), (8), and (13) with strict inequality,

[TABLE]

is called an interior point of $\bm{\omega}$. In certain cases, such as ${d=2}$ and ${\theta\in\{0,\tfrac{\pi}{2}\}}$, there is no interior point, only extreme points, and the following analysis is not needed. However, for ${d>2}$, there is always an interior point: with ${\theta_{ij}\leq\tfrac{\pi}{2}<2\arccos\tfrac{1}{\sqrt{d}}}$, one can show that the center of $\bm{\omega}$, specified by ${p_{i}=\tfrac{1}{d}=q_{j}}$ for all ${i,j}$, is an interior point when ${d>2}$.

We begin our journey from a general but fixed interior point ${\big{(}\dot{\vec{p}},\dot{\vec{q}}\,\big{)}}$ along a straight line, which is the locus of points ${\vec{P}=\big{(}p_{1},p_{2},\dot{\vec{p}}_{\mathrm{rest}},\dot{\vec{q}}\,\big{)}\in\mathbb{R}^{2d}}$ , where $p_{1},p_{2}$ obey the linear equation

[TABLE]

and ${\dot{\vec{p}}_{\mathrm{rest}}=(\dot{p}_{3},\cdots,\dot{p}_{d})}$ . One can acknowledge that two points on this line differ from each other only in the first two coordinates, hence $p_{1},p_{2}$ are the only variables here. In (143), the inequality saturates for $d=2$ and becomes strict due to (142) when ${d>2}$ .

Since we never want to move outside of the combined space, we only consider those points on the line that lie in $\bm{\omega}$ . From Sec. II recall that a point of $\mathbb{R}^{2d}$ lies in ${\mathbf{\Omega}}$ if and only if it meets all the requirements (5)–(8), and if it also satisfies all the TIs (13) only then it belongs to ${\bm{\omega}}$ . So a point ${\vec{P}=\big{(}p_{1},p_{2},\dot{\vec{p}}_{\mathrm{rest}},\dot{\vec{q}}\,\big{)}}$ on the line, defined by (143), is contained in ${\mathbf{\Omega}}$ if and only if

[TABLE]

With (143) and (144), one can derive

[TABLE]

As per (3) and (4), we can attach angles $\alpha_{1}$ and $\alpha_{2}$ with $p_{1}$ and $p_{2}$ , correspondingly. If these angles comply with

[TABLE]

only then ${\vec{P}\in\bm{\omega}}$ . Observe that the other demands for $\vec{P}$ to be in $\bm{\omega}$ —(142) for ${3\leq i\leq d}$ and (7)—are automatically met, because $\dot{\vec{p}}_{\mathrm{rest}}$ and $\dot{\vec{q}}$ are also parts of the interior point ${\big{(}\dot{\vec{p}},\dot{\vec{q}}\,\big{)}\in\bm{\omega}}$ .

Considering the suprema

[TABLE]

we can convert all the conditions in (146) into two

[TABLE]

Throughout the paper, in the subscripts of angles, capital letters are used to highlight a supremum. A supremum, say ${\theta_{1J}-\dot{\beta}_{J}}$, cannot be a negative number: ${\theta_{1J}-\dot{\beta}_{J}<0}$ implies ${\theta_{1j}<\dot{\beta}_{j}}$ for every $j$ by the definition (147). This leads to ${r_{1j}>\dot{q}_{j}}$ for each $j$ by the relations (3), (4), (9), and (10), and then to the contradiction ${1=\textstyle\sum\nolimits_{j=1}^{d}r_{1j}>\sum\nolimits_{j=1}^{d}\dot{q}_{j}=1}$. Furthermore, ${\theta_{1J}-\dot{\beta}_{J}=0}$ if and only if ${\theta_{1j}=\dot{\beta}_{j}}$ for every $j$. So, both suprema (147) and (148) lie in ${[0,\tfrac{\pi}{2}]}$.
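The nonnegativity of the supremum is easy to confirm numerically. The sketch below takes the supremum to be the largest difference ${\theta_{1j}-\dot{\beta}_{j}}$ over $j$, as the argument above indicates, with $\text{probability}=\cos^{2}(\text{angle})$; a row of the $\varTheta$-matrix and the vector $\dot{\vec{q}}$ are both modeled as arbitrary probability vectors, which is an assumption made only for illustration, and the function names are hypothetical.

```python
import math
import random


def quantum_angle(prob):
    """Angle in [0, pi/2] with cos(angle)^2 equal to the probability."""
    return math.acos(math.sqrt(min(1.0, max(0.0, prob))))


def largest_angle_gap(r_row, q):
    """max_j (theta_j - beta_j) for probability vectors r_row and q."""
    return max(quantum_angle(r) - quantum_angle(x) for r, x in zip(r_row, q))


def random_probability_vector(d, rng):
    """Draw a random point of the probability simplex (not uniformly)."""
    weights = [rng.random() for _ in range(d)]
    total = sum(weights)
    return [w / total for w in weights]
```

Since both vectors sum to one, some entry of `r_row` cannot exceed the matching entry of `q`, so the returned gap is nonnegative, and it vanishes when the two vectors coincide.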

Since the cosine function is monotonically decreasing and nonnegative on ${[0,\tfrac{\pi}{2}]}$ , we can translate the constraints (149) as

[TABLE]

and then as

[TABLE]

By the way, inequalities (111) and (107) impose stronger restrictions than (146), (151), and (152). Since $p_{2}$ follows $p_{1}$ with Eq. (143), all the restrictions (145), (151), and (152) can be put together as

[TABLE]

One can witness that these bounds on $p_{1}$ depend on the chosen interior point ${\big{(}\dot{\vec{p}},\dot{\vec{q}}\,\big{)}}$ . In short, only those $\vec{P}$ that fulfill the requirements (143) and (153) belong to the combined space $\bm{\omega}$ .

From the interior point ${\big{(}\dot{\vec{p}},\dot{\vec{q}}\,\big{)}}$, we can travel on the line in two directions: where $p_{1}$ increases and where $p_{1}$ decreases. While moving we pass four points ${\vec{P}_{1},\cdots,\vec{P}_{4}}$ of $\mathbb{R}^{2d}$ that are presented in Table 1. When we proceed in the direction where $p_{1}$ increases, we first reach either $\vec{P}_{1}$ or $\vec{P}_{2}$, depending on the minimum value in (153). The point that we reach first belongs to $\bm{\omega}$, whereas the other point fails to satisfy (153) and thus lies outside of $\bm{\omega}$. While moving in the other direction, where $p_{1}$ decreases, we first encounter either $\vec{P}_{3}$ or $\vec{P}_{4}$. Depending on the maximum value in (153), one of ${\{\vec{P}_{3},\vec{P}_{4}\}}$ will be in $\bm{\omega}$ and the other will be out of it (unless both these points are the same).

All the above possibilities are communicated through Table 2. For any ${\big{(}\dot{\vec{p}},\dot{\vec{q}}\,\big{)}}$ , only two of these possibilities can and will materialize, thus $\bm{\omega}$ contains only a duo of (distinct) points from Table 1. In Table 3, we present every such duo. In fact, the interior point ${\big{(}\dot{\vec{p}},\dot{\vec{q}}\,\big{)}}$ can be expressed as a convex combination

[TABLE]

of points of the one duo ${\vec{P}^{\prime},\vec{P}^{\prime\prime}}$ that lies in $\bm{\omega}$ . For each duo, ${\lambda\in(0,1)}$ is presented in Table 3.

By varying $\lambda$ from 0 to 1 in the combination (154), one can generate the line segment from $\vec{P}^{\prime\prime}$ to $\vec{P}^{\prime}$ . Recall that the line is described by (143). If ${\vec{P}^{\prime},\vec{P}^{\prime\prime}}$ belong to the combined space, then obviously the whole segment will be in $\bm{\omega}$ thanks to its convexity. The line segments connecting $\vec{P}_{1}$ with $\vec{P}_{2}$ (provided ${\vec{P}_{1}\neq\vec{P}_{2}}$ ) and connecting ${\vec{P}_{3}}$ with ${\vec{P}_{4}}$ ${(\vec{P}_{3}\neq\vec{P}_{4})}$ remain outside of $\bm{\omega}$ . Therefore, these two duos are not listed in Table 3.

In this part, it is shown that every interior point ${\big{(}\dot{\vec{p}},\dot{\vec{q}}\,\big{)}}$ in $\bm{\omega}$ can be decomposed as a convex combination of boundary points of $\bm{\omega}$ , which are decomposed in the next part. Note that the subsequent analysis is for ${d>2}$ . In the case of ${d=2}$ , ${\dot{p_{1}}+\dot{p_{2}}=1}$ , and Table 1 already carries the extreme points of $\bm{\omega}$ . In fact, for ${d=2}$ , we only need $\vec{P}_{1}$ and $\vec{P}_{4}$ , because $\bm{\omega}$ contains $\vec{P}_{2}$ and $\vec{P}_{3}$ if and only if $\vec{P}_{2}=\vec{P}_{1}$ and $\vec{P}_{3}=\vec{P}_{4}$ , respectively.

D.2 Boundary of $\bm{\omega}$

The boundary of $\bm{\omega}$ is made of ${2d+d^{2}}$ regions, where a region is characterized by equality in one of the constraints (6), (8), and (13):

[TABLE]

for ${1\leq i,j\leq d}$. A point from Table 1, provided it is in $\bm{\omega}$, is called a boundary point because it belongs to one of the regions (155)–(157). To reveal that the boundary points of $\bm{\omega}$ can be decomposed into certain convex combinations, let us suppose that the duo $\vec{P}_{1},\vec{P}_{3}$ belongs to $\bm{\omega}$ and analyze first ${\vec{P}_{3}\in\mathbf{P}_{1}}$ and then ${\vec{P}_{1}\in\mathbf{R}_{1J}}$. Of course, an identical treatment can be delivered in the case of other duos from Table 3.

Now we start from ${\vec{P}_{3}}$ and travel within the region $\mathbf{P}_{1}$ along a new set of points ${\vec{P}=\big{(}0,p_{2},p_{3},\dot{\vec{p}}_{\mathrm{rest}},\dot{\vec{q}}\,\big{)}}$ by changing $p_{2},p_{3}$ according to

[TABLE]

where ${\dot{\vec{p}}_{\mathrm{rest}}=(\dot{p}_{4},\cdots,\dot{p}_{d})}$ . Repeating the procedure similar to Appendix D.1, here we have

[TABLE]

which is like (153). The supremum ${\theta_{2K}-\dot{\beta}_{K}}$ is defined by (148) and

[TABLE]

If and only if $p_{2}$ respects (159) and $p_{3}$ follows $p_{2}$ with (158), then a new ${\vec{P}\in\mathbf{P}_{1}\subset\bm{\omega}}$ .

Analogous to Tables 1–3, here we compose Tables 4–6, in that order. Table 4 holds a collection of four points. Table 5 has the conditions that decide whether a point of Table 4 is in or out of $\mathbf{P}_{1}$. Table 6 supplies all possible couples of points from Table 4, one of which belongs to $\mathbf{P}_{1}$; which one is determined by $\vec{P}_{3}$. The line segment connecting that couple carries $\vec{P}_{3}$ and lies entirely in the region $\mathbf{P}_{1}$.

Now we are going to focus on ${\vec{P}_{1}\in\mathbf{R}_{1J}}$ . Let us proceed from ${\vec{P}_{1}}$ by altering only $p_{2},p_{3}$ of another new vector ${\vec{P}=\big{(}{\cos(\theta_{1J}-\dot{\beta}_{J})}^{2},p_{2},p_{3},\dot{\vec{p}}_{\mathrm{rest}},\dot{\vec{q}}\,\big{)}}$ with respect to

[TABLE]

Note that ${\dot{\vec{p}}_{\mathrm{rest}}=(\dot{p}_{4},\cdots,\dot{p}_{d})}$ , and (161) identifies a straight line, a segment of which is contained in the region ${\mathbf{R}_{1J}}$ . In addition to (161), if $p_{2}$ agrees to

[TABLE]

only then the new vector ${\vec{P}\in\mathbf{R}_{1J}}$ . Like Tables 1 and 4, here we assemble Table 7 of four points using the four bounds in (D.2).

Due to (136) and (139) from Appendix C, we have

[TABLE]

These inequalities are strict because a requirement in (139), ${\dot{\beta}_{J}+\dot{\beta}_{K}=\tfrac{\pi}{2}}$, cannot be met, since (142) forces ${\dot{q}_{J}+\dot{q}_{K}<1}$. Now taking (D.2)–(D.2) with ${\textstyle\sum\nolimits_{i=1}^{3}\dot{p}_{i}\leq 1}$, one can deduce that the vectors $\vec{P}_{11}$ and $\vec{P}_{14}$ of Table 7 cannot belong to ${\mathbf{R}_{1J}}$ unless ${K=J}$ and ${L=J}$, respectively. This fact is recorded in Table 8 along with some other conditions; together they tell when a point of Table 7 will be in or out of the region ${\mathbf{R}_{1J}}$.

A duo, out of the four listed in Table 9, resides in ${\mathbf{R}_{1J}}$ and expresses $\vec{P}_{1}$ through a convex combination. As Tables 1–3 are linked with the interior point ${\big{(}\dot{\vec{p}},\dot{\vec{q}}\,\big{)}\in\bm{\omega}}$ and Tables 4–6 are attached to ${\vec{P}_{3}\in\mathbf{P}_{1}}$ , Tables 7–9 are associated with ${\vec{P}_{1}\in\mathbf{R}_{1J}}$ . Tables 1, 4, and 7 carry the boundary points of $\bm{\omega}$ , $\mathbf{P}_{1}$ , and $\mathbf{R}_{1J}$ , respectively.

D.3 Extreme of $\bm{\omega}$

In the above parts, it is demonstrated that every interior point ${\big{(}\dot{\vec{p}},\dot{\vec{q}}\,\big{)}\in\bm{\omega}}$ can be decomposed into a convex combination of the boundary points of $\bm{\omega}$, which can further be decomposed into convex combinations of the boundary points of regions (155)–(157). Continuing this decomposition process, we arrive at a point ${\big{(}\mathring{\vec{p}}\,,\dot{\vec{q}}\,\big{)}}$, where

[TABLE]

Since every ${\mathring{\alpha}_{i}}$ of (166) is a supremum, ${0\leq\mathring{\alpha}_{i}}$ [see the explanation below (149)] and ${\mathring{\alpha}_{i}<\dot{\alpha}_{i}<\tfrac{\pi}{2}}$ due to (142), we deduce that

[TABLE]

The point ${\big{(}\mathring{\vec{p}}\,,\dot{\vec{q}}\,\big{)}}$, designated by (165)–(169), satisfies $m$ equality constraints of type (13) and ${d-(m+1)}$ equality constraints of type (6). If ${\mathring{p}_{s}}$ of (167) follows

[TABLE]

then ${\big{(}\mathring{\vec{p}}\,,\dot{\vec{q}}\,\big{)}\in\bm{\omega}}$ , where

[TABLE]

is a supremum like (147), (148), (160), and (166). One can check that the points in Table 1 for ${d=2}$, and in Tables 4 as well as 7 (provided ${K=J}$ and ${L=J}$) for ${d=3}$, are like ${\big{(}\mathring{\vec{p}}\,,\dot{\vec{q}}\,\big{)}}$; remember that ${\textstyle\sum\nolimits_{i=1}^{d}\dot{p}_{i}=1}$ due to (5). Furthermore, one can easily recognize $\mathring{p}_{s}$ in each of these points. Then, one can see through Tables 2, 5, and 8 that one of the two inequalities in (171) is required for a point to be in $\bm{\omega}$. The other inequality is automatically obeyed due to (142) and the conditions that appeared in the earlier decompositions.

If we start our journey from a point ${\big{(}\dot{\vec{p}}\,,\dot{\vec{q}}\,\big{)}}$ , where

[TABLE]

then we will arrive at the point ${\big{(}\mathring{\vec{p}}\,,\dot{\vec{q}}\,\big{)}}$ , where

[TABLE]

[for 0, see (168)]. This point represents an extreme point of $\bm{\omega}$ and a special case

[TABLE]

of (169) and (167). In the case (175), the supremum is ${\mathring{\alpha}_{1}=\theta_{1J}-\dot{\beta}_{J}=0}$, which is possible if and only if ${\theta_{1j}=\dot{\beta}_{j}}$, that is, ${r_{1j}=\dot{q}_{j}}$, for every $j$. Indeed, it is so [see (173)]. In all other cases, ${0<\mathring{\alpha}_{i}}$ for every ${1\leq i\leq m}$ [see the limits (170) on ${\mathring{\alpha}_{i}}$ of (166)], and ${\big{(}\mathring{\vec{p}}\,,\dot{\vec{q}}\,\big{)}}$ can be decomposed further by adopting the same procedure as before.
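The vanishing supremum can be checked with a small numerical sketch. It assumes that the supremum for row $i$ has the form ${\max_{j}(\theta_{ij}-\dot{\beta}_{j})}$, which is what the notation ${\theta_{1J}-\dot{\beta}_{J}}$ suggests and what the triangle inequalities ${\theta_{ij}\leq\alpha_{i}+\beta_{j}}$ imply as the smallest admissible $\alpha_{i}$. The row `r_row` of overlaps ${r_{1j}={\cos\theta_{1j}}^{2}}$ is a hypothetical choice for ${d=3}$, and the function name is introduced here.

```python
import math


def smallest_alpha(theta_row, q):
    """Smallest alpha_i compatible with theta_ij <= alpha_i + beta_j for all j."""
    betas = [math.acos(math.sqrt(x)) for x in q]
    return max(t - b for t, b in zip(theta_row, betas))


# Hypothetical first row of overlaps r_1j = cos(theta_1j)**2 for d = 3.
r_row = [0.5, 0.3, 0.2]
theta_row = [math.acos(math.sqrt(r)) for r in r_row]

# q equal to the row r_1j: every theta_1j equals beta_j, so the supremum is 0.
alpha_at_r = smallest_alpha(theta_row, r_row)

# q concentrated on the first outcome: the supremum becomes theta_11.
alpha_at_basis = smallest_alpha(theta_row, [1.0, 0.0, 0.0])
p_cap = math.cos(alpha_at_basis) ** 2  # largest p_1 still allowed
```

With ${\dot{q}_{j}=r_{1j}}$ the bound on $p_{1}$ is 1, while for ${\dot{q}=(1,0,0)}$ the same bound drops to $r_{11}$, in line with the statement that the supremum vanishes only when ${r_{1j}=\dot{q}_{j}}$ for every $j$.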

Without loss of generality, let us suppose ${J=1}$ for the subsequent analysis. Here we begin with ${\vec{Q}=\big{(}\mathring{\vec{p}}\,,\dot{q}_{1},q_{2},q_{3},\dot{\vec{q}}_{\mathrm{rest}}\,\big{)}}$ , where

[TABLE]

and ${\dot{\vec{q}}_{\mathrm{rest}}=(\dot{q}_{4},\cdots,\dot{q}_{d})}$ . One can acknowledge that $\vec{Q}$ represents all those points, including ${\big{(}\mathring{\vec{p}}\,,\dot{\vec{q}}\,\big{)}}$ , that fall on the straight line characterized by (176).

If $q_{3}$ stays on the line with $q_{2}$ , which follows

[TABLE]

then ${\vec{Q}\in\bm{\omega}}$ . Here

[TABLE]

are suprema, and the angles $\mathring{\alpha}$ are related to the components of $\mathring{\vec{p}}$ through (3) and (4) [see also (165) and (166)]. The constraints (177) resemble (153) and (159). As in Tables 1, 4, and 7, we enter a list of four points in Table 10, where the points are drawn from the four bounds on $q_{2}$ given in (177).

Now, to establish criteria for a point of Table 10 to be in or out of $\bm{\omega}$ , we are going to address the two cases

[TABLE]

individually [see Eq. (167) for $\mathring{p}_{s}$ and the range (169) of $m$ ]. Let us first take the case (181): whatever the suprema (178) and (179) are, we have

[TABLE]

To demonstrate this, we consider ${m=2}$; the cases with ${m>2}$ can be handled likewise. For ${m=2}$, we have ${\dot{\beta}_{1}=\theta_{i1}-\mathring{\alpha}_{i}}$ (where ${i=1,2}$) due to (166). If $K$ associated with the supremum (178) is 1, then by taking ${\dot{\beta}_{1}=\theta_{21}-\mathring{\alpha}_{2}}$ we can validate the strict inequality (182) thanks to (141). If ${K\neq 1}$, we can do the same by now considering ${\dot{\beta}_{1}=\theta_{11}-\mathring{\alpha}_{1}}$. In a similar fashion, we can establish the other inequality (183).

We draw the following inferences from inequalities (182) and (183).

[TABLE]

implies that the maximum and the minimum values in (177) are 0 and ${\dot{q}_{2}+\dot{q}_{3}}$ , respectively. Consequently, the points $\vec{Q}_{1}$ and $\vec{Q}_{4}$ of Table 10 never, whereas $\vec{Q}_{2}$ and $\vec{Q}_{3}$ always, belong to $\bm{\omega}$ in the case (181). Moreover, ${\big{(}\mathring{\vec{p}}\,,\dot{\vec{q}}\,\big{)}}$ can be broken into the convex combination ${\lambda\,\vec{Q}_{2}+(1-\lambda)\,\vec{Q}_{3}}$ , where ${\lambda=\tfrac{\dot{q}_{2}}{\dot{q}_{2}+\dot{q}_{3}}}$ [see Table 12].
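The weight ${\lambda=\tfrac{\dot{q}_{2}}{\dot{q}_{2}+\dot{q}_{3}}}$ can be verified with a short sketch. It assumes that the line (176) keeps ${q_{2}+q_{3}}$ fixed at ${\dot{q}_{2}+\dot{q}_{3}}$, that $\vec{Q}_{2}$ sits at the upper bound ${q_{2}=\dot{q}_{2}+\dot{q}_{3}}$, and that $\vec{Q}_{3}$ sits at the lower bound ${q_{2}=0}$. Only the two varying components are tracked, and the function name and input values are hypothetical.

```python
def split_point(q2_dot, q3_dot):
    """Return lam, the (q2, q3) parts of Q2 and Q3, and their mixture."""
    total = q2_dot + q3_dot
    if total <= 0.0:
        raise ValueError("q2_dot + q3_dot must be positive")
    lam = q2_dot / total
    Q2 = (total, 0.0)  # q2 at its assumed upper bound, q3 = 0
    Q3 = (0.0, total)  # q2 at its assumed lower bound 0
    mix = tuple(lam * a + (1.0 - lam) * b for a, b in zip(Q2, Q3))
    return lam, Q2, Q3, mix


# Hypothetical values of the two components being redistributed.
lam, Q2, Q3, mix = split_point(0.15, 0.45)
```

The mixture returns ${(\dot{q}_{2},\dot{q}_{3})}$, so the original point is recovered from two points that each have one more vanishing component.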

Next, it is not difficult to realize that both $\vec{Q}_{2}$ and $\vec{Q}_{3}$ can be decomposed further and further until we arrive at a point $\big{(}\mathring{\vec{p}}\,,\mathring{\vec{q}}\,\big{)}$ , where

[TABLE]

In the decomposition process one will encounter inequalities, such as (182) and (183), that can be tackled as above. For ${m>1}$, a point $\big{(}\mathring{\vec{p}}\,,\mathring{\vec{q}}\,\big{)}$ defined by (165)–(168) and (186) is an extreme point of $\bm{\omega}$, because it cannot be written as a convex combination of other points of $\bm{\omega}$. Furthermore, $\big{(}\mathring{\vec{p}}\,,\mathring{\vec{q}}\,\big{)}$ is a vector-valued function of $\dot{\beta}_{1}$ since the $\theta$-angles are fixed by (10) once the measurement settings are selected in (1).

Let us now turn to the case (180), where ${\dot{\beta}_{1}=\theta_{11}-\mathring{\alpha}_{1}}$ according to (166),

[TABLE]

Since the supremum (178) is a nonnegative number, $K$ can either be $s$ or 1 here. This is because ${\theta_{i2}-\mathring{\alpha}_{i}\leq 0}$ when ${i\neq s}$ and ${i\neq 1}$, as then ${\mathring{\alpha}_{i}=\tfrac{\pi}{2}}$ and every ${\theta\leq\tfrac{\pi}{2}}$. Similarly, $L$ related to the supremum (179) can either be $s$ or 1 here.

When ${K=s}$ or ${L=s}$ or both, we encounter a situation similar to the case (181). When ${K=s}$, due to (141), we have

[TABLE]

One can perceive that (189) and (190) are analogous to (182) and (184), respectively. The inequalities in (190) suggest that ${\dot{q}_{2}+\dot{q}_{3}}$ is the minimum value in (177). Therefore, $\vec{Q}_{2}$ always lies in $\bm{\omega}$, and $\vec{Q}_{1}$ belongs to $\bm{\omega}$ only when it is $\vec{Q}_{2}$. Identically, for ${L=s}$, $\vec{Q}_{3}$ always lies in $\bm{\omega}$, and $\vec{Q}_{4}$ belongs to $\bm{\omega}$ only when it is $\vec{Q}_{3}$.

When ${K=1}$ and ${L=1}$ only then ${\vec{Q}_{1}}$ and ${\vec{Q}_{4}}$ can be in $\bm{\omega}$ without being equal to $\vec{Q}_{2}$ and $\vec{Q}_{3}$ , respectively [see Table 11]. With Table 11, for the case (180), one can find out whether or not a duplet of points from Table 10 lies in $\bm{\omega}$ . All such duplets are gathered in Table 12, which reveals that the point ${\big{(}\mathring{\vec{p}}\,,\dot{\vec{q}}\,\big{)}}$ can be split into a convex combination. As before, we can break the points of Table 10 further and further until we reach extreme points of $\bm{\omega}$ .

In the case (180), the decomposition process leads to

[TABLE]

If $\mathring{q}_{t}$ of (193) obeys

[TABLE]

then the point ${\big{(}\mathring{\vec{p}}\,,\mathring{\vec{q}}\,\big{)}}$ stated by (188) and (191) belongs to $\bm{\omega}$. It is an extreme point of $\bm{\omega}$ in the case (180). One can also realize that there both $\mathring{\vec{p}}$ and $\mathring{\vec{q}}$ are functions of $\dot{\beta}_{1}$ by noticing ${\mathring{\beta}_{j}=\theta_{1j}-\theta_{11}+\dot{\beta}_{1}}$ in (192) with ${\mathring{\alpha}_{1}=\theta_{11}-\dot{\beta}_{1}}$. In fact, the extreme point identified by (174) and (173) in the case (175) can also be represented with these $\mathring{\vec{p}}$ and $\mathring{\vec{q}}$ of (188) and (191) by taking ${\mathring{\alpha}_{1}=0}$, which makes it an endpoint of the parametric curve ${\big{(}\mathring{\vec{p}}{\scriptstyle(\mathring{\alpha}_{1})}\,,\mathring{\vec{q}}{\scriptstyle(\mathring{\alpha}_{1})}\,\big{)}}$. In conclusion, we realize the structure of extreme points of $\bm{\omega}$:

[TABLE]

D.4 Limits on $\beta_{1}$

We start with the $m$ -parametric curve ${\big{(}\vec{p}{\scriptstyle(\beta_{1})}\,,\vec{q}{\scriptstyle(\beta_{1})}\,\big{)}}$ identified by (16)–(21). According to (197), a part of the curve that lies in $\bm{\omega}$ represents its extreme points. This part is specified by the upper and lower limits of $\beta_{1}$ . To compute these limits, here, we only need to consider

[TABLE]

When ${i>m}$ and ${i\neq s}$, then ${\alpha_{i}=\tfrac{\pi}{2}}$, and when ${j\neq 1}$ and ${j\neq t}$, then ${\beta_{j}=\tfrac{\pi}{2}}$. So one can easily perceive that the points ${\big{(}\vec{p}{\scriptstyle(\beta_{1})}\,,\vec{q}{\scriptstyle(\beta_{1})}\,\big{)}}$ fulfill the rest of the requirements (13) as well as (5)–(8) to be in $\bm{\omega}$.

For ${i=s}$ in (199) or ${j=t}$ in (200), the TI is always obeyed due to

[TABLE]

With (140), (16), and (135) one can sequentially go through the steps (201)–(203), and the left-hand side inequality in (204) is a consequence of ${\theta\leq\tfrac{\pi}{2}}$ . Since $\alpha_{s}$ and $\beta_{t}$ obey ${\tfrac{\pi}{2}\leq\alpha_{s}+\beta_{t}}$ , they certainly follow the TI ${\theta_{st}\leq\alpha_{s}+\beta_{t}}$ as every ${\theta\leq\tfrac{\pi}{2}}$ .

If we decrease $\beta_{1}$ then ${\alpha_{s}+\beta_{1}}$ decreases, and $\beta_{1}$ reaches its lower limit $\beta^{\prime}$ when the inequality (200), for ${j=1}$ , gets saturated. It means that $\beta^{\prime}$ is a solution of the equation ${\theta_{s1}-\beta^{\prime}=\alpha_{s}}$ and thus of

[TABLE]

[by (16) and (19)]. If we increase $\beta_{1}$ then $p_{s}$ and ${\alpha_{i}+\beta_{t}}$ ${(i=1,\cdots,m)}$ decrease, and $\beta_{1}$ attains its upper limit $\beta^{\prime\prime}$ as soon as one of the inequalities (198) and (199) gets saturated. Using (16), (19), and ${\beta_{t}=\tfrac{\pi}{2}-\beta_{1}}$ [owing to (135)], these inequalities can be expressed as

[TABLE]

Now we need to investigate the two cases, ${m=1}$ and ${1<m\leq(d-1)}$ listed in (197), separately for $\beta^{\prime\prime}$ .

In the case ${m=1}$ , (206) clearly holds, and the upper limit

[TABLE]

is obtained when (207) is saturated. Corresponding to $\beta^{\prime\prime}$ of (208), we have

[TABLE]

which is a root of the equation

[TABLE]

In the case ${1<m\leq(d-1)}$ , when we increase $\beta_{1}$ then the inequality (206), rather than (207), gets saturated first. Hence, $\beta^{\prime\prime}$ is now a solution of

[TABLE]

One can justify these statements by proving

[TABLE]

where ${1\leq i,i^{\prime}\leq m}$ . As $\beta^{\prime\prime}$ is a root of Eq. (211), $\widetilde{\beta}$ is a root of

[TABLE]

Equations (205), (211), and (213) are of the form

[TABLE]

where ${\textsc{m}}$ angles (the ${\textsc{m}}$-set ${\{\theta_{11},\cdots,\theta_{\textsc{m}1}\}}$) are taken from the first column of the $\varTheta$ matrix [given in (11)]. We must always choose the root of Eq. (214) that respects ${0\leq\beta_{1}\leq\theta_{i1}}$ for every ${i=1,\cdots,\textsc{m}}$. Furthermore, as we add more angles from the first column to the ${\textsc{m}}$-set, the number of nonnegative terms increases on the left-hand side of Eq. (214), so a smaller value of $\beta_{1}$ then satisfies Eq. (214). So, by comparing Eqs. (211) and (213) in this way, we can certify the left-hand side inequality in (212). The right-hand side inequality, after a simplification, turns into ${\theta_{i^{\prime}1}+\theta_{it}\leq\pi}$, which is true as every ${\theta\leq\tfrac{\pi}{2}}$.

[TABLE]

In fact, Eq. (210), where two angles are taken from the first column of $\varTheta$, is also like Eq. (214). Basically, one needs to solve an equation such as (214), where ${2\leq\textsc{m}\leq d}$ angles are picked from a row or a column of $\varTheta$, to get a limit and then an endpoint of an $m$-parametric curve. When ${m=1}$, ${\textsc{m}}$ can only be 2 [see (205) and (210)]. And, when ${1<m\leq(d-1)}$, ${\textsc{m}}$ can either be $m$ or ${m+1}$ [see (211) and (205)].

To solve Eq. (214) for $\beta_{1}$ , we transform it into

[TABLE]

Calling ${{\cos\beta_{1}}^{2}=q_{1}}$ by the relations (3) and (4), we can write Eq. (216) as

[TABLE]

The two roots of Eq. (220) are

[TABLE]

which only depend on the ${\textsc{m}}$-set ${\{\theta_{11},\cdots,\theta_{\textsc{m}1}\}}$ associated with Eq. (214).

We pick the root (221) with the + sign for the following reasons. First, for ${\textsc{m}=2}$, we have an equation such as (213), and its root $\widetilde{\beta}$, given in (212), corresponds to the + sign solution [see also (209) with (210)]. Second, for ${\textsc{m}=d}$, ${\beta_{1}=0}$ is the only permissible solution of Eq. (214). This is because the angles $\theta_{i1}$ are not arbitrary real numbers; they follow ${\textstyle\sum\nolimits_{i=1}^{d}{\cos\theta_{i1}}^{2}=1}$. When ${\textsc{m}=d}$, ${\textbf{z}=d-2=-\textbf{x}}$ [see (217) and (219)], and the solution (221) with the + sign always offers ${\beta_{1}=0}$. Third, for a pair of MUBs [Durt10], where every ${\theta}$ is the same ${\arccos\tfrac{1}{\sqrt{d}}}$, one can directly solve Eq. (214). For every ${\textsc{m}}$-set, we get the same $\beta_{1}$ [see $\chi$ in (56)], which corresponds to

[TABLE]

that is clearly the root (221) with + sign.
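The second and third reasons can be probed numerically. The sketch below assumes that Eq. (214) has the form ${\textstyle\sum\nolimits_{i}{\cos(\theta_{i1}-\beta_{1})}^{2}=1}$ over the chosen set of angles; this form is inferred from the saturated triangle inequality ${\theta_{s1}-\beta^{\prime}=\alpha_{s}}$ together with normalization and is not quoted from the paper. Instead of the closed-form root (221), it uses plain bisection on ${0\leq\beta_{1}\leq\min_{i}\theta_{i1}}$, where the left-hand side is nondecreasing. The names `solve_beta` and `mub_beta` are hypothetical.

```python
import math


def solve_beta(thetas, iters=200):
    """Bisection root of sum(cos(theta - beta)**2) = 1 on [0, min(thetas)].

    Assumes the angles come from one column of the overlap matrix, so the
    sum of cos(theta)**2 over the chosen angles does not exceed 1.
    """
    def f(beta):
        return sum(math.cos(t - beta) ** 2 for t in thetas) - 1.0

    lo, hi = 0.0, min(thetas)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if f(mid) < 0.0:
            lo = mid  # left-hand side still too small, move right
        else:
            hi = mid
    return 0.5 * (lo + hi)


def mub_beta(d, size):
    """Root of the assumed equation when every theta = arccos(1/sqrt(d))."""
    return math.acos(1.0 / math.sqrt(d)) - math.acos(1.0 / math.sqrt(size))


theta_mub = math.acos(1.0 / math.sqrt(4))   # d = 4, mutually unbiased bases
beta_two = solve_beta([theta_mub] * 2)      # two angles in the set
beta_full = solve_beta([theta_mub] * 4)     # all d angles in the set
```

Under the stated assumption, the full set of $d$ angles gives ${\beta_{1}=0}$, and for mutually unbiased bases the numerical root agrees with ${\arccos\tfrac{1}{\sqrt{d}}}$ minus the arccosine of the inverse square root of the set size, which depends only on the size of the set.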

Bibliography

[1] W. Heisenberg, Z. Phys. 43, 172 (1927); English translation in [2].
[2] J. A. Wheeler and W. H. Zurek, eds., Quantum Theory and Measurement (Princeton University Press, Princeton, New Jersey, 1983), pp. 62–84.
[3] H. Weyl, The Theory of Groups and Quantum Mechanics, English translation by H. P. Robertson (E. P. Dutton, New York, 1932), Chapter 2, Section 7 and Appendix 1.
[4] P. Busch, T. Heinonen, and P. Lahti, Phys. Rep. 452, 155 (2007).
[5] H. P. Robertson, Phys. Rev. 34, 163 (1929).
[6] D. Deutsch, Phys. Rev. Lett. 50, 631 (1983).
[7] K. Kraus, Phys. Rev. D 35, 3070 (1987).
[8] H. Maassen and J. B. M. Uffink, Phys. Rev. Lett. 60, 1103 (1988).
