Multiple-copy state discrimination of noisy qubits

Kieran Flatt; Stephen M. Barnett; Sarah Croke

arXiv:1906.11212·quant-ph·October 2, 2019

Multiple-copy state discrimination of noisy qubits

Kieran Flatt, Stephen M. Barnett, Sarah Croke

PDF

TL;DR

This paper compares local and collective quantum state discrimination schemes for noisy qubits, revealing that both schemes have a shared success limit and that local schemes outperform collective ones under noise.

Contribution

It provides the first detailed analysis of success probabilities for local and collective schemes under preparation noise in quantum state discrimination.

Findings

01

Both schemes share a common success limit less than one.

02

The local scheme outperforms the collective scheme in noisy conditions.

03

Preparation noise significantly impacts the optimal discrimination strategy.

Abstract

Multiple-copy state discrimination is a fundamental task in quantum information processing. If there are two, pure, non-orthogonal states then both local and collective schemes are known to reach the Helstrom bound, the maximum probability of successful discrimination allowed by quantum theory. For mixed states, it is known that only collective schemes can perform optimally, so it might be expected that these schemes are more resilient to preparation noise. We calculate the probability of success for two schemes, one local and one collective, in the regime of imperfect preparation fidelity. We find two surprising results. Firstly, both schemes converge upon the same many-copy limit, which is less than unity. Secondly, the local scheme performs better in all cases. This highlights the point that one should take into account noise when designing state discrimination schemes.

Equations131

∣ ψ_{k} ⟩ = cos (θ) ∣0 ⟩ + (- 1)^{k} sin (θ) ∣1 ⟩ k = 0, 1.

∣ ψ_{k} ⟩ = cos (θ) ∣0 ⟩ + (- 1)^{k} sin (θ) ∣1 ⟩ k = 0, 1.

P_{1}^{H} = \frac{1}{2} (1 + 1 - 4 p_{0} p_{1} cos^{2} (2 θ)) .

P_{1}^{H} = \frac{1}{2} (1 + 1 - 4 p_{0} p_{1} cos^{2} (2 θ)) .

P_{N}^{H} = \frac{1}{2} (1 + 1 - 4 p_{0} p_{1} cos^{2 N} (2 θ)) .

P_{N}^{H} = \frac{1}{2} (1 + 1 - 4 p_{0} p_{1} cos^{2 N} (2 θ)) .

∣ \tilde{ψ}_{k}^{i} ⟩

∣ \tilde{ψ}_{k}^{i} ⟩

= cos (δ θ_{i}) ∣ ψ_{k} ⟩ - sin (δ θ_{i}) ∣ ψ_{k ⊥} ⟩ .

⟨ cos^{2} (δ θ_{i})⟩ = \int ∣ ⟨ \tilde{ψ}_{k}^{i} ∣ ψ_{k} ⟩ ∣^{2} P (δ θ_{i}) = F,

⟨ cos^{2} (δ θ_{i})⟩ = \int ∣ ⟨ \tilde{ψ}_{k}^{i} ∣ ψ_{k} ⟩ ∣^{2} P (δ θ_{i}) = F,

⟨ sin^{2} (δ θ_{i})⟩

⟨ sin^{2} (δ θ_{i})⟩

⟨ cos (2 δ θ_{i})⟩

⟨ sin (2 δ θ_{i})⟩

ρ_{k} = F ∣ ψ_{k} ⟩ ⟨ ψ_{k} ∣ + (1 - F) ∣ ψ_{k ⊥} ⟩ ⟨ ψ_{k ⊥} ∣,

ρ_{k} = F ∣ ψ_{k} ⟩ ⟨ ψ_{k} ∣ + (1 - F) ∣ ψ_{k ⊥} ⟩ ⟨ ψ_{k ⊥} ∣,

∣ ω_{0}^{n} ⟩

∣ ω_{0}^{n} ⟩

∣ ω_{1}^{n} ⟩

cos (2 ϕ_{x}) = (- 1)^{i_{N - 1}} \frac{1 - 4 p _{0} p _{1} cos ^{2 N - 2} ( 2 θ )}{1 - 4 p _{0} p _{1} cos ^{2 N} ( 2 θ )} .

cos (2 ϕ_{x}) = (- 1)^{i_{N - 1}} \frac{1 - 4 p _{0} p _{1} cos ^{2 N - 2} ( 2 θ )}{1 - 4 p _{0} p _{1} cos ^{2 N} ( 2 θ )} .

P (i_{N} ∣ x, k) = \frac{1}{2} [1

P (i_{N} ∣ x, k) = \frac{1}{2} [1

+ (- 1)^{i_{N} + k} sin (2 θ) sin (2 ϕ_{x})]

P_{N}^{a d} = x, k \sum p_{k} P (k ∣ x, k) P (x ∣ k) .

P_{N}^{a d} = x, k \sum p_{k} P (k ∣ x, k) P (x ∣ k) .

P_{N}^{a d} = \frac{1}{2} [1

P_{N}^{a d} = \frac{1}{2} [1

+ cos (2 θ) cos (2 ϕ_{x}) (- 1)^{i_{N - 1} + k} p_{k} P (x ∣ k))] .

P_{N}^{a d} = \frac{1}{2} 1 + \frac{sin ^{2} ( 2 θ )}{1 - cos ^{2 N} ( 2 θ )} x, k \sum p_{k} P (x ∣ k)

P_{N}^{a d} = \frac{1}{2} 1 + \frac{sin ^{2} ( 2 θ )}{1 - cos ^{2 N} ( 2 θ )} x, k \sum p_{k} P (x ∣ k)

+ cos^{2} (2 θ) \frac{1 - cos ^{2 N - 2} ( 2 θ )}{1 - cos ^{2 N} ( 2 θ )} x, k \sum (- 1)^{i_{N - 1} + k} p_{k} P (x ∣ k) .

P (x ∣ k) = P (i_{N - 1} \overset{x}{˙} ∣ k) = P (i_{N - 1} ∣ \overset{x}{˙}, k) P (\overset{x}{˙} ∣ k),

P (x ∣ k) = P (i_{N - 1} \overset{x}{˙} ∣ k) = P (i_{N - 1} ∣ \overset{x}{˙}, k) P (\overset{x}{˙} ∣ k),

x, k \sum (- 1)^{i_{N - 1} + k} p_{k} P (x ∣ k)

x, k \sum (- 1)^{i_{N - 1} + k} p_{k} P (x ∣ k)

= \overset{x}{˙}, k \sum (sin (2 θ) sin (2 ϕ_{\overset{x}{˙}}) p_{k} P (\overset{x}{˙} ∣ k)

+ cos (2 θ) cos (2 ϕ_{\overset{x}{˙}}) (- 1)^{i_{N - 2} + k} p_{k} P (\overset{x}{˙} ∣ k)) .

x, k \sum (- 1)^{i_{N - 1} + k} p_{k} P (x ∣ k) = 2 P_{N - 1}^{a d} - 1.

x, k \sum (- 1)^{i_{N - 1} + k} p_{k} P (x ∣ k) = 2 P_{N - 1}^{a d} - 1.

P_{N}^{a d} = \frac{1}{2} [\frac{1}{12} 1

P_{N}^{a d} = \frac{1}{2} [\frac{1}{12} 1

+ cos^{2} (2 θ) \frac{1 - cos ^{2 N - 2} ( 2 θ )}{1 - cos ^{2 N} ( 2 θ )} (2 P_{N - 1}^{a d} - 1)] .

P (i_{N} ∣ x, k)

P (i_{N} ∣ x, k)

= \frac{1}{2} [1 + (2 F - 1) (- 1)^{i_{N}} cos (2 θ) cos (2 ϕ_{x})

+ (2 F - 1) (- 1)^{i_{N} + k} sin (2 θ) sin (2 ϕ_{x})]

P_{N}^{a d}

P_{N}^{a d}

+ (2 F - 1) cos^{2} (2 θ) \frac{1 - cos ^{2 N - 2} ( 2 θ )}{1 - cos ^{2 N} ( 2 θ )} (2 P_{N - 1}^{a d} - 1)] .

P_{N}^{a d} = \frac{1}{2} (\frac{x ^{2}}{2} 1

P_{N}^{a d} = \frac{1}{2} (\frac{x ^{2}}{2} 1

+ \frac{sin ^{2} ( 2 θ )}{1 - cos ^{2 N} ( 2 θ )} S_{N}),

S_{N} = i = 1 \sum N (2 F - 1)^{N + 1 - i} (1 - (2 F - 1)^{i - 1}) cos^{2 N - 2 i} (2 θ) .

S_{N} = i = 1 \sum N (2 F - 1)^{N + 1 - i} (1 - (2 F - 1)^{i - 1}) cos^{2 N - 2 i} (2 θ) .

S_{N} =

S_{N} =

- (2 F - 1)^{N} \frac{1 - cos ^{2 N} ( 2 θ )}{1 - cos ^{2} ( 2 θ )} .

∣ ψ_{k}^{(n)} ⟩ = cos (θ_{n}) ∣0 ⟩ + (- 1)^{k} sin (θ_{n}) ∣1 ⟩, k = 0, 1

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Multiple-copy state discrimination of noisy qubits

Kieran Flatt

[email protected]

Stephen M. Barnett

Sarah Croke

School of Physics and Astronomy, University of Glasgow, Glasgow, G12 8QQ, UK

Abstract

Multiple-copy state discrimination is a fundamental task in quantum information processing. If there are two, pure, non-orthogonal states then both local and collective schemes are known to reach the Helstrom bound, the maximum probability of successful discrimination allowed by quantum theory. For mixed states, it is known that only collective schemes can perform optimally, so it might be expected that these schemes are more resilient to preparation noise. We calculate the probability of success for two schemes, one local and one collective, in the regime of imperfect preparation fidelity. We find two surprising results. Firstly, both schemes converge upon the same many-copy limit, which is less than unity. Secondly, the local scheme performs better in all cases. This highlights the point that one should take into account noise when designing state discrimination schemes.

I Introduction

In many quantum information processing tasks, one needs to identify, by measurement, the state of a system given that the finite and discrete set of states from which it is taken is known. This task is called state discrimination Barnett and Croke (2009); Bae and Kwek (2015); Bergou (2010); Barnett (2009); Chefles (2010). Unless the set of possible states is an orthogonal basis for some space they cannot be perfectly discriminated and instead the user usually seeks to minimise one of two figures of merit, either the probability of incorrectly identifying or failing to identify the state. The measurement which minimises the former of these is the Helstrom, or minimum-error, measurement Helstrom (1976). If there are two possible states, the optimal measurement has a simple analytic form. In more complex cases, such as three-or-more pure states Ha and Kwon (2013) or mixed states Weir et al. (2017); Bae and Hwang (2013), only limited results are known.

Our above comments relate to single-copy state discrimination. Given a resource of multiple systems, all prepared in the same state, it might be expected that the correlations can be used to improve the probability of success. This intuition is correct and the Helstrom bound, the optimal value of this probability, is known for discriminating two states. However, in this case a physical implementation of the measurement is typically hard to find. Furthermore, the issue of locality versus collectivity arises: can the bound be achieved with local measurements, those on individual systems, only or must the discriminator use collective measurements, which are more difficult to perform? It is known that the best measurement to discriminate multi-partite states is often a collective measurement, even for product states. Famous examples of this are the double trine ensemble Peres and Wootters (1991); Massar and Popescu (1995); Chitambar and Hsieh (2013); Croke et al. (2017) and the domino states Bennett et al. (1999); Childs et al. (2013); Croke and Barnett (2017).

For two-pure-state discrimination, it is known that a local scheme can reach the Helstrom bound Acín et al. (2005); Brody and Meister (1996); Ban et al. (1997). In other scenarios, very few analytic results have been acquired and most knowledge comes from numerical simulations Higgins et al. (2011); Slussarenko et al. (2017). Here, some counterintuitive results emerge. One example is that the distinction between local and global optimality emerges. In some cases, among local schemes, the best overall measurement involves a fixed measurement on each qubit, which succeeds locally with a suboptimal probability Higgins et al. (2011). For a small number of copies, adaptive schemes perform better than fixed schemes Higgins et al. (2009), but in the limit of large numbers of copies, this advantage disappears, even for mixed states Higgins et al. (2011, 2009); Calsamiglia et al. (2008, 2010); Hayashi (2009). Further, it is for almost pure, but strictly speaking mixed, states that the gap in performance between collective stratgies and local strategies is most pronounced in the many-copy limit. Such unexpected results signal the need for further analytical work in this area.

The work presented in this paper investigates a separate, but related question. How resilient are multiple-copy state discrimination schemes to preparation noise? No real preparation is ever perfect, but for high enough fidelity we may consider the states to be pure. Further, decoherence properties of even state-of-the-art qubits can demonstrate significant variability (for a recent example see for examples Refs. Burnett et al. (2019); Schlör et al. (2019)), resulting in a corresponding variability in the rate at which a preparation characterised as very high fidelity degrades over time. Finally, in a real-world physical communications system, instabilities in noise properties of a channel can lead to uncharacterised noise in the received states. How sensitive are schemes designed for pure states to a small amount of uncharacterized preparation noise? As the truly optimal scheme for noisy qubits will be collective, it might be expected that such schemes will be more resilient to preparation noise than the equivalent local scheme. Our approach is to compare two equivalent schemes, one local Acín et al. (2005) and one collective Blume-Kohout et al. (2013), both of which reach the Helstrom bound for discriminating two pure states. We apply each scheme, optimised for a specific pair of pure states, to the corresponding mixed states and relate the probability of success to the preparation fidelity. Our results show that, surprisingly, the local scheme consistently performs better than the collective scheme. Neither, however, approaches unit success probability as the number of copies, $N$ , grows. Rather, they approach the same fixed bound. We discuss how to use information which would otherwise be thrown away in the local adaptive scheme to improve on this bound. This recovers asymptotic behaviour which, as the number of qubits approaches infinity, tends towards perfect discrimination.

II Preliminaries

Two pure states of a qubit occupy a single great circle on the Bloch sphere. For this reason, they can be characterised in relation to each other by real numbers only and written in the form

[TABLE]

The overlap of these two states is $\langle\psi_{0}|\psi_{1}\rangle=\cos(2\theta)$ and, without loss of generality, $0\leq\theta\leq\pi/4$ . If a single system is prepared in either of these states with probabilities $p_{k}$ , the highest possible probability of successful discrimination is given by the Helstrom bound,

[TABLE]

If $\theta=\pi/4$ the two states are orthogonal. In such a case, ${\rm P}^{H}_{1}=1$ and they can be perfectly discriminated. Otherwise, this quantity is less than one. The measurement which achieves this bound is a projective measurement onto the eigenvectors of $p_{0}|\psi_{0}\rangle\langle\psi_{0}|-p_{1}|\psi_{1}\rangle\langle\psi_{1}|$ .

If instead there is a resource of $N$ copies of the state, we are seeking to distinguish $|\psi_{0}\rangle^{\otimes N}$ from $|\psi_{1}\rangle^{\otimes N}$ . As these can be considered as two single pure states on the total Hilbert space, the multiple-copy Helstrom bound is

[TABLE]

In this case, the measurement which achieves this is again a von Neumann measurement, one that projects onto the eigenstates of $p_{0}|\psi_{0}\rangle\langle\psi_{0}|^{\otimes N}-p_{1}|\psi_{1}\rangle\langle\psi_{1}|^{\otimes N}$ . To find these we must find the eigenvalues of a $2^{N}$ dimensional matrix, a task which is much simplified by the permutation symmetry in the many copy case. For pure states in particular, there are just two-dimensions that are important, and a number of optimal schemes are known, of which we consider two.

In this article we are concerned with systems in which the resource qubits are prepared imperfectly. This is represented by a parameter $\delta\theta_{i}$ which characterises the displacement of the $i$ th qubit’s state from the ideal case such that

[TABLE]

In the second line we have related the noisy form of the state to the ideal case, Eq. 1 and introduce $|\psi_{k\perp}\rangle$ to indicate the state orthogonal to $|\psi_{k}\rangle$ . The fidelity $F$ is the standard way to parameterise the noise on a system. It can be understood operationally as the probability that a measurement of the prepared state will identify it as the ideal state Barnett (2009). For pure states, it is defined as the overlap of the prepared and ideal states, averaged over the noise’s probability distribution which we assume is symmetric, i.e., ${\rm P}(\delta\theta_{i})={\rm P}(-\delta\theta_{i})$ . One can consider this a Gaussian distribution however that level of detail is not required in what follows. The two noise parameters are then related by

[TABLE]

where $1/2\leq F\leq 1$ . From this we also have

[TABLE]

The first two of these follows from the definition of the fidelity while the third uses the symmetry of the probability distribution. These are the only functions which are averaged in what follows. We assume that the noise on each qubit is independent of the others and average at each stage.

Using these results we express the noisy form of the state, Eq. II, as a mixed state. We obtain

[TABLE]

where we have averaged over the probabilty distribution of $\delta\theta_{i}$ . If $F=1$ it is the relevant pure state. If instead $F=1/2$ , which is the smallest possible value of the fidelity, it is a maximally mixed state, so that maximum noise erases all information about the state. For other values of $F$ , the state varies monotically between these two points. Our interest throughout this paper will be in systems which are close to perfect fidelity.

III Local-adaptive measurement

An important result in multiple-copy state discrimination is that it is possible to reach the Helstrom bound, Eq. 3, using local measurements only. We follow here the scheme of Acín et al Acín et al. (2005) but similar results have been found by others Brody and Meister (1996); Ban et al. (1997). They examine a local and adaptive scheme in which the measurement of the $n$ th copy can depend upon the outcome of measurements on the previous $(n-1)$ copies. We first need to introduce some notation. The sequence of measurement outcomes is represented by a bit string $x$ as long as the number $N$ of qubits, with the $n$ th result labelled $i_{n}$ . The measurement onto the $n$ th qubit is a projector onto the basis

[TABLE]

Here, we use $x$ for the bit string of the first $(n-1)$ results and adopt a different notation when it is required. In the local-adaptive measurement scheme, the measurement at each point depends on the previous outcomes in the scheme however the overall result is determined by the final measurement outcome alone. The optimal scheme of this kind turns out to be Bayesian updating Acín et al. (2005). On the first qubit, one projects onto the eigenvectors of $p_{0}|\psi_{0}\rangle\langle\psi_{0}|-p_{1}|\psi_{1}\rangle\langle\psi_{1}|$ . On the rest, the relevant eigenbasis is instead ${\rm P}(0|x)|\psi_{0}\rangle\langle\psi_{0}|-{\rm P}(1|x)|\psi_{1}\rangle\langle\psi_{1}|$ , in which ${\rm P}(k|x)$ is the probability, calculated from Bayes’ theorem, that the state $|\psi_{k}\rangle$ was prepared given that bit string $x$ is the measurement record. The $\phi_{x}$ for which this measurement satisifies the Helstrom bound is found to be

[TABLE]

The only appearance of the bit string $x$ here is in the single index $i_{N-1}$ , which is the value of the prior measurement. Thus the scheme does not use the entire measurement and is in this sense Markovian as well as Bayesian.

Here we apply the local-adaptive scheme, in the form optimised for pure states, to the mixed states relevant to imperfect preparation. A true Bayesian scheme, one that uses the entire measurement record, would be the best way to generalise the scheme to mixed states. We return to this point later. For now, we are interested in a direct comparison of the pure state schemes and so proceed with the Markovian form.

We begin by showing that this scheme reaches the Helstrom bound in the case of perfect preparation. We use a different approach to that in Ref. Acín et al. (2005) as it does not generalise straightforwardly to include noise. This calculation gives a form for the success probability with $N$ qubits in terms of that for $(N-1)$ qubits, an inductive formula which is solved by the Helstrom bound. We then modify the calculation to include noise. This leads to a different inductive formula, which is then solved to give the overall success probability. In thse calculations, we make repeated use of the result

[TABLE]

for the probability that the $N$ th outcome is $i_{N}$ given that the state $|\psi_{k}\rangle$ was sent and that the initial $(N-1)$ results were $x$ . This is calculated using Eqs. 1 and III.

In the local-adaptive scheme, the identification of the prepared state is made with the final outcome. For this reason, the probability of success is

[TABLE]

This is a sum over both signal states $k=0,1$ and over all bit strings $x$ of length $(N-1)$ , none of which contribute directly to the state identification. We first substitute Eq. 12, with $i_{N}=k$ , into this result to give

[TABLE]

Then next step is to use Eq. 11 for the optimal value of $2\phi_{x}$ in this equation:

[TABLE]

The first sum in this expression is straightforward to evalulate. It is simply a sum over a complete set of possible scenarios and we have $\sum_{x,k}p_{k}{\rm P}(x|k)=1$ . The other series is a little more complicated. We use the usual rules of conditional probability to write

[TABLE]

where we introduce the notation $\dot{x}$ for the bit string of the first $(N-2)$ results. We use also Eq. 12, with $x$ replaced by $\dot{x}$ and $i_{N}$ replaced by $i_{N-1}$ , for the probabilities ${\rm P}(i_{N-1}|\dot{x},k)$ in this equation. Bringing together all of these results, a short calculation reveals

[TABLE]

This should be compared with Eq. 14, in which the same expression occurs but over the final rather than penultimate outcome. This can be used to write the expression as

[TABLE]

After subsituting this into Eq. 15, we are left with the inductive expression

[TABLE]

The general solution to this equation is the multiple-copy Helstrom bound, Eq. 3, which can be verified by direct substitution. The $N=1$ case corresponds to single-copy state discrimination and that bound is derived in the usual manner. That the probability expression has this inductive form follows as the measurement strategy is Markovian. We have followed others in showing that the Helstrom bound can be reached with local measurements only Acín et al. (2005); Brody and Meister (1996); Ban et al. (1997). Our main result in this section is a generalisation of this expression to the regime of imperfect preparation fidelity.

The calculation proceeds in the same manner as that without noise. The difference is in the probability of a specific result $i_{N}$ given that the state $|\psi_{k}\rangle$ was prepared, which changes when the latter is replaced with a noisy state. To take this into account, Eq. 12 is replaced by an equivalent expression calculated using Eqs. 9 and III. The new probability is

[TABLE]

so that the only change in the noisy case is the appearance of the factor $(2F-1)$ here. We use this to derive, in exactly the same manner as in the perfect-fidelity case, the probability of success. The result of this, as might be expected based on the change to the individual probabilities, is simply

[TABLE]

This relation is hardly more complicated than the noisless case, Eq. 19, but its solution is much more complicated. By recursive application of this formula using the $N=1$ case, which can be evaluated analytically, we establish that the solution is

[TABLE]

where we introduce the notation

[TABLE]

This solution can be verified by substitution into the inductive relationship. The series $\mathcal{S}_{N}$ can be evaluated using the usual formulae for geometric progressions. After some algebraic manipulation we find

[TABLE]

Between Eqs. 22 and III, the probability that the local-adpative scheme successfully identifies the state is defined in terms of the preparation fidelity $F$ . In the perfect-fidelity case $F=1$ , substitution shows that $\mathcal{S}_{N}=0$ , and we have that the usual Helstrom bound is achieved. If instead $F=1/2$ , the prepared state is by definition a completely mixed state for both $|\psi_{0}\rangle$ and $|\psi_{1}\rangle$ , so that the states are indistinguishable. For this value of the fidelity, the probability becomes one-half which corresponds to guessing. The other interesting limit is the behaviour of the scheme if there are many copies of the state. We look at this in a later section where we also plot the success probability.

IV Quantum data gathering

The previous measurement scheme is purely local. It produces a classical bit value for each of the resource qubits. It is known that schemes of this type are in general not able to perform optimal state discrimination when the possible states are mixed. One requires collective measurements. Here, we are interested in the ability of these schemes to function in the presence of preparation noise. As an example of one scheme which measures collectively, we consider quantum data gathering Blume-Kohout et al. (2013).

This scheme requires a quantum memory, a qubit which does not decohere between interactions. This probe is initialised in the state $|0\rangle$ . When required, we label this space $\mathcal{H}_{A}$ . The interaction with the first qubit is a SWAP gate. The remaining interactions leave the resource qubits, labelled $S_{i}$ (where $i=1,2,...,N$ ), each in the state $|0\rangle$ and, if there is no preparation noise, leave the probe in one of two states

[TABLE]

in which

[TABLE]

These two states have an overlap $\langle\psi^{(n)}_{0}|\psi^{(n)}_{1}\rangle=\cos^{n}(2\theta)$ , where $n$ is the number of qubits which the probe has interacted with until that point in the scheme. Thus, the probability of success is the Helstrom bound. The protocol works as the product state of the $N$ systems exists in a two-dimensional subspace of the overall Hilbert space. This, of course, no longer holds for mixed states, for which some information about the states will be retained in the resource qubits. The interactions between the probe and resource qubits is a unitary operation which maps this subspace onto the two dimensions of the probe’s space through the index $k$ , which is the only piece of information needed to characterise each state. The unitary operator $U_{n}$ that performs such an operation has the property $U_{n}|\psi_{k}\rangle_{S_{n}}|\psi^{(n-1)}_{k}\rangle_{A}=|0\rangle_{S_{n}}|\psi^{(n)}_{k}\rangle_{A}$ . Alone, this does not span the Hilbert space and we need to include also the state’s components which appear only if the preparation is imperfect. The choice we make is $U_{n}|\psi_{k\perp}\rangle_{S_{n}}|\psi^{(n-1)}_{k\perp}\rangle_{A}=|1\rangle_{S_{n}}|\psi^{(n)}_{k\perp}\rangle$ . The unitary operator, written in the computational basis for both qubits, is

[TABLE]

After all resource qubits have been processed, the qubit is measured with a Helstrom measurement which corresponds to distinguishing $|\psi^{(N)}_{0}\rangle$ from $|\psi^{(N)}_{1}\rangle$ . Again, the quantity we calculate is the probability that this measurement is successful if the prepared qubits are instead mixed states.

The strategy that we use to calculate this probability is to find, by representing the interactions as Kraus operators acting on $\mathcal{H}_{A}$ , the probe’s state at each stage of the protocol. These Kraus operators are derived by considering that the resource qubits are subsequently measured in the computational basis though we sum over both outcomes. This strategy gives us the possibility of considering that such a measurement, which could be used as a diagnostic for the protocol’s behaviour, does occur. The Kraus operators are calculated as $M^{(n)}_{i,k}=\langle i|_{S_{n}}U_{n}|\tilde{\psi}^{n}_{k}\rangle_{S_{n}}$ , where $i=0,1$ and we use the noisy form of the state. As the calculation involves pairs of Kraus operators, at this point we do not average over the noise. The Kraus operators are best expressed in the form

[TABLE]

One way to think about these is objects is that the outcome $M^{(n)}_{0,k}$ indicates that the protocol is running well and conversely for $M^{(n)}_{1,k}$ . This is because the former is the only outcome if the fidelity is perfect. This is seen in the Kraus representation as the action of $M^{(n)}_{0,k}$ is to map the state $|\psi^{(n-1)}_{k}\rangle$ onto $|\psi^{(n)}_{k}\rangle$ , thus preserving the information which is encoded in that basis, whereas the operator $M^{(n)}_{1,k}$ has the opposite effect: by mapping the state $|\psi^{(n-1)}_{k}\rangle$ onto $|\psi^{(n)}_{k\perp}\rangle$ it deletes all the information which has been acquired up to that point. This is the origin of the claim that a subsequent measurement of the prepared qubit can act as a diagnostic. This point is later considered in more detail.

We are now in a position to calculate the density matrix of the probe after $N$ interactions. We assume that all noise is in the state preparation and that the operations are implemented perfectly. At the first step, the sample is swapped with the probe, so that the probe is left in the state

[TABLE]

as was shown earlier (to simplify the notation, we drop the index $k$ from the density operator $\rho$ ). We evaluate the next step in full detail and the result allows us to find, by inspection, the form of the density matrix in general. We multiply by the Kraus operators and then average over $\delta\theta_{i}$ in one step here. It is a straightforward (though longwinded) calculation to find

[TABLE]

where $\sigma^{(2)}_{x}=|\psi^{(2)}_{k}\rangle\langle\psi^{(2)}_{k\perp}|+|\psi^{(2)}_{k\perp}\rangle\langle\psi^{(2)}_{k}|$ . We keep the convention of using a superscript on the Pauli matrix to indicate the basis in which it is written. This density matrix can be understood as two pieces: a trace-one, diagonal piece consisting of the first two terms and another consiting of only the $\sigma_{x}$ matrix. We can expect, based on this, that the same is true of the general density matrix, which we expect can be written

[TABLE]

This is confirmed by the following analysis, in which we evaluate $A_{N}$ and $B_{N}$ by calculating how each piece (diagonal and Pauli) is updated. We again multiply by the Kraus operators and average over $\delta\theta_{i}$ in a single step. The first result is

[TABLE]

Notice that again we find the same structure, that of a diagonal piece and a Pauli matrix. The other update is

[TABLE]

It is seen that both terms contribute in the form, written in the natural basis of the next step, that we have predicted and the density matrix will always take the form of Eq. IV. Repeated application of the above two results allow us to evaluate $A_{N}$ and $B_{N}$ , which are both written in terms of geometric progressions. We find

[TABLE]

It is straightforward to evaluate the summation to give

[TABLE]

which is then used alongside Eq. 25 to evaluate

[TABLE]

The denominators of the two fractions inside the braces could each be simplified however we leave them in this form so that it is clear that there are no convergence issues in the limit $F\rightarrow 1/2$ or $\theta\rightarrow 0$ . Between Eqs. IV, 35 and 36, we have characterised the probe’s density matrix in terms of the fidelity and state parameters only. If $F=1$ , which corresponds to the perfect fidelity case, $A_{N}=1$ and $B_{N}=0$ . This corresponds to the probe being in the pure state $|\psi^{(N)}_{k}\rangle$ , as one would expect. If instead $F=1/2$ , which corresponds to maximum infidelity, then again $B_{N}=0$ however here $A_{N}=1/2$ . This means that the probe is in a maximally mixed state so that it carries no information about the prepared state. This corroborates with the analysis of the similar cases in the local-adaptive scheme.

In the quantum data gathering routine, following the unitary interactions between the probe and all resource qubits, the probe is left in the density matrix that we have calculated. If the fidelity is perfect, this will be one of two possible states, either $|\psi^{(N)}_{0}\rangle$ or $|\psi^{(N)}_{1}\rangle$ . At this stage in the protocol, the probe is then measured with the Helstrom measurement which best distinguishes these states. This is the final piece of the calculation, which gives us the probability ${\rm P}^{qdg}_{N}$ that the scheme succeeds. Helstrom’s conditions tell us that the best measurement is a projector onto the eigenvalues of $p_{0}|\psi^{(N)}_{0}\rangle\langle\psi^{(N)}_{0}|-p_{1}|\psi^{(N)}_{1}\rangle\langle\psi^{(N)}_{1}|$ . The case $p_{0}\neq p_{1}$ is significantly more involved without adding further understanding. For this reason we restrict our attention to equiprobable preparation $p_{0}=p_{1}=1/2$ for this scheme. The relevant eigenvectors are

[TABLE]

where the subscript $\pm$ indicates an associated eigenvalue of $\lambda=\pm 1$ . It is the uppermost of these which corresponds to the correct outcome. The success probability derived from this

[TABLE]

The Helstrom bound is written in a form useful here as ${\rm P}^{H}_{N}=(1+\sin(2\theta_{N}))/2$ . We see that ${\rm P}^{qdg}_{N}$ is leading order in the Helstrom bound (once $A_{N}$ and $B_{N}$ are entered), followed by terms which are linearly and inversely proportional to that object. This structure is similar to the equivalent expression for the local-adaptive measurement scheme. Eqs. 35, 36 and 38 together define the probability of success for the quantum data gathering.

V Discussion

In Fig. 1 we plot, as a function of the number $N$ of resource qubits, the probability of failure for both local-adaptive measurements and quantum data gathering, alongside a majority voting fixed measurement scheme, for two values of the angle $\theta$ and the fidelity $F$ . For now we focus on the former two schemes. In both cases, we have used equiprobable preparation $p_{0}=p_{1}=1/2$ . Despite the range of values, some broad features emerge. We comment on the many-copy limit, in which both quantitites converge upon the same value, below. What is relevant at this point is that, in all cases, the local scheme approaches this limit with fewer qubits than the collective scheme. This improvement is small enough, in the fourth or fifth decimal place for some cases, that it is probably not experimentally significant. Nonetheless, we have shown that one property, resilience to noise, is improved by measuring locally.

The third scheme plotted in Fig. 1 is a majority voting scheme in which the Helstrom measurement is performed on each qubit and the most common outcome in the measurement record is the overall outcome. There is no simple analytic expression for the success probability but it is straightforward to find numerically Higgins et al. (2009). The Helstrom measurement is that for discriminating the two mixed states, rather than the original pure states, though this will be the same for equal priors. Thus, this measurement scheme takes into account both the whole measurement record and the noise in the preparation. In general, we find that this simple generalisation is enough to outperform the other two schemes. In particular, it is not limited by the same asymptotic behaviour as those schemes. This turns out not to be true if there is only a small amount of noise, as can be seen in the graph with $F=0.999$ and $\theta=\pi/12$ . The majority voting scheme will not reach the multiple-copy Helstrom bound in any case. As the fidelity becomes closer to one, the two previously analysed schemes will become closer to the genuine optimal scheme, hence they perform better for moderate $N$ in the high-fidelity case.

Special attention should be paid to the many-copy limit of both schemes. Interestingly, one finds the same value in both cases:

[TABLE]

In the limit $F=1$ , this equation reaches unity and so the states can be perfectly discriminated given an infinite number of copies. If instead the two states are the same $\theta=0$ , then we find a probability of one-half. This makes sense as it should be impossible to distinguish two equal states and all that can be done is to guess. These two limits are non-commuting. This occurs because the measurement schemes are ill-defined when discriminating equal states, i.e., the unitary operation for quantum data gathering would need to map two orthogonal states onto the same state, which is clearly not possible.

That both schemes reach the same many-copy limit, which in general is less than unity, is intriguing. It suggests that there is a systematic error which arises when applying a state discrimination scheme to the wrong pair of states, which cannot be overcome by increasing the number of resource qubits.

The specific form of the many-copy limit can be calculated in a different manner, by understanding the behaviour of the local-adaptive measurement scheme in such a regime. Analysing this behaviour also helps in improving intuition of that scheme. Inspection of Eq. 11 reveals that the scheme in this case can be understood as hypothesis checking. If the outcome on one qubit suggests that $|\psi_{0}\rangle$ was the prepared state, the next measurement will be onto the basis $|\psi_{0}\rangle,|\psi_{0\perp}\rangle$ , with the latter outcome associated with a preparation of $|\psi_{1}\rangle$ . This explains why the strategy cannot perfectly discriminate. When applied to mixed states, neither measurement outcome is impossible. This hypothesis-checking scheme can be used to calculate the probability of success. We assume that the strategy of hypothesis checking is used for an infinite number of qubits. We find agreement with the original calculation. Two probabilities are required. Firstly,

[TABLE]

is the probability of finding outcome $a$ , given that the previous outcome was $a$ (so that the measurement at this stage is $|\psi_{a}\rangle,|\psi_{a\perp}\rangle$ ), given that $|\psi_{a}\rangle$ was sent. We require also

[TABLE]

which is the probability of outcome $a$ (i.e., the state $|\psi_{\overline{a}\perp}\rangle$ ) given that the previous measurement gave the outcome $\overline{a}$ , the other possible state, and that $|\psi_{a}\rangle$ was sent. In terms of these objects, the probability of success on the $(N+1)$ th qubit is written in terms of the probability of success on the $N$ th qubit:

[TABLE]

This result is then used iteratively to find an expression for the probability of success after $N^{\prime}$ more measurements:

[TABLE]

Finally, as $N^{\prime}$ is increased the first term will be suppressed, and in the limit of an infinite number of copies, the probability of success at a given point makes no contribution to the overall expression. All that remains is to evaluate the geometric summation and to rearrange for

[TABLE]

the same value which was found previously. Here, it has been found with a different method to the more general case. Unfortunately, a similar method for confirming the calculation does not exist for quantum data gathering, however inspection of the unitary Eq. IV reveals similar behaviour. In the many-copy limit there, the probe states become the diagonal basis states $|+\rangle,|-\rangle$ . One example of the behaviour of the unitary in this regime is $U_{N}|\psi_{0}\rangle_{S_{N}}|+\rangle_{A}=|0\rangle_{S_{N}}|+\rangle_{A}$ , so that all the unitary has done is to delete the resource qubit’s information conditioned upon it matching what is already known.

We have considered here only preparation noise. In the quantum data gathering scheme, there will also be noise in the gates needed to implement the unitary Eq. IV. This operation takes the form of a rotation controlled upon binary addition of the register of each individual qubit, which can be implemented by two CNOT gates alongside single-qubit gates. Thus, $2N$ two-qubit gates are needed to perform quantum data gathering on a resource of $N$ qubits. We assume that the contribution to the noise from single qubit gates is negligible. Because the diamond norm Aharanov et al. (1997); Aliferis et al. (2006); Aharanov and Ben-Or (2008), the standard measure of gate noise, satisfies the triangle inequality, that there are $2N$ gates required means that the total gate noise scales linearly with $N$ . This will appear as a noisy channel acting upon the probe’s state, and decrease further the probability of success. To make further comments, we would need to understand the form of the noise in more detail Sanders et al. (2016).

Methods to improve the schemes exist in each case. For the local measurement scheme, we have neglected the entire measurement record and made a decision based only on the final measurement made. As discussed above, the local adaptive scheme converges to a measurement which, in the pure state case is in a basis along and orthogonal to the prepared state (the “fully biased” measurement Higgins et al. (2011)). Thus in the limit, we always obtain the outcome verifying the current guess, and the probability of error approaches zero exponentially. This procedure, however, is not robust to noise: as soon as there is any finite probability of error in measurement, whether due to imperfect measurement or noisy preparation, the Markovian scheme results in a non-zero probability of error, which is simply the probability of error in any given round. This can be improved by referring back to the whole measurement record. If we have measured $N$ systems, and $N-1$ gave the result $1$ , while the $N$ th gave the result [math], we could reasonably infer that an error occured in the $N$ th round, and return outcome $1$ . Majority vote on the measurement record thus may be expected to recover an exponential decay in the probability of error. Note however that, although the probability of error at any given $N$ is given by Eq. 39 in the large $N$ limit, the measurement performed at each step depends on the outcome of the previous measurement. If each measurement were the same, the measurement record would be a sequence of independent and identically distributed (i.i.d.) random variables, and we could use classical statistical techniques, specifically the classical Chernoff bound Higgins et al. (2011); Cover and Thomas (2006) to argue that the probability of error indeed decays exponentially. Indeed, this is verified by the numerically evaluated function in Fig. 1. As the measurement record is not i.i.d., this is not possible, and more sophisticated techniques are needed. However, we know that in this limit the measurement switches between the two possible fully biased measurements, and an upper bound on the probability of error is given by considering just the measurement outcomes for that measurement which occurs more often. This tells us that the probability of error decays with an error exponent given by the classical Chernoff bound for this measurement, and with the exponent modified by the fraction of the total measurement record considered. Further, previous work Higgins et al. (2011) shows that for very high fidelities, the fixed measurement giving the best error exponent in the asymptotic limit is close to fully biased. For fidelities less than around $99\%$ , the optimal fixed measurement becomes unbiased Higgins et al. (2011).

In quantum data gathering, the scheme can be modified by measuring the resource qubits at each step and acting based upon the outcome. As noted, an outcome of $|1\rangle$ indicates that all quantum information gathered up until that point has been lost. So, one way to modify the protocol is to restart whenever such an outcome occurs. Some care must be taken as only a finite number of consecutive $|0\rangle$ outcomes can occur before a bad outcome. In Fig. 1 it is seen that only small number, three or four, of interactions are required to get very close to the best-possible probability. However, numerical evaluation of the relevant probabilities reveals that even this small number is unlikely enough (while still being highly probably, $p\geq 0.97$ typically) to bring the overall probability of state discrimination below that which occurs if the qubits are not measured. This is played off against two things. Firstly, success here is heralded at the expense of increasing ambiguity in some cases, similar to unambiguous state discrimination. Secondly, if there are many resource qubits available, a small run of successes becomes likely to occur at some point. Thus, in some scenarios it may be advantageous to post-select based on the measurement outcomes. A hybrid scheme in which subsets of systems are measured collectively, followed by majority voting on the measurement output would give an improved probability of success, but still less than the local scheme. Here we chose to evaluate the performance of a scheme requiring a single qubit of memory as the quantum probe. A fully general scheme, which would achieve the optimal Helstrom measurement for arbitrary many-copy states, would require a processor of size $\log N$ Blume-Kohout et al. (2013). Our results show that how a collective measurement is implemented has a considerable effect on its robustness to noise.

VI Conclusion

We have considered the ability of two multiple-copy state discrimination schemes to perform when the state preparation is imperfect. We find two surprising results. Firstly, that for small amounts of uncharacterised noise, the optimal local adaptive measurement is more robust than the simple, single qubit collective scheme. With a simple modification of the scheme to take into account the whole measurement record, we obtain a protocol which performs better than fixed, unbiased local measurements and majority vote for small $N$ , and retains the desirable exponential decay of error probability in the limit of large $N$ , without requiring prior knowledge of the amount of noise. We also find that both schemes have the same many-copy limit, which is less than unity. Despite the different physical mechanisms used in each scheme, they have precisely the same behaviour in this regime. This suggests that the quantity found is a generic property of applying an incorrect scheme, and should be investigated further.

It would be useful to know an optimal state discrimination scheme for mixed states of the type considered here. A natural starting point would be to generalise the local-adaptive scheme to use the entire measurement record when updating the prior probabilities in a Bayesian manner and to calculate the range, if any, in which this strategy is optimal. In general, more analytic work is required in multiple-copy state discrimination. Some of the techniques used here may be found to be useful in that task.

Acknowledgements.

This work was supported by the UK Engineering and Physical Sciences Research Council and by the Royal Society (RP150122).

Bibliography33

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Barnett and Croke (2009) S. M. Barnett and S. Croke, Adv. Opt. Photon. 1 , 238 (2009).
2Bae and Kwek (2015) J. Bae and L.-C. Kwek, J. Phys. A: Math. Theor 48 , 083001 (2015).
3Bergou (2010) J. Bergou, Mod. Opt. 57 , 160 (2010).
4Barnett (2009) S. M. Barnett, Quantum Information (Oxford University Press, Oxford, 2009).
5Chefles (2010) A. Chefles, Contemp. Phys. 41 , 401 (2010).
6Helstrom (1976) C. W. Helstrom, Quantum detection and estimation theory (Academic, 1976).
7Ha and Kwon (2013) D. Ha and Y. Kwon, Phys. Rev. A 87 , 062302 (2013).
8Weir et al. (2017) G. Weir, S. M. Barnett, and S. Croke, Phys. Rev. A 96 , 022312 (2017).