Quantum-enhanced learning of rotations about an unknown direction
Yin Mo, Giulio Chiribella

TL;DR
This paper introduces a quantum machine learning approach for rotating a quantum bit about an unknown axis, demonstrating quantum memory advantages over classical methods for finite spin values.
Contribution
It shows that quantum memory of logarithmic size outperforms any classical memory in learning rotations about an unknown direction.
Findings
Quantum memory of O(log j) qubits surpasses classical memory performance.
Quantum advantage persists for all finite j and limited access times.
Provides a benchmark for experimental demonstration of quantum learning.
Abstract
We design machines that learn how to rotate a quantum bit about an initially unknown direction, encoded in the state of a spin-j particle. We show that a machine equipped with a quantum memory of O(log j) qubits can outperform all machines with purely classical memory, even if the size of their memory is arbitrarily large. The advantage is present for every finite j and persists as long as the quantum memory is accessed for no more than O(j) times. We establish these results by deriving the ultimate performance achievable with purely classical memories, thus providing a benchmark that can be used to experimentally demonstrate the implementation of quantum-enhanced learning.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Quantum-enhanced learning of rotations about an unknown direction
Yin Mo
Department of Computer Science, The University of Hong Kong, Pokfulam Road, Hong Kong
Giulio Chiribella
Department of Computer Science, The University of Hong Kong, Pokfulam Road, Hong Kong
Department of Computer Science, University of Oxford, Wolfson Building, Parks Road, Oxford, UK
Abstract
We design machines that learn how to rotate a quantum bit about an initially unknown direction, encoded in the state of a spin- particle. We show that a machine equipped with a quantum memory of qubits can outperform all machines with purely classical memory, even if the size of their memory is arbitrarily large. The advantage is present for every finite and persists as long as the quantum memory is accessed for no more than times. We establish these results by deriving the ultimate performance achievable with purely classical memories, thus providing a benchmark that can be used to experimentally demonstrate the implementation of quantum-enhanced learning.
††preprint: APS/123-QED
I Introduction
Quantum machine learning Biamonte et al. (2017); Dunjko and Briegel (2018) explores the interface between machine learning and quantum information science. On the one hand, quantum algorithms have been shown to offer speedups to a variety of classical machine learning tasks Aïmeur et al. (2006); Harrow et al. (2009); Rebentrost et al. (2014); Rønnow et al. (2014); Wiebe et al. (2014); Dunjko et al. (2016); Amin et al. (2018). On the other hand, ideas from machine learning stimulated the formulation of new quantum tasks, such as quantum state classification Sasaki et al. (2001); Sasaki and Carlini (2002); Guţă and Kotłowski (2010); Sentís et al. (2012), quantum learning of gates Bisio et al. (2010); Marvian and Lloyd (2016); Sedlák et al. (2019) and measurements Bisio et al. (2011); Sentís et al. (2012).
An important component of any learning machine is its internal memory, wherein information gathered from the environment is stored. A machine equipped with purely classical memory can only gather information through measurements, and can only perform conditional operations controlled by classical data. In contrast, a machine equipped with a quantum memory can gather information by interacting coherently with its environment, and can perform operations that are controlled by quantum data. A fundamental question is whether the additional freedom offered by the quantum memory can enhance the learning performance.
A task where quantum memories are known to enhance the performance is quantum cloning Scarani et al. (2005), which can be rephrased as the task of learning how to prepare copies of an unknown quantum state by gathering sample copies of it. In this task, a machine with a quantum memory can achieve strictly higher accuracy than all machines with a purely classical memory Bužek and Hillery (1996); Gisin and Massar (1997); Werner (1998).
A strikingly different situation occurs in the task of learning how to perform unitary gates. In this case, quantum memories can enhance the performance of probabilistic learning machines Nielsen and Chuang (1997); Vidal et al. (2002); Hillery et al. (2002); Vidal et al. (2002); Brazier et al. (2005); Ishizaka and Hiroshima (2008); Bartlett et al. (2009); Sedlák et al. (2019), but the enhancements observed so far disappear if the machines are required to approximate the desired gate with unit probability Bisio et al. (2010). The reason for such behaviour is that the learning machines considered so far were designed to perform groups of unitary gates, such as the group of all qubit rotations, or the group of qubit rotations about a fixed axis. In these highly symmetric scenarios, a general theorem by Bisio *et al * Bisio et al. (2010) implies that every quantum machine operating with unit probability can be replaced by a machine that achieves the same learning accuracy with a purely classical memory. Given the generality of this result, one may be tempted to conjecture that quantum memories are of no use for deterministically learning unitary gates. Such conjecture would be consistent with Nielsen and Chuang’s no-programming theorem Nielsen and Chuang (1997), which implies that whenever a set of unitary gates can be perfectly encoded into a set of quantum states, the states in the set must be orthogonal, and therefore storable in a purely classical memory.
In contrast with the above observations, here we show that quantum memories can generally enhance the performance of deterministic machines attempting to learn unitary gates. To make this point, we provide a concrete example where the optimal deterministic learning strategies, with and without quantum memory, can be determined explicitly. Specifically, we consider machines that learn how to rotate a quantum particle by a desired angle about an initially unknown direction , imprinted in the state of a spin- particle. We consider two different ways of imprinting the direction in a spin- particle, corresponding to the following scenarios:
- •
*Scenario 1: spin relaxation. * A static magnetic field , pointing in an unknown direction , acts in a certain region of space. A spin- probe enters the region and undergoes a thermalisation process with respect to the magnetic dipole Hamiltonian , where are the spin operators, , and is a suitable constant. For simplicity, we assume that the temperature is low enough that the thermal state is approximately the ground state of the Hamiltonian, namely the eigenstate of the operator with maximum eigenvalue , hereafter denoted as . An extension to thermal states at finite temperature will be discussed in Section VI.
- •
Scenario 2: action of an unknown unitary gate. A black box implements an unknown rotation , which transforms the -axis into the direction . A machine prepares a spin- probe in an initial state (possibly depending on the desired rotation angle), and sends the probe as input to the black box. After the action of the black box, the state of the probe is , where is the unitary matrix representing the action of the rotation .
Scenario 2 is also relevant to the study of quantum reference frames Bartlett et al. (2007). Our learning task can be translated into a distributed quantum protocol involving two distant parties, Alice and Bob, who do not share a reference frame for spatial directions. The goal of the protocol is to allow Bob to rotate a target particle by a desired angle about the direction of Alice’s -axis, encoded in the state of a spin- particle prepared by Alice and sent to Bob as a token of her reference frame. In this setting, the unknown rotation describes the mismatch between Alice’s and Bob’s Cartesian axes, and the optimal learning strategy provides the optimal protocol for encoding the direction of Alice’s -axis in a spin- particle and for rotating Bob’s target particle accordingly.
A key difference between Scenarios 1 and 2 is that the initial state of the probe is irrelevant in Scenario 1 (every initial state is reset to the state ), while it can be optimised in Scenario 2. More generally, the optimal probe state in Scenario 2 could be an entangled state involving, in addition to the the spin- particle, an auxiliary system stored in the internal memory of the machine. Nevertheless, we will show that such auxiliary system does not increase the accuracy in the execution of the desired rotation, and therefore it can be omitted without loss of generality.
In this paper we establish the optimal learning strategies for both Scenarios 1 and 2, focussing on the case where the target particle is a qubit. A summary of the key result is as follows. For , we find that the optimal strategies for Scenarios 1 and 2 coincide. In both cases, the optimal learning strategy consists in
preparing the probe in the initial state , the eigenstate of with maximum eigenvalue 2. 2.
imprinting the direction in the probe, and storing the resulting state in a quantum memory of qubits 3. 3.
retrieving the probe’s state from the memory and letting it interact with the target through the isotropic Heisenberg interaction , where are the Pauli matrices for the target qubit.
Notably, the structure of the optimal learning machine is independent of the desired rotation angle : a single probe state and a single interaction Hamiltonian work optimally for all possible angles. The rotation angle only affects the interaction time between the probe and the target.
For every , we prove that the optimal machine with quantum memory outperforms every machine with purely classical memory. We determine the optimal fidelity over all machines with purely classical memory, providing a benchmark that can be used to demonstrate the advantage of quantum memories in realistic experiments. For example, we show that the optimal classical strategy for and has fidelity , while the optimal quantum strategy has fidelity of . As a consequence, every experimental fidelity above guarantees the demonstration of quantum-enhanced learning. In general, we show that a non-zero quantum advantage is present for every rotation angle and for every . We also prove that the advantage persists even if the memory is accessed multiple times, as long as the number of accesses to the memory is . In Scenario 1, we find that the quantum advantage persists at non-zero temperature , as long as the magnetic energy is large compared to the thermal fluctuation , being the Boltzmann constant.
For , we find out a striking difference between Scenarios 1 and 2. In Scenario 1, the quantum memory offers an advantage for all possible rotation angles. In Scenario 2, the advantage disappears when the rotation angle approaches . In that regime, the optimal strategy consists in
preparing the probe in the initial state , the eigenstate of with eigenvalue , 2. 2.
sending the probe to the unknown gate , and measuring the resulting state on the basis , where is the eigenstate of with eigenvalue , 3. 3.
conditionally on outcome , performing a spin flip around the -axis.
For , the optimal strategies for Scenarios 1 and 2 coincide, and the availability of a quantum memory offers advantages for all rotation angles except and .
The paper is structured as follows. In Section II we introduce the problem of learning a rotation about an unknown direction, considering two alternative ways of imprinting the direction into the state of a spin- probe. We derive the optimal quantum strategy in Section III, and the corresponding quantum benchmark in Section IV. In Section V, we show that the advantage persists even if the memory state is accessed multiple times, and in Section VI, we show that the optimal learning strategy for Scenario 1 is robust to thermal noise. In Section VII, we extend our results from qubits to systems of arbitrary dimensions. The conclusions are drawn in Section VIII.
II Learning how to rotate about an unknown axis
In this section we introduce the task of learning how to rotate a quantum particle about an initially unknown axis. We consider two scenarios, in which the unknown axis is imprinted in a quantum probe via two physically different processes: (1) spin relaxation, and (2) action of an unknown rotation gate. We formalise the optimisation problems corresponding to these scenarios and establish a relation between the corresponding solutions.
II.1 Scenario 1: learning from a relaxation process
Suppose that a static magnetic field is turned on for a limited amount of time in a bounded region of space. While the field is turned on, a spin- particle is placed in the region and undergoes a relaxation process, whereby its spin becomes aligned with the field’s direction. After the alignment has taken place, the state of the particle is stored in the internal memory of a quantum machine, which will later use it to rotate a target particle by a desired angle about the direction .
We denote the spin- particle as , and let and be its spin operators, satisfying the commutation relations , , and . All throughout the paper the standard notation (respectively, ) for the eigenstate of the operator (respectively, ) with eigenvalue .
The alignment of the magnet with the external magnetic field can be described by a thermalisation process, whereby the initial state of the magnet converges to the thermal state of the magnetic Hamiltonian , where is a suitable constant. For simplicity, we will assume that the temperature of the bath is low enough that the thermal state is approximately the ground state of . Explicitly, the ground state is the spin coherent state .
Overall, the alignment process can be modelled as a quantum channel (completely positive trace-preserving map) that resets every state of the probe to the state . In Section VI we will extend our discussion to the finite-temperature scenario, where the channel resets the probe state to the thermal state of the magnetic dipole Hamiltonian.
The goal of the quantum machine is to rotate a target particle by a given angle about the direction . We will mostly focus on the case where the target is a spin- particle, regarded as a qubit. In this case, we denote the target rotation by , where are the three Pauli matrices, and .
To learn how to implement the target rotation, the machine will transfer information from the magnet to its internal memory . Mathematically, this operation is described by a quantum channel (completely positive trace-preserving map) transforming states of into states of . To be completely general, we allow the channel to depend on the desired angle . If the memory is classical, the channel represents a measurement on the magnet, followed by the storage of the outcome in the memory. If the memory is quantum, the channel can be any process transforming states of the magnet into states of the memory.
When asked to perform the target rotation, the machine will retrieve information from its internal memory, and will use such information to control the evolution of the target system, hereafter denoted by . If the memory is classical, the control amounts to a conditional operation on the target depending on the classical data stored in the memory. If the memory is quantum, the control can be any general interaction between the memory and the target system. In both cases, the control operation can be described by a quantum channel transforming joint states of the composite system into states of .
Overall, the structure of the learning process is depicted in Figure 1.
If the initial state of the target is , then the final state is
[TABLE]
where is the effective quantum channel transforming joint states of the probe and the target into states of the target alone.
To evaluate the accuracy of the learning process, we compare the output state with the desired output . As a figure of merit, we use the average input-output fidelity Gilchrist et al. (2005)
[TABLE]
where is the rotationally-invariant probability distribution on the unit sphere, is the initial state of the target qubit, and is the unitarily invariant probability distribution on the pure states. The associated optimisation problem is:
Problem 1
Find the quantum channel that maximises the fidelity in Equation (2).
The optimisation can be performed with different constraints on the channel , corresponding to different assumptions on the machine’s internal memory. In this paper, we will consider two cases:
The machine is equipped with a quantum memory of qubits. In this case, the channel is an arbitrary completely positive trace-preserving map. 2. 2.
The machine is equipped with a classical memory of arbitrary size. In this case, the channel must be decomposable into a measurement on the probe followed by a conditional operation on the target.
We will carry out both optimisations and compare the maximum fidelity achievable with a quantum memory with the maximum fidelity achievable with classical memories of arbitrary size.
II.2 Scenario 2: learning from a rotation gate
Consider the following general problem. A quantum machine has access to one use of a black box implementing some unknown unitary gate , randomly drawn from some set . By interacting with the black box, the machine has to learn how to perform another unitary gate , acting on a target system . Typically, the gate learning problems considered so far correspond to the case (the machine attempts to emulate the gate Bisio et al. (2010); Marvian and Lloyd (2016); Sedlák et al. (2019)), or to the case (the machine attempts to invert the gate Bartlett et al. (2009); Bisio et al. (2010)). In general, the relation between and can be arbitrary.
To learn the target gate, the machine sends a probe to the black box. In general, the probe can be entangled with an auxiliary system , stored in the machine’s internal memory. If the initial state of the composite system is , then the state after the action of the black box is
[TABLE]
where denotes the identity operator on the auxiliary system.
After the black box has acted, the probe returns to the machine, which transfers information from the state to its internal memory . The transfer of information is described by a quantum channel with input system and output system . Overall, the imprinting of the parameter in the machine’s memory is called the training phase. Accordingly, we call the training gate.
After the training phase has been concluded, the machine will be asked to perform the gate on the target system. We call this phase the execution phase. The machine will access its internal memory and use it to control the evolution of the target system. The control mechanism is described by a quantum channel with input system and output system .
We call the above scenario the -to- learning problem. Its overall structure is summarised in Figure 2. The temporal separation between the training phase and the execution phase makes the -to- learning problem distinct from the problem of simulating the gate using the gate as a resource Chiribella et al. (2008); Bisio et al. (2014); Chiribella et al. (2015); Yang et al. (2017); Miyazaki et al. (2017); Quintino et al. (2018). In that problem, the gate is simulated by an arbitrary circuit using the gate , not necessarily a circuit of the form depicted in Figure 2.
A general result by Bisio et al concerns the case where the set is a group and the mappings and (or ) are two unitary representation of the group . In this scenario, the authors showed that the optimal learning performance can be achieved with a purely classical memory Bisio et al. (2010). In this paper, we present an instance of -to- learning problem that evades Bisio et al no go theorem. In our scenario, is a rotation , the probe is a spin- particle , the training gate is the unitary gate that implements the rotation on the probe, the target is a qubit, and the target gate is the rotation defined by
[TABLE]
where is a fixed, but otherwise arbitrary angle, is the 2-by-2 unitary matrix representing the rotation , and is the 2-by-2 matrix representing a rotation by about the -axis. Since the rotation angle is fixed, the target operations do not form a group, and therefore our learning problem falls outside the hypotheses of Bisio et al’s no go theorem.
The -to- learning problem is also relevant to the study of quantum reference frames Bartlett et al. (2007). Suppose that two distant parties, Alice and Bob, do not share a reference frame for directions. This means that Bob’s Cartesian axes , and are related to Alice’s Cartesian axes , and by an unknown element of the rotation group , namely , for all . Now, imagine that Bob wants to rotate a qubit by an angle about the direction of Alice’s -axis. To assist Bob in this task, Alice will send him a quantum system carrying information about her reference frame. If the transmitted system is a spin- particle, prepared by Alice in the state , then Bob will receive the particle in the state , owing to the mismatch of their reference frames. Using the state as a resource, Bob can attempt to execute the desired rotation, corresponding to the unitary gate . More generally, Alice could send Bob a spin- particle together with an auxiliary particle whose state space is invariant under rotations. In this case, Bob will receive the state , where is the initial state of the spin- and the auxiliary particle. In this setting, the search for the optimal communication protocol between Alice and Bob is equivalent to the search of the optimal learning strategy for the -to- learning problem.
A diagrammatic representation of the -to- learning problem is provided in Figure 3. The spin- probe and the auxiliary system start off in the state . Then, the probe is sent through the gate . After the action of the gate , the probe and system will be in the state
[TABLE]
where is the identity on the auxiliary system’s Hilbert space. Then, the state is encoded in the machine’s memory via a channel . In the execution phase, the machine will perform a quantum channel , transforming the input state of the memory and the target into the output state of the target.
The average fidelity for -to- learning task is
[TABLE]
where is the normalized Haar measure over the rotation group, and is the effective channel transforming states of the composite system into states of .
This leads to the following optimisation problem:
Problem 2
Find the auxiliary system , the input state , and the channel that maximise the fidelity in Equation (6).
Problem 2 reduces to Problem 1 of the previous subsection if system is trivial, and if the initial state is the spin coherent , independently of . In this case, the state (5) inputted in the quantum machine is the spin coherent state , where is the rotated -axis , . Since the rotation is chosen at random according to the Haar measure, the direction is distributed uniformly over the unit sphere (see e.g. Section 4.1 of Holevo’s textbook Holevo (2011)). Hence, one has
[TABLE]
Hence, the fidelities and coincide when the input state is the spin coherent state . Under this condition, both fidelities and are maximised by the same quantum channel .
III Optimal quantum strategies
Here we determine the optimal quantum strategies for learning rotations around an unknown direction. We solve Problems 1 and 2 defined in the previous section for all values of the spin and for all values of the rotation angle . For , we show that the optimal state for Problem 2 is the spin coherent state , and therefore the optimal fidelity coincides with the optimal fidelity for Problem 1. In both problems, the best approximation of the target rotation is realised by setting up an isotropic Heisenberg interaction between the target and the probe. For and , we find some curious features of the optimal strategies. Notably, the optimal solution of Problem 2 deviates from the optimal solution of Problem 1 for when the rotation angle approaches .
III.1 Structure of the optimal solution of Problem 2
Here we focus on Problem 2 and determine the structure of its optimal solution. The main result is the following theorem:
Theorem 1
The optimal strategy for learning the target gate from the training gate has the following features:
no auxiliary system is needed 2. 2.
the optimal input state is an eigenstate of 3. 3.
the optimal quantum channel is rotationally covariant, namely
[TABLE]
where and are the quantum channels induced by the unitary gates and , respectively.
The theorem follows from two lemmas:
Lemma 1
No auxiliary system is needed in the optimal strategy for learning the gate from the gate . The optimal input is an eigenstate of the -component of the angular momentum.
**Proof. **Note the target gate satisfies the relation for every rotation around the axis. Then, the fidelity (6) can be rewritten as
[TABLE]
where we used the shorthand notation , and we derived the second equality from the invariance of the Haar measure with the change of variables . Defining the average state
[TABLE]
and its rotated version \langle\phi_{\theta}\rangle_{k}=\big{(}U_{k}^{(j)}\otimes I_{\rm A}\big{)}\,\langle\phi_{\theta}\rangle\,\big{(}U_{k}^{(j)}\otimes I_{\rm A}\big{)}^{\dagger}, the fidelity can be expressed as
[TABLE]
Since is the average of over all rotations about the axis, it can be expressed as
[TABLE]
where is a probability distribution, and each is a pure state of the auxiliary system. Since the fidelity is linear in the input state, the optimal choice is to pick one of the terms in the mixture, such as . Moreover, the state of the the auxiliary system can be absorbed in the definition of the channel . This concludes the proof that the optimal input state can be chosen to be without loss of generality and that no auxiliary system is needed.
Consistently with the above result, we will omit the auxiliary system from now on.
Lemma 2
The optimal channel for learning the gate from the gate can be chosen to be covariant without loss of generality.
**Proof. **The optimality of covariant channels follows from the following chain of equalities:
[TABLE]
having defined in the second equality, and \mathcal{C}_{\theta}^{\prime}:=\int{\rm d}g\,\mathcal{U}_{g}^{\dagger}\mathcal{C}_{\theta}\big{(}\mathcal{U}_{g}^{(j)}\otimes\mathcal{U}_{g}\big{)} in the third equality. Since is covariant, the above equality shows that every channel can be replaced by a covariant channel with exactly the same fidelity.
Covariant channels have the same performance for all possible training gates. Hence, for a covariant channel the fidelity can be rewritten as
[TABLE]
III.2 Choi operator formulation
Theorem 1 guarantees that the optimal input state for learning the gate from the gate is an eigenstate of . Let us denote it generically as , for some between and , possibly depending on the rotation angle . In the following we will search for the optimal value and for the optimal covariant channel .
First of all, we rewrite the average fidelity as
[TABLE]
where is the entanglement fidelity Horodecki et al. (1999), defined as
[TABLE]
being the canonical maximally entangled state and denoting a reference qubit, entangled with the target qubit. In turn, the entanglement fidelity can be expressed as
[TABLE]
where is the rotated maximally entangled state
[TABLE]
and is the Choi operator Choi (1975)
[TABLE]
being a reference system of dimension , () being the identity map on the reference system (), and being the canonical maximally entangled state in dimension .
The problem is to maximise the fidelity (17) over all Choi operators of covariant channels. The set of the possible Choi operators is characterised by the following three conditions:
Covariance Chiribella et al. (2009): for all rotations (here and denote the entry-wise complex conjugates of the matrices and , respectively.) 2. 2.
Positivity: is positive semidefinite, denoted as 3. 3.
Trace preservation: , where denotes the trace over the output, and denotes the identity over the input.
We now put the above conditions in a form that is convenient for optimization.
Covariance. The covariance condition can be further simplified using the fact that complex conjugate representations of the rotation group are unitarily equivalent. Defining the operator
[TABLE]
the covariance condition becomes
[TABLE]
At this point, the total Hilbert space can be decomposed into orthogonal subspaces, corresponding to different values of the total angular momentum. Specifically, the angular momentum takes values and , and the total Hilbert space is decomposed as
[TABLE]
Relative to this decomposition, using Schur’s lemmas and the covariance condition (21), the operator can be written as:
[TABLE]
where is the projection on the factor with total angular momentum , and are complex coefficients, and is a complex 2-by-2 matrix.
Positivity. The positivity of the operator is equivalent to the positivity of the coefficients and of the matrix .
Trace preservation. The condition of trace preservation can be conveniently expressed in terms of the real coefficients and of the complex matrix . Indeed, tracing over the output, we obtain
[TABLE]
for a suitable choice of basis . Using Eq. (24), the trace preservation condition becomes
[TABLE]
Figure of merit. In terms of the operator , the entanglement fidelity can be expressed as
[TABLE]
The expression can be further simplified by decomposing the state on the subspaces of Eq. (22). After a bit of labor with the Clebsch-Gordan coefficients, we find the decomposition
[TABLE]
with
[TABLE]
Using the above decomposition, the entanglement fidelity can be expressed as
[TABLE]
to be maximized over all positive coefficients and , and over all non-negative matrices satisfying the constraint (25).
Lemma 3
The matrix can be chosen to be rank-one without loss of generality, namely for some suitable vector .
**Proof. **The entanglement fidelity depends on the matrix through the matrix element . Now, one has the chain of inequalities
[TABLE]
the second inequality following from the fact that is positive.
The first inequality holds with the equality sign when the phase of the complex number is equal to the phase of the complex number . The second inequality holds with the equality sign if is rank-one. In particular, the upper bound is attained by the rank-one matrix with and .
Since the normalization constraint (25) involves only the diagonal matrix elements of , the matrix can be replaced by the matrix without loss of generality.
The proof of the above lemma shows that the optimal entanglement fidelity has the form
[TABLE]
with . The maximum of the fidelity (31) under the constraints (25) can be determined with the method of Lagrange multipliers. In the following we present the result of the maximization, leaving the details to Appendix A.
III.3 Optimal quantum strategy for
For , it turns out that Problems 1 and 2 have the same optimal solution:
Theorem 2
*When , the optimal probe state for learning the gate from the gate is for every value of . For both Problems 1 and 2, optimal average fidelity over all pure input states is *
[TABLE]
and has the asymptotic expression
[TABLE]
The optimality of the probe state is in agreement with a result by Holevo on the optimal estimation of directions, cf. Section 4.10 of Holevo (2011). In other words, the optimal probe state for learning how to rotate about an unknown direction coincides with the optimal probe state for producing a classical estimate of such direction, as long as is larger than 1. It is worth stressing, however, that the optimal quantum strategy for rotating about an unknown direction is not based on estimation: in Section IV we will show that no estimation-based strategy can achieve the optimal quantum fidelity (32).
The exact values of the average fidelity are plotted in Figure 4 for various values of from to . Note that the fidelity decreases monotonically with the rotation angle . Intuitively, rotating by smaller angles is easier, because the uncertainty about the rotation axis has less influence on the performance. The easiest rotation is the identity , which is independent of the rotation axis and therefore can be implemented without error. The hardest rotation is the spin flip, corresponding to . In this case, the average fidelity has the simple form
[TABLE]
Note that, since the optimal probe state is , the optimal channel for Problem 1 coincides with the optimal channel for Problem 2. In Appendix B, we show that an optimal channel can be attained by setting up an isotropic Heisenberg interaction between the memory spin and the target spin. Explicitly, we show that the maximum fidelity (32) is attained by the channel
[TABLE]
where denotes the partial trace over the probe, and is the unitary operator
[TABLE]
in which is the vector of the three Pauli matrices, is the Heisenberg coupling, and is the function
[TABLE]
where for , and for . Note that is approximately equal to in the large limit.
Physically, the unitary evolution (36) can be realized by setting up an isotropic Heisenberg interaction, described by the Hamiltonian , for some suitable coupling constant , and by letting the two spins evolve for time
[TABLE]
depending on the angle of the target rotation. Remarkably, the same probe states and the same interaction can be used to control the full time evolution of the target system: one has only to adjust the interaction time [determined by the angle ] based on the evolution time in the target dynamics [determined by the angle ]. For example, we can set and simulate the precession of a spin- particle around the direction indicated by the memory state.
An important feature of the optimal strategy is that the optimal probe state is independent of the rotation angle . Since the operation of storing the state in the quantum memory is also independent of , it follows that all the operations in the training phase can be accomplished without knowing the rotation angle. This offers the possibility to decide the value of at later times. In fact, the machine can optimally approximate the full continuous-time dynamics of the target particle, because the optimal operations for different corresponds to unitary evolutions with the same Hamiltonian, just with different evolution times.
The optimality of the Heisenberg interaction is not limited to the average fidelity. In terms of scaling with , the unitary gate (36) is optimal also for the worst-case fidelity, defined as
[TABLE]
where is the fidelity for the simulation of on the specific input state . Indeed, in Appendix C, we show that the worst-case fidelity of the unitary gate (36) is
[TABLE]
Hence, the worst-case infidelity has the scaling . This is the best scaling one can hope for, because the average infidelity cannot vanish faster than [as shown by Eq. (33)], and the average infidelity is a lower bound to the worst-case infidelity.
The optimality of the Heisenberg interaction answers in the affirmative a question raised by Marvian and Mann Marvian and Mann (2008), who assumed the Heisenberg interaction and showed that it can be used to approximate arbitrary rotations in the limit of large limit. In the conclusion of their work, Marvian and Mann asked whether the Heisenberg interaction achieves the best scaling of the error with the spin size. Our results provide an affirmative answer, showing that the Heisenberg interaction maximizes the average fidelity and has the optimal error scaling in the worst-case scenario.
III.4 Optimal quantum strategy for
For , the optimal probe state for Problem 2 is still the coherent state for every rotation angle , and the optimal solutions of Problems 1 and 2 still coincide.
Curiously, the optimal learning strategy exhibits a transition when the rotation angle approaches . For , the optimal fidelity is still given by Equation (32), and the optimal channel is still given by Equation (35).
For , instead, the optimal fidelity becomes
[TABLE]
and is achieved by the following strategy:
Perform a joint measurement on the memory and the target. The measurement has two outcomes and is described by the quantum operations and , with
[TABLE]
being the projector on the subspace with total angular momentum , with . 2. 2.
If the measurement yields outcome “”, then apply the unitary gate (36), corresponding to the Heisenberg interaction, and discard the memory. If the measurement yields outcome “”, then perform the optimal 2-to-1 universal NOT channel Bužek et al. (1999), namely the channel defined by
[TABLE]
The probability of the outcome “”, corresponding to the universal NOT, depends on the parameter in Eq. (78). At the critical distance , one has , and the optimal strategy is realized through the Heisenberg interaction. As the rotation angle gets closer to , the coefficient increases, reaching its maximum value for . At this point, the weight of the universal NOT is maximum. Notably, the value is never reached, meaning that the optimal joint measurement on the input qubits is never projective.
III.5 Optimal quantum strategies for
The case is the only case where Problems 1 and 2 yield different solutions. The difference appears when the rotation angle is within a critical distance from .
For , the optimal probe state for Problem 2 is , and therefore the optimal solutions for Problems 1 and 2 still coincide. The optimal average fidelity is still given by Equation (32) and the optimal channel is still given by Equation (35).
For , the optimal average fidelity for Problem 1 is
[TABLE]
corresponding to Equation (32) with . The optimal channel is still given by Equation (35).
Instead, the optimal fidelity for Problem 2 is
[TABLE]
and is attained with the probe state , the -orbital aligned in the direction of the -axis. In Subsection IV.5, we will show that the optimal quantum fidelity (45) is achievable with a purely classical memory. Specifically, we will see that the optimal strategy is to perform a projective measurement on the probe, with the three measurement outcomes corresponding to the three Cartesian axes. The measurement outcome is then stored into a classical memory of 2 bits. In the execution phase, the machine rotates the target qubit by an angle about the axis corresponding to the measurement outcome.
III.6 Optimal fidelities for and
The dependence of the fidelity on the rotation angle is plotted in Figure 5 for and . The value of the optimal quantum fidelity is contrasted with the maximum fidelity achievable with a purely classical memory, which will be derived in Section IV.
IV The quantum benchmark
In this section we derive the maximum fidelity achievable by learning machines with a purely classical memory of arbitrarily large size. Such fidelity provides a benchmark that can be used to certify the experimental demonstration of quantum-enhanced learning. We consider the two learning tasks corresponding to Problem 1 (learning from a spin coherent state) and Problem 2 (learning from a rotation gate) coincide. The quantum benchmarks for these two problems coincide for all values of except . For , the two benchmarks become different when the desired rotation angle approaches .
IV.1 Measure-and-operate (MO) channels
Here we consider learning strategies where the memory in Figures 1 and 3 is purely classical. In this case, the transfer of information from the probe to the memory is described by a quantum-to-classical channel , of the form
[TABLE]
where is a set of orthogonal states of the memory, and is a Positive Operator-Valued Measure (POVM), describing a quantum measurement on system in the case of Figure 1, or a quantum measurement on system in the case of Figure 3.
The execution phase consists in reading out the index from the classical memory and performing a conditional operation on the system. Hence, the channel has the form
[TABLE]
The operations performed by machines with purely classical memory will be called measure-and-operate (MO) strategies. Combined together, the “measure” channel and the “operate” channel give a single quantum channel , of the form
[TABLE]
where denotes the partial trace over all systems except system .
In the following, we will solve the optimisations in Problems 1 and 2 under the constraint that the channel is of the form (48). By definition, the optimal MO fidelities are by definition no larger than the optimal quantum fidelities derived in the previous Section.
IV.2 Structure of the optimal MO strategy for Problem 2
The structure of the optimal MO strategy for Problem 2 is summarized by the following Theorem, proven in Appendix D.
Theorem 3
The optimal MO strategy for learning the gate from the gate has the following features:
no auxiliary system is needed 2. 2.
the optimal probe state is an eigenstate of , denoted as 3. 3.
the outcome of the optimal POVM is an element of the rotation group , denoted as 4. 4.
the optimal POVM is rotationally covariant Holevo (2011)**, and has the form
[TABLE]
where is a unit vector 5. 5.
the optimal conditional operation has the form , where is a fixed channel acting on the target qubit.
In the following we will maximise the gate fidelity over all MO strategies with the features described by Theorem 3. For convenience we will express the gate fidelity in terms of the entanglement fidelity [cf. Equation (15)].
IV.3 Choi operator formulation
For an optimal strategy as in Theorem 3, the entanglement fidelity takes the form
[TABLE]
where is the Choi operator of the channel , is the Choi operator of the channel , and .
Our goal is to maximise the entanglement fidelity (IV.3) over all values of , over all unit vectors , and over all Choi operators . To this purpose, the key observation is that the Choi operator can be chosen to be real in a suitable basis. Specifically, we have the following
Proposition 1
The Choi operator maximizing the fidelity (IV.3) can be chosen to be real in the Bell basis
[TABLE]
**Proof. **Every unitary is a real linear combination of the matrices and . Hence, every vector is a real linear combination of the vectors , , , and . Since the fidelity depends on the Choi operator only through the matrix elements , the optimal Choi operator can be chosen to be real in the same basis as the vectors .
Thanks to Proposition 1, the maximization of the fidelity can be restricted to the set of Choi operators that are real in the Bell basis. This set of Choi operators can be equivalently characterized as the set of Choi operators of unital channels, i.e. quantum channels mapping the identity operator to itself. Indeed, we have the following
Proposition 2
A qubit channel is unital if and only if its Choi operator is real in the Bell basis
[TABLE]
**Proof. **If a qubit channel is unital, then it is a convex combination of unitary channels Landau and Streater (1993). For every unitary channel, the corresponding Choi operator is real in the Bell basis. Indeed, every unitary channel has a Kraus decomposition with a single unitary operator of the form , with and . Hence, the Choi operator is real in the Bell basis. Since the set of real Choi operators is convex, every unital channel is contained in it.
Conversely, suppose that a channel has a Choi operator that is real in the Bell basis, i.e. , for some real symmetric matrix . Then, one has
[TABLE]
the last equality following from the relation . Hence, the channel is unital.
Since the fidelity is a linear function, its maximization can be restricted to the extreme points of the set of unital channels. For qubits, such extreme points are unitary channels Landau and Streater (1993). Hence, we obtained the following
Theorem 4
The quantum channel maximizing the fidelity (IV.3) can be chosen to be unitary without loss of generality.
Thanks to Theorem 4, the optimal entanglement fidelity (IV.3) can be expressed as
[TABLE]
where is a suitable unitary and .
The optimization can be further simplified using the following observation:
Proposition 3
The unitary gate maximizing the fidelity (IV.3) can be chosen without loss of generality to be a rotation about the axis.
**Proof. **Every unitary can be written as , where is a rotation about the axis by an angle , and is the rotation that transforms the axis into the rotation axis of . Hence, the corresponding state can be written as .
Using this fact, the optimal MO fidelity can be rewritten as
[TABLE]
(here denotes the transpose of the matrix .) The last equation shows that the maximisation of the fidelity can be reduced to rotations about the axis.
At this point, it remains to maximise the fidelity (IV.3) over , , and . The result of the optimization is summarised in the following, while the details are provided in Appendix E.
IV.4 Optimal MO strategy for
For , it turns out that the quantum benchmarks for Problems 1 and 2 coincide.
Theorem 5
*For , the optimal probe state for learning the gate from the gate by MO operations is for every value of . For both Problems 1 and 2, optimal MO fidelity is *
[TABLE]
and has the asymptotic expression
[TABLE]
The optimal MO strategy consists in
measuring the probe with the POVM , and 2. 2.
rotating the target qubit about the rotated -axis by the angle
[TABLE]
where for , and for .
Note that the probe state and the measurement are both independent of the rotation angle . This means that the machine can be trained optimally even before the value of the rotation angle has been decided. The operations in the training phase coincide with the optimal estimation strategy for directions, derived in the classic work by Holevo Holevo (2011).
The optimal MO strategy can be implemented by a learning machine with a purely classical memory. The size of the classical memory can be chosen without loss of generality to be bits. This is because the fidelity is a linear function of the POVM, and therefore its maximum is attained by an extreme point of the convex set of all POVMs with outcomes in . The extreme points of such set consist of POVMs that assign non-zero probability to at most rotations Chiribella et al. (2007). Hence, the optimal POVM in Theorem 5 can be replaced by another, equally optimal POVM with at most outcomes, which can be stored into a classical memory of bits.
A plot of the MO fidelity and of the optimal quantum fidelity is provided in Figure 6. Note that the error (one minus fidelity) goes to zero in both cases, but the rate for quantum strategies is twice as fast, as one can see by comparing Equations (33) and (57).
IV.5 Optimal MO strategies for
The case exhibits an anomalous behaviour when the rotation angle approaches . For , Problems 1 and 2 have the same optimal MO fidelity, and the same optimal MO strategy, described in Theorem 5. For , the optimal the optimal MO fidelities become different. For , the optimal average fidelity for Problem 1 is
[TABLE]
with
[TABLE]
corresponding to Equation (56) with . The MO strategy is still the one described in Theorem 5.
For Problem 2, the optimal probe states transitions from to , and the optimal fidelity becomes
[TABLE]
The optimal MO strategy consists of
Measuring the memory with the POVM operators 2. 2.
Rotating the target qubit about the axis by an angle , independently of .
Physically, the optimal POVM can be interpreted as a randomisation of the projective measurement that projects the spin- particle along the three Cartesian axes , and Chiribella et al. (2007). This projective measurement corresponds to the orthonormal basis for defined by , , and . In the language of atomic physics, , , and are the -orbitals aligned in the directions , and , respectively. Since the fidelity is a linear function of the POVM, the optimal POVM can be replaced by an equally optimal POVM based on the projective measurement of , followed by a rotation by about the Cartesian axis identified by the measurement outcome. In this discretised version of the MO strategy, the learning machine only needs a classical memory of bits.
V Persistence of the quantum advantage
We have seen that a machine equipped with a quantum memory can outperform every classical machine at the task of learning rotations about an unknown axis. Still, our analysis was restricted to the scenario where the quantum process accesses its memory only once, with the goal of reproducing a single use of the target gate. In the following we will study how the performance depends on the number of required executions of the target gate.
Let us focus on the regular case , where the optimal strategies for Problems 1 and 2 coincide, and the channel is realised by setting up a Heisenberg interaction between the memory and the target qubit. An important question is how many times the memory can be accessed before the accuracy drops below a certain threshold. In the context of quantum reference frames, the maximum number of accesses such that the fidelity is above threshold was called the longevity in Ref. Bartlett et al. (2006). Another important question is how many times the memory can be accessed before the quantum advantage is lost. The maximum number of accesses for which the fidelity is above the quantum benchmark (56) will be called persistence of the quantum advantage in the following.
Suppose that the joint evolution of memory and target is described by the same unitary gate at every step. Assuming the gate to be of the form of Eq. (36) for some fixed function , we obtain the close-form expression
[TABLE]
quantifying the average fidelity at the leading order in (see Appendix F for the derivation). From this expression one can see that the longevity grows as . However, the persistence of the quantum advantage is much shorter: comparing the fidelity (62) with the MO fidelity (57), we find that the quantum advantage disappears when the number of repetitions is larger than
[TABLE]
One could also consider more elaborate strategies where the interaction time between memory and target is optimised at every step. However, we find that these strategies do not increase the longevity nor the persistence of the quantum advantage in the large limit.
VI Robustness to thermal noise
In Problem 1, we made the simplifying assumption that the unknown direction is imprinted into the pure spin-coherent state , regarded as the low-temperature approximation of the thermal state of the magnetic dipole Hamiltonian. An interesting question is how this approximation affects our discussion of the quantum advantage. In the following we will address this question in the large limit, showing that quantum memories are useful whenever the magnetic energy is sufficiently large compared to the thermal fluctuations.
The thermal states of the Hamiltonian can be written as
[TABLE]
where is the temperature and is the Boltzmann constant. The spin coherent state is retrieved in the low temperature () limit, as one has .
Now, suppose that the learning strategy designed for the spin coherent state is adopted for the mixed state . In Appendix G, we show that the average fidelity has the asymptotic expression
[TABLE]
The above fidelity can be compared the benchmark in Equation (57), which quantifies the maximum fidelity achievable with classical memories. Note that Equation (57) provides the benchmark for both Problems 1 and 2, meaning that the benchmark applies to every pure probe state of the form , and by convexity, to every mixed probe state of the form . In particular, it applies to the thermal states , as the average fidelity over all directions is equal to the average fidelity over all rotations . Comparing the fidelity (65) with the benchmark in Equation (57), we obtain that the quantum strategy outperforms all classical strategies whenever is larger than , corresponding to the condition . Hence, the quantum advantage persists whenever the magnetic energy is larger than times the thermal energy .
Note that the quantum benchmark in Equation (57) is the optimal fidelity achievable with arbitrary probe states. If one further enforces the condition that the the probe state be thermal, then the value of the benchmark would be even lower, thereby extending the set of temperatures for which the quantum memory offers an advantage.
Note also that the above discussion applies to a variant of Problem 2 where the probe is subject to thermal noise before the action of the training gate , resulting into a mixed input state . Also in this setting, the quantum memory offers a provable advantage when the parameter is larger than .
VII Learning higher dimensional gates
Our result establishes the existence of a quantum advantage for learning single-qubit rotations about an unknown axis. This finding is conceptually important, because the advantage for single qubits implies an advantage of coherent learning for quantum systems of arbitrary dimension. Indeed, one can immediately prove the advantage by using the qubit benchmark for gates that act nontrivially only in a fixed two-dimensional subspace.
Our results also give a heuristic for the problem of learning rotation gates on higher dimensional spins. The idea is to encode the rotation axis in a spin coherent state and to let the memory and target spin interact as closed system. Explicitly, we make two spin systems undergo the Heisenberg interaction , where are the spin operators of the target spin. Using the unitary gate , in Appendix H we obtain the average fidelity
[TABLE]
in the large limit. Remarkably, the error grows quadratically—rather than linearly—with the size of the target spin: in order to ensure high fidelity, the size of the memory must be large compared to the square of the size of the target system. The same conclusion holds for the worst-case fidelity, which has the asymptotic expression
[TABLE]
with for even and for odd .
The quantum strategy exhibits an advantage over the MO strategy consisting in measuring the direction from the spin coherent state pointing in direction and performing a rotation based on the outcome. Again, we find that the error of the quantum strategy vanishes in the macroscopic limit of large memory systems, at a rate twice as fast than the error of the classical strategy (see Appendix H for more details). It is an open question whether the above quantum and MO strategies are optimal for arbitrary .
VIII Conclusions
We determined the ultimate accuracy for the task of learning a rotation of a desired angle about an unknown axis, imprinted in the state of a spin- particle. In this task, we found that quantum memories enhance the learning performance for every and for every rotation angle . Specifically, we found that a quantum machine with a memory of qubits outperforms all learning machines with classical memory of arbitrarily large size.
We found that the advantage of the quantum memory persists even when the memory is accessed multiple times, as long as the total number of accesses is at most linear in the spin size. Quite interestingly, we observe a relation between the persistence and the size of the advantage: in the large limit, the quantum advantage is of size and persists when the memory is accessed for times. Our results indicate that, as the memory size grows, the quantum advantage is spread over a larger amount of time. This tradeoff achieves the classical limit for spins of infinite size, for which the advantage disappears and the memory can be accessed infinitely many times.
At the fundamental level, our results provides the first example of a quantum memory advantage in a deterministic learning task involving unitary gates as the target operations. Advantages of quantum memories have been known for longer time for non-deterministic learning tasks, where the learning machine has a non-zero probability of aborting. For example, Refs. Nielsen and Chuang (1997); Vidal et al. (2002); Hillery et al. (2002); Vidal et al. (2002); Brazier et al. (2005); Ishizaka and Hiroshima (2008); Bartlett et al. (2009) provide examples of machines that learn an unknown unitary gate without errors, albeit with a non-unit probability of success. In all these examples, a quantum memory is necessary in order to achieve error-free learning. In practice, however, no real machine is error-free, and in order to experimentally demonstrate the advantage of the quantum memory one needs a benchmark that quantifies the best performance achievable with classical machines. No such benchmark has been derived for the non-deterministic learning tasks considered in Refs. Nielsen and Chuang (1997); Vidal et al. (2002); Hillery et al. (2002); Vidal et al. (2002); Brazier et al. (2005); Ishizaka and Hiroshima (2008); Bartlett et al. (2009), and a rigorous demonstration of the advantage of the quantum memory has not been possible so far. A promising direction of future research is to apply the techniques developed in this paper to the derivation of quantum benchmarks for non-deterministic learning of unitary gates.
Our work calls for the experimental demonstration of quantum-enhanced learning of rotations around an unknown direction. For small values of the spin, a possible testbed is provided by NMR systems, where spin-spin interactions are naturally available Vandersypen and Chuang (2005). Another possibility is to use quantum dots, where one can engineer a coupling between a single spin and an assembly of spins effectively behaving as a single spin particle Chesi and Coish (2015). This scenario, named the box model, can be achieved through a uniform coupling of a central spin to the neighbouring sites. No matter what platform is adopted, our results provide the rigorous benchmark that can be used to validate the successful demonstration of quantum-enhanced unitary gate learning in realistic scenarios where the implementation is subject to noise and experimental imperfections.
Acknowledgements. The authors thank E Bagan for discussions and feedback on an earlier version of the manuscript. This work is supported by the National Natural Science Foundation of China through grant 11675136, the Hong Kong Research Grant Council through Grant No. 17326616 and 17300317, the Croucher Foundation, the HKU Seed Funding for Basic Research, the Foundational Questions Institute through grant FQXi-RFP3-1325, and the Canadian Institute for Advanced Research (CIFAR).
Appendix A Derivation of the optimal quantum strategy
In order to find the maximum of the fidelity (31) under the constraints (25) we use the method of Lagrange multipliers, setting and . The search of the stationary points of the fidelity yields the following four cases:
- Case 1: . In this case, the fidelity is given by
[TABLE]
and is attained by the Choi operator
[TABLE]
with
[TABLE]
The maximum of the fidelity is attained by , independently of . Explicitly, the maximum fidelity is
[TABLE]
Note that the fidelity converges to 1 in the large limit, meaning that the learning becomes nearly perfect for large spins. Comparison with Cases 2,3, and 4 in the following shows that the fidelity (71) is optimal for every angle whenever the spin is larger than 1. 2. Case 2: . In this case, the Lagrangian method yields the fidelity
[TABLE]
achieved by setting
[TABLE]
and according to Eq. (25). The fidelity does not tend to in the large limit, indicating that the Case 2 strategy is suboptimal for large . Still, it turns out that for this strategy is optimal for some values of the angle around . In this case, the entanglement fidelity becomes
[TABLE]
and the optimal Choi operator is
[TABLE]
with
[TABLE]
The transition from the Case 1 strategy to the Case 2 strategy occurs when the distance is below the critical value . 3. Case 3: , . Note that a strategy with can only exist for , because for there is no subspace with spin , and therefore the coefficient is not present. The method of Lagrange multipliers implies that, among the strategies with and , the maximum fidelity is attained when and take their maximum values. The corresponding the Choi operator is
[TABLE]
and its fidelity is
[TABLE]
The maximum, attained for , is
[TABLE]
The fidelity does not reach 1 in the large limit, indicating that the Case 3 strategy is suboptimal for large . Nevertheless, we find out that for the Case 3 strategy is optimal for rotation angles around . For , the entanglement fidelity is
[TABLE]
A numerical comparison with the fidelity for Case 1 indicates that the above fidelity is optimal for , with . For , instead, the Case 1 strategy is optimal. 4. *Case 4: . * This case is similar to Case 3, and the fidelity has the expression
[TABLE]
By comparison with the other cases, we find that the Case 4 fidelity is never optimal.
Note that for Problem 1 with , only Case 1 and Case 2 need to be considered as the memory state is . It is easy to check that in Eq. (73) does not satisfy constraint Eq. (25) for arbitrary , showing that Case 1 is always the optimal solution for Problem 1 when .
Appendix B Heisenberg interaction is the optimal learning strategy
In this section, we prove that the channel in Eq. (35) with unitary gate in Eq. (36) is the optimal learning channel. To this purpose, we calculate its entanglement fidelity , and show that it is equal to the optimal entanglement fidelity given by Eq. (68).
First of all, we note that the unitary gate can be expanded as
[TABLE]
where is an irrelevant global phase, which we will ignore from now on. Using this expression, we obtain the relations
[TABLE]
and we can get:
[TABLE]
The entanglement fidelity for this physical realization can be written as
[TABLE]
where being the identity map on the reference system . Then by inserting Eq. (B) and
[TABLE]
into Eq. (87), we can get that:
[TABLE]
where the equality will be reached when we set equal to Eq. (37). It is equal to the optimal entanglement fidelity in Eq. (68).
Appendix C Worst-case fidelity
Here we show that learning to perform target gate by using Heisenberg interaction in Eq. (35, 36) has an error scaling in in terms of the worst-case fidelity (defined by Eq. (39)).
The worst-case fidelity is over all learning gate and over all input target states :
[TABLE]
where
[TABLE]
is the fidelity for the simulation of on the specific input state , and
[TABLE]
is calculated according to the optimal physical realization.
Note that the trace is invariant under cyclic permutations and , we can rewrite Eq. (91) as:
[TABLE]
By expanding in basis {, }: , we find that:
[TABLE]
By inserting Eq. (C) into Eq. (91), we can get
[TABLE]
showing that
[TABLE]
Appendix D Proof of Theorem 3
The proof of the first two items of Theorem 3 is identical of the proof of Lemma 2.
It remains to prove that there exists an optimal MO strategy consisting of a covariant POVM and of conditional operations .
The MO fidelity for Problem 2 can be expressed as
[TABLE]
For every , we define the probability
[TABLE]
the POVM
[TABLE]
and the quantum channels
[TABLE]
Note that the operators satisfy the normalization condition
[TABLE]
following from Schur’s lemma.
In terms of the above probabilities, POVMs, and channels, the expression (97) can be rewritten as
[TABLE]
Since the fidelity is a convex combination, we have the upper bound
[TABLE]
It is immediate to check that the bound is attained by the MO strategy consisting of the POVM and of the conditional operations , where is the outcome that maximizes the expression in the right-hand-side of Equation (103).
Appendix E Optimization of the MO strategy
Our goal is to maximize the fidelity
[TABLE]
over all values of , all unit vectors , and all unitary gates . Using the relation , we can rewrite the fidelity as
[TABLE]
with and .
For every angle , the vector can be expanded as
[TABLE]
having used the notation for the eigenstates of the -component of the total spin of a bipartite system consisting of two spins and , respectively. Hence, we have
[TABLE]
and
[TABLE]
Moreover, the fidelity can be expressed as
[TABLE]
having defined . We now insert Equation (106) into the above expression, taking advantage of the orthogonality relation
[TABLE]
In this way, the fidelity becomes
[TABLE]
with
[TABLE]
Expanding as , we obtain
[TABLE]
Note that the bound can be attained by choosing to be an eigenstate of with suitable eigenvalue .
Now, let be the the modulus of , and let and be the positive and negative part of , respectively. With inserting these definitions in Eq. (110), the fidelity can be upper bounded as
[TABLE]
the second inequality following from the Cauchy-Schwarz inequality applied to the vectors \left(\sqrt{\Gamma_{+}}-\sqrt{\Gamma_{-}}\right)\,\big{(}|j,m\rangle\otimes|j,-m\rangle\big{)} and \left(\sqrt{\Gamma_{+}}+\sqrt{\Gamma_{-}}\right)\,\big{(}|j,n\rangle\otimes|j,-n\rangle\big{)}. We will discuss the attainability of the bound (111) in the end of the proof.
Inserting the definition of [Eq. (E)] in the bound (111), we obtain
[TABLE]
which becomes
[TABLE]
For , one can easily see that each of the three summands in the above expression has its maximum value for , independently of the angles and . Setting and optimizing over we obtain that the maximum is obtained for
[TABLE]
for in , and by
[TABLE]
for in (recall that the range of is between [math] and ). For these values of , the entanglement fidelity is
[TABLE]
The same approach works for , in which case is the only possible choice, and the optimization over yields again the optimal value (114).
Note that the choice of angles in Eqs. (114) and (115) satisfies the condition . Hence, the operator is positive, and therefore . As a consequence, the inequality (111) is attained by choosing .
For , the optimal MO strategy is determined by a brute-force approach, by setting and , optimizing the right-hand-side of Eq. (113) over . When , the optimal MO strategy is the same as when . When , the optimal is , and the optimal angle becomes . Also in this case, the operator is positive, and therefore the inequality (111) is attained by choosing .
Appendix F Persistence of the quantum advantage
The state of the memory spin after the interaction can be obtained by application of the complementary channel , defined by
[TABLE]
where denotes the partial trace over the target spin, and is the unitary operator in Eq. (36).
To evaluate this state, it is convenient to look at the evolution of the basis states . By explicit calculation, we obtain the relation
[TABLE]
where the coefficients are given by
[TABLE]
At the first step, the memory starts in the state . By repeatedly applying Eq.(117), we then obtain the memory state at every step. Explicitly, the memory state for the -th usage is given by
[TABLE]
where is the probability distribution after usages, which is given by
[TABLE]
being Tricomi’s function (confluent hypergeometric function of the second kind). Using the recursion formula
[TABLE]
we get the asymptotic expression
[TABLE]
Now, Equation (119) gives us the memory state at the -th iteration. The fidelity obtained by using this state is given by
[TABLE]
where is the average fidelity when the probe is in the state , namely
[TABLE]
The average over the input states can be easily computed using the relation with the entanglement fidelity, Equation (15) . Using Equation (35) for the gate , we obtain the asymptotic expression
[TABLE]
One can see directly that in asymptotics, is a arithmetic progression and is a geometric progression. Inserting the above expressions into Eq. (123) we obtain
[TABLE]
Comparing with the MO fidelity in Eq.(57), we obtain that the persistence of the quantum advantage tends to .
The exact dependence of the fidelity on is shown in Figure 7 for different values of the spin and for rotation angle . Interestingly, the persistence of the quantum advantage is exactly equal to the asymptotic value for all the values of shown in the figure.
We showed the explicit calculation of and when the interaction time is fixed at every step. More general strategies where the interaction time is optimized at every step can be studied in the same way. In the large limit, we find that such step-by-step optimization is not needed: the fidelity tends to the same value, no matter whether the interaction time is optimized at every step or once for all. As a result, the persistence of the quantum advantage is the same in both scenarios.
Appendix G Robustness of the quantum strategy
Here we evaluate the fidelity in the execution of the gate when the optimal learning strategy for pure states is adopted with a probe in the thermal state . The fidelity of this strategy is
[TABLE]
with as in Equation (36). Inserting the expression for the state into the above equation, we obtain
[TABLE]
with defined as in Equation (124). The asymptotic expression for was computed in Equation (125). Inserting this expression in the above equation, we obtain
[TABLE]
Appendix H Learning higher dimensional rotations for spin- particle
Following the structure of the optimal learning mechanism for spin , we choose the memory state to be and we let the two spins undergo the Heisenberg interaction
[TABLE]
where are the spin operators of the target spin.
Using the above strategy, we can explicitly compute the entanglement fidelity, given by
[TABLE]
[TABLE]
where being the canonical maximally entangled state of two spin- particles, denotes a reference qubit, entangled with the target spin- particle, and is a rotation of around the axis in representation.
Inserting the formula of in Eq. (131), using the expressions of the Clebsch-Gordan coefficients, we arrive at the asymptotic expression
[TABLE]
The average fidelity is then given by
[TABLE]
A similar calculation can be done for the MO strategy consisting in measuring the memory state with POVM and then performing the conditional operation on the target spin- particle, which means rotate with angle with the rotated -axis :
[TABLE]
with
[TABLE]
By denoting as the angle between and , and the rotation angle for the rotation , the entanglement fidelity can be rewritten as
[TABLE]
Performing the average, we obtain the asymptotic expression
[TABLE]
which can then be used to evaluate the average fidelity as
[TABLE]
By comparing with Eq. (134), we again see that the error is exactly twice the error of the coherent quantum learning strategy.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Biamonte et al. (2017) J. Biamonte, P. Wittek, N. Pancotti, P. Rebentrost, N. Wiebe, and S. Lloyd, Nature 549 , 195 (2017).
- 2Dunjko and Briegel (2018) V. Dunjko and H. J. Briegel, Reports on Progress in Physics 81 , 074001 (2018).
- 3Aïmeur et al. (2006) E. Aïmeur, G. Brassard, and S. Gambs, in Conference of the Canadian Society for Computational Studies of Intelligence (Springer, 2006) pp. 431–442.
- 4Harrow et al. (2009) A. W. Harrow, A. Hassidim, and S. Lloyd, Physical Review Letters 103 , 150502 (2009).
- 5Rebentrost et al. (2014) P. Rebentrost, M. Mohseni, and S. Lloyd, Physical Review Letters 113 , 130503 (2014).
- 6Rønnow et al. (2014) T. F. Rønnow, Z. Wang, J. Job, S. Boixo, S. V. Isakov, D. Wecker, J. M. Martinis, D. A. Lidar, and M. Troyer, Science 345 , 420 (2014).
- 7Wiebe et al. (2014) N. Wiebe, A. Kapoor, and K. M. Svore, ar Xiv preprint ar Xiv:1412.3489 (2014).
- 8Dunjko et al. (2016) V. Dunjko, J. M. Taylor, and H. J. Briegel, Physical Review Letters 117 , 130501 (2016).
