Query-optimal estimation of unitary channels in diamond distance
Jeongwan Haah, Robin Kothari, Ryan O'Donnell, Ewin Tang

TL;DR
This paper presents an optimal algorithm for estimating unknown unitary quantum channels with minimal applications, improving efficiency over previous methods and establishing matching lower bounds.
Contribution
The authors develop a query-optimal algorithm for unitary channel estimation in diamond norm, achieving Heisenberg scaling with minimal resource usage.
Findings
Achieves $ extsf{d}^2/ extsf{ε}$ applications for $ extsf{d}$-dimensional unitaries.
Introduces a bootstrap technique to improve estimation accuracy.
Proves a matching lower bound confirming optimality.
Abstract
We consider process tomography for unitary quantum channels. Given access to an unknown unitary channel acting on a -dimensional qudit, we aim to output a classical description of a unitary that is -close to the unknown unitary in diamond norm. We design an algorithm achieving error using applications of the unknown channel and only one qudit. This improves over prior results, which use [via standard process tomography] or [Yang, Renner, and Chiribella, PRL 2020] applications. To show this result, we introduce a simple technique to "bootstrap" an algorithm that can produce constant-error estimates to one that can produce -error estimates with the Heisenberg scaling. Finally, we prove a complementary lower bound showing that estimationā¦
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Computing Algorithms and Architecture Ā· Quantum Information and Cryptography Ā· Quantum Mechanics and Applications
\setkomafont
disposition
Query-optimal estimation of
unitary channels in diamond distance
Jeongwan Haah111Microsoft Quantum, Redmond, Washington, USA
āā
Robin Kothari222Most of this work was done while the author was at Microsoft Quantum, Redmond, Washington, USA. The author is currently at Google Quantum AI, Venice, California, USA.
āā
Ryan OāDonnell333Carnegie Mellon University Computer Science Department, Pittsburgh, Pennsylvania, USA. Part of this research was performed while the author was at Microsoft Quantum.
āā
and Ewin Tang444University of Washington, Seattle, Washington, USA
Abstract
We consider process tomography for unitary quantum channels. Given access to an unknown unitary channel acting on a -dimensional qudit, we aim to output a classical description of a unitary that is -close to the unknown unitary in diamond norm. We design an algorithm achieving errorĀ using applications of the unknown channel and only one qudit. This improves over prior results, which use [via standard process tomography] or [Yang, Renner, and Chiribella, PRL 2020] applications. To show this result, we introduce a simple technique to ābootstrapā an algorithm that can produce constant-error estimates to one that can produce -error estimates with the Heisenberg scaling. Finally, we prove a complementary lower bound showing that estimation requires applications, even with access to the inverse or controlled versions of the unknown unitary. This shows that our algorithm has both optimal query complexity and optimal space complexity.
Contents
-
1.1 Preliminaries: Distances for unitary (and other) channels
-
C Gate-efficient pure state tomography with optimal sample complexity
1 Introduction
Quantum mechanics models the time evolution of a closed system as a unitary operator on the vector space of states. From the perspective of an experimentalist, this model serves as a mechanism for producing statistical predictions. This operational point of view prompts a fundamental question: when and how one can interact with a system with some unknown dynamics to learn its corresponding unitary operator? This question generally goes under the name of process tomography, and has been studied extensively in a variety of settings depending on chosen figures of merit; see our discussion inĀ SectionĀ 1.3 below. The figures of merit may be one or more of the following: the accuracy of an estimate (measured with some metric on the space of quantum channels), the number of times one has to run the unknown dynamics, the complexity of initial states and that of measurements, the size of coherent Hilbert space required, and the complexity of classical processing.
In this paper, we design an algorithm with the following properties. LetĀ denote the diamond norm, the completely bounded norm on the space of linear operators acting on density matrices. LetĀ denote the set of all unitary matrices. IfĀ is any unitary operator, we denote byĀ the associated unitary channel
[TABLE]
Theorem 1.1** (Upper bound).**
There is a quantum algorithmĀ that, given black-box access to an unknown -dimensional unitary channel and any , outputs a classical description of a unitary. This output is probabilistic and can be viewed as a -valued random variable . The algorithm satisfies the following properties:
(Query complexity)* queries the black box times.* 2. 2.
(Space complexity)* only uses one qudit of dimension . Specifically, it only prepares states of the form , where the positive integerĀ and unitariesĀ are adaptively chosen, and measures them in the computational basis.* 3. 3.
(Gate complexity)* If the -dimensional qudit is embedded in qubits, then uses one- and two-qubit quantum gates in total beyond the queries to the black box, and the classical time complexity is .* 4. 4.
(Closeness of output unitary)* The unitary output by is -close to the unknown in diamond norm with high probability. Specifically, , which implies .* 5. 5.
(Closeness of output channel)* We can also view the output of as a channel. Let denote the mixed unitary channel induced by runningĀ onĀ and then applying its output . Then .*
Complementary to these performance guarantees, we also show the following lower bound, which also holds against algorithms that can use and controlled versions of and .
Theorem 1.2** (Lower bound).**
Let be an algorithm that, for an unknown -dimensional unitary accessible through black box oracles that implement , , , and , can output a classical description of a unitary channel such that with probability . Then must use oracle queries.
Due to subtle differences in assumptions and objectives, we give comparison to the prior art after we properly define mathematical notions. In brief: standard process tomography for general channelsĀ [Leu00] uses one qudit but requires a error dependence, which we improve quadratically.555We were not able to locate a -query process tomography algorithm for diamond-norm distance in the literature; we prove this result in SectionĀ 2. Most prior work takes the accuracy measure to be either the one in ItemĀ 5 of TheoremĀ 1.1, referred to in this work as the problem of Approximate Storage-And-Retrieval, or the closeness of the output unitary in entanglement infidelity. Prior work of Yang, Renner, and ChiribellaĀ [YRC20] gives an algorithm that applies the unknown in parallel to achieve the current best guarantees for these problems. In particular, it recovers TheoremĀ 1.1.5 with the same query complexity. However, these problems are easier than that of unitary estimation to diamond-norm distance: norm conversion gives an algorithm for diamond-norm distance with applications. We improve on this by a factor of , and improve the number of qudits used from to .
Notation.
Throughout the paper, all s have base , and for any . The dimension of a qudit is denoted byĀ , which we sometimes take to be qubits, so that . For matrices, is the largest singular value (also known as the operator norm or spectral norm), is the trace norm (also known as the Schatten 1-norm), and is the Frobenius norm. For a vector , denotes the matrix with on the diagonal. We useĀ to denote the identity element ofĀ .
1.1 Preliminaries: Distances for unitary (and other) channels
We are mainly interested in distance measures between unitary channels. However, for comparison with prior work we need to consider distances between general channels. For more information on the distances discussed herein, the reader may consult, e.g.,Ā [GLN05, YF17]. We defer proofs to AppendixĀ A.
A standard way to measure distances between channels operating on a register āā is to choose a distanceĀ on mixed quantum states and then consider
[TABLE]
where is an ancilla register and is a mixed state on both registers. When is trace distance, this leads to the diamond norm:
Definition 1.3**.**
When the state metric in EquationĀ 2 is trace distance, , we obtain the diamond-norm distance, denoted .
The diamond-norm distance is a natural choice for measuring worst-case error. It has direct operational meaning: if a channel sitting in a quantum circuit is replaced by another channel that is -different in diamond-norm, then the output distribution of the algorithm is at most -different from the original in total variation distance. Between unitary channels, the diamond-norm distance is equivalent (up to constants) to other familiar metrics on unitary matrices. We will switch between these metrics throughout the paper depending on which is convenient. One such metric is operator norm distance up to phase, which we use in SectionĀ 2.
Definition 1.4**.**
For unitaries we define
[TABLE]
We must take distance up to phase because a unitary channel only specifies a corresponding unitary up to global phase; for example, and perform identical operations on quantum states. Another equivalent metric is the intrinsic metric on the projective Lie group , which we use in SectionĀ 3.1.
Definition 1.5**.**
We consider the length of a path on the Lie group with respect to the operator norm on the tangent spaceĀ . For unitaries , we define to be the length of a shortest smooth pathĀ between them (). As an equation,
[TABLE]
We further define the corresponding metric on the projective group via this metric.
[TABLE]
An alternative way to define is that it is the unitarily invariant metric satisfying for .666 Taking this definition as a starting point, one may find it involved to prove triangle inequality, which is trivial under the definition by the minimization over paths. We will later see that these two definitions coincide.
That is, is the largest angle an eigenvalue of forms with .
Proposition 1.6** (Equivalence of diamond norm, operator norm, intrinsic Lie metrics).**
For unitaries , the metrics , , and are equivalent up to constants. Specifically, the following inequalities hold.
[TABLE]
Other work on unitary estimation and āStorage-and-Retrievalā problem, defined below, considers an accuracy measure that is not equivalent: the entanglement (in)fidelity. This is the accuracy measure that naturally arises when one estimates a channel by performing state tomography on its Choi state. Another way to recover this norm is to consider the channel norm induced by Bures distance (square-root of infidelity), i.e.Ā taking to be Bures distance in EquationĀ 2, but rather than maximizing over allĀ one simply fixes to be the maximally entangled state.777When is unitary, the entanglement infidelity is the same as the so-called average gate infidelity up to the small constant factor of Ā [HHH99]. Without this restriction of fixing , the resulting distance is an alternative distance studied inĀ [YF17]. It is termed the minimum gate fidelity whenĀ is unitary.
Definition 1.7**.**
The entanglement infidelity between -dimensional channels is defined to be
[TABLE]
where is squared quantum state fidelity and is the Choi stateĀ ofĀ .
The notation is not standard. In case and for , we have a simpler formula
[TABLE]
Remark 1.8**.**
* is not a metric onĀ , but it is related to one:*
[TABLE]
is a metric (which is roughly for smallĀ ). The Frobenius norm on the right-hand side can be thought of as the two-norm distance between the Choi states of and , treated as pure state vectors, not density matrices, minimized over global, unphysical phase factors.
An elementary argument shows that estimating a unitary channel to entanglement infidelity only implies estimating it to error in diamond-norm distance. The factor of reflects that entanglement infidelity is an average-case accuracy measure, whereas diamond norm is a worst-case accuracy measure. For example, the entanglement infidelity between the multiply controlled NOT gate (e.g., the usual CNOT if or Toffoli if ) and the identity is for largeĀ , but the diamond-norm distance between the two isĀ .
Proposition 1.9**.**
For all and any unitaries ,
[TABLE]
The left-hand side is sharp for even and the right-hand side is sharp for allĀ :
[TABLE]
1.2 Unitary estimation and related problems
We are interested in the following task:
Definition 1.10**.**
The estimation task for unitary channels is the following: One is given black-box access to a unitary channel byĀ . After applying the unitary channel someĀ times, the algorithm should output a classical description888Throughout this paper, for ease of exposition we will be treating these descriptions as if they have infinite precision. To formalize our results when numbers can only be specified up to a machine precision , one could discretize the space of unit vectors and unitary matrices with -nets. Then, a classical description of a unitary matrix or a unit vector is a list of numbers defining a matrix or vector that need not exactly satisfy the desired constraints, but is -close to an element of the net. When we need to implement a unitary from its classical description, we implement this element. This incurs an error of throughout the argument, which can be made to be by incurring overhead in classical time complexity, beyond the factors of from incorporating finite precision arithmetic (which we ignore in our discussions of time complexity). This issue only affects constant factors in our proofs in a very minor way, and we trust the diligent reader to take note of the required changes. of a unitaryĀ . The algorithm may be probabilistic and hence should be thought of as a random variable. The goal is to achieve
[TABLE]
Here, is some accuracy measure between unitary channels.
Note here we are only using the error measure between two unitary channels.
Remark 1.11**.**
More generally, one can introduce another parameterĀ and have the goal be that except with probability at mostĀ . As long as is a metric, this can be achieved with applications ofĀ as follows: First, Markovās inequality implies that . Then, one can reduce to a generalĀ at the expense of anĀ factor using the āmedian trickā; see PropositionĀ 2.4.
In order to more carefully quantify the resources needed to solve the unitary estimation task, we generally consider hybrid classical/quantum algorithms. These mix classical computation and measurement-outcome-processing with quantum state preparation, applications ofĀ and other quantum operations, and measurements. Besides the query complexity , there are some additional resources one wishes to minimize, one of which is the following.
Definition 1.12**.**
In the task of estimating , assume that for some number of qubitsĀ . The space overhead of an estimation algorithm is the total number of qubits beyond the minimum,Ā , used in the course of the algorithm.
There are several tasks closely related to unitary estimation in the literature, going under the names of āuniversal programmingā, āunitary cloningā, āreference frame transmissionā, and āunitary learningā. For comparison with previous work, it suffices for us to discuss the problem commonly known as ālearningā a unitary channel. However, since ālearningā and āestimationā are often regarded as synonymous in casual speech, we will followĀ [SBZ19] and use the terminology āStorage-And-Retrievalā, to avoid confusion.
Definition 1.13**.**
The SAR task for unitary channels (Storage-And-Retrieval) involves developing two quantum algorithmsĀ [BCD*+*10]. The āstorageā algorithmĀ is given black-box access to a unitary ; after applying the unitary some Ā times, should output a possibly mixed quantum stateĀ . The āretrievalā algorithmĀ takes as inputĀ and implements a possibly nonunitary qudit channel , which may be random with some distribution. The goal is to achieve the following:
[TABLE]
Note that here is measuring the error between a unitary channel and a possibly nonunitary channel. The storage complexity of the algorithm is the number of qubits used forĀ .
Remark 1.14**.**
The case of minimizingĀ while insisting on is known as Probabilistic SAR and is studied in, e.g.,Ā [SBZ19]. In this case, the channelĀ is necessarily unitary. The output of the storage algorithm inĀ [SBZ19] is always pure. On the other hand, for comparison to prior work, we focus on the case ofĀ , which may be termed Approximate SAR. This latter problem is the one commonly known as ālearningā a unitary.
We can perform this task with a unitary estimation algorithm by simply taking the estimateās classical description and then synthesizing and applying it. We provide some notation formalizing this.
Definition 1.15**.**
If is a -valued classical random variable, we denote by the mixed-unitary channel associated toĀ
[TABLE]
Note that if is a unitary-valued random variable, the notation stands for a unitary-channel-valued random variable; that is, is not the same thing as . We will now compare unitary estimation and SAR. An instructive example to keep in mind is the case where an algorithm takes a black box for the identity channel and outputs the matrix , which is or with probability . One can verify that and , so by the following remark, on this input with diamond-norm distance as the error measure, performs unitary estimation to error and Approximate SAR to error .
Remark 1.16**.**
A unitary estimation algorithmĀ may be turned into an SAR algorithm with the same Ā andĀ parameters: the storage algorithmĀ isĀ , withĀ being the mixed state of orthogonal quantum states each of which is classical and encodes an outputĀ ofĀ ; and the retrieval algorithmĀ simply synthesizes and appliesĀ from its classical description. Here, the channel is unitary.
More subtly, we can also turn into an Approximate SAR algorithm as follows. We let the storage algorithm the same as before, but let the retrieval algorithm implement a mixed unitary with probability one. This reduction is identical to the previous one, with the difference being that we treat the randomness of as part of the channel being applied. The accuracy of this Approximate SAR is given by
[TABLE]
This error measure is different in general from the error measure in the unitary estimation algorithmās promise, .
If is diamond-norm distance, then by the convexity of the norm, so in this setting Approximate SAR reduces to unitary estimation. However, can in fact be made much smaller, to ; see PropositionĀ 1.18. We can think about Approximate SAR as, roughly, the version of unitary estimation where we only need to be right āin expectationā. In particular, though the output of the unitary estimation algorithm is explicitly given, the output of the converted Approximate SAR algorithm is not, so we might not know what it is.
Remark 1.17**.**
(Continued fromĀ RemarkĀ 1.16) If , the entanglement infidelity, the accuracy of a unitary estimation algorithm is equal to that of its converted Approximate SAR algorithm. (Note that is not a metric.) To see this, recall that the squared quantum state fidelity between a pure state and a mixed state is linear inĀ : . Then, with a maximally entangled stateĀ , we have
[TABLE]
where we omitted subsystem indices and identity tensor factors. Hence, .
One can use this fact about entanglement infidelity to show that the variance of a unitary estimation algorithm bounds the error of its converted Approximate SAR algorithm. In some sense, this gives a quadratically better version of RemarkĀ 1.16, since -error unitary estimation can give -error Approximate SAR.
Proposition 1.18** ([L]emma 2).**
*Yang2020]
If a unitary estimation algorithmĀ with an output distributionĀ is unitarily covariant, i.e., for all , then*
[TABLE]
Therefore, by RemarksĀ 1.17 andĀ 1.9,
[TABLE]
1.3 Prior work
As far as we are aware, ours is the first work to study unitary estimation with the more stringent diamond-norm distance as its accuracy measure. We survey the existing literature on related topics, and then compare it to our result.
Standard process tomography.
Standard quantum process tomographyĀ [NC12, Chapter 8.4.2] solves the task of general-channel estimation problem by preparing a basis of quantum states, passing them through the channel, and performing state tomography on the results. One has to take care to analyze how the error bounds for state tomography affect the error bounds for channel estimation; see, e.g.,Ā [LSS*+*20] estimation of minimum gate fidelity for bounds of the form for general-channel estimation with respect to minimum fidelity. For the special case of unitary channels one can get improved bounds, as one only needs to work with Ā pure-state estimation tasks. Naively analyzing this strategy yields an sample upper bound, as an error for each state estimate can compound to a error in diamond-norm distance for the resulting channel. But as we show in SectionĀ 2, with care this method can be used to obtain query complexity for unitary estimation with respect to diamond-norm distance. An advantage of this method is that it uses zero space overhead.
Another well-known approach to quantum process tomography is the ancilla-assisted method dating back to LeungĀ [Leu00]. Here one prepares the maximally entangled state in dimensions, passes the first half through the channel, and uses state tomography. Again, if the channel is unitary one can use pure state tomography, and with this approach it is not hard to deduce that queries suffices to obtain unitary estimation with respect to entanglement infidelityĀ . Besides using space overhead, this approach also only gives an query complexity bound for diamond-norm distanceĀ , via the relations PropositionsĀ 1.6 andĀ 1.9. However, like with standard process tomography, with a tighter analysis, one may be able to give an query complexity bound.
Unitary estimation.
The specific task of unitary estimation was perhaps first studied by AcĆn, JanĆ©, and VidalĀ [AJV01], where representation theory was used to determine an optimal algorithmic strategy with respect to entanglement infidelity under the assumption that the unknownĀ is applied in parallel to one half of a bipartite system. Asymptotic analysis of the error was not given, however. Later, Peres and ScudoĀ [PS02] gave an alternate method establishing that queries suffice to obtain entanglement infidelityĀ in the case . Then Bagan, Baig, and MuƱoz-TapiaĀ [BBMT04a] established the same scaling for the method fromĀ [AJV01]; and, theyĀ [BBMT04b] and Chiribella, DāAriano, Perinotti, and SacchiĀ [CDPS04] did the same for a similar method that didnāt require entangled measurements on both parts of the bipartite system. See also independent work of HayashiĀ [Hay06]. Chiribella, DāAriano, and SacchiĀ [CDS05] again showed a more general optimality result for entanglement infidelity, implying that is a lower bound for this accuracy measure when ; however, the asymptotic dependence forĀ was not established. Later, KahnĀ [Kah07] showed that the optimal scaling for generalĀ is of the form for some functionĀ , but was unable to asymptotically analyze it.
Finally, in 2020, Yang, Renner, and ChiribellaĀ [YRC20] were able to analyze the optimal unitary estimation algorithm for entanglement infidelity and showed that it achieves errorĀ with queries. Moreover, they showed PropositionĀ 1.18, and since their algorithm is unitarily covariant, this implies an Approximate SAR algorithm with with respect to diamond-norm errorĀ . We remark that this optimal unitary estimation algorithm fromĀ [YRC20] applies the unknown unitaryĀ in parallel and hence has space overhead on the order of the query complexityĀ .
Comparison to [YRC20].
Our main result recovers the unitary estimation (with respect to entanglement infidelity) and Approximate SAR (with respect to diamond-norm distance) results of Yang, Renner, and ChiribellaĀ [YRC20]. First, by norm conversion PropositionĀ 1.9, our algorithm with achieves the guarantee , giving the unitary estimation guarantee. Second, since any algorithm can be made unitarily covariant, using the same reduction of PropositionĀ 1.18, we achieve an Approximate SAR algorithm with with respect to diamond-norm error .
Neither of these results from [YRC20] imply our result of unitary estimation to diamond norm error. Through norm conversionĀ PropositionĀ 1.9, a -query unitary estimation algorithm with respect to entanglement infidelity implies a -query algorithm with respect to diamond-norm distance . This conversion is tight, even if the unitary estimation algorithm is unitarily covariant. Also, as discussed in RemarkĀ 1.16, the output of an approximate SAR algorithm is correct in expectation, but any individual output need not be close in diamond norm.
Finally, our algorithms have space overhead of zero, improving over [YRC20] and making our algorithm significantly closer to practical. Note that the main figure of merit in [YRC20], program cost, denotes the size of the output of the storage algorithm, and is different from the space referred to here, which is the space complexity of the storage algorithm (DefinitionĀ 1.13).
Comparison to [vACGN23].
Van Apeldoorn, Cornelissen, Gilyén, and Nannicini shows that, given a unitary implementing an unknown state , one can compute an estimate which is -close in Euclidean norm with probability [vACGN23, Theorem 23]. Their algorithm uses applications of the controlled black box unitary and its inverse , along with qubits. This implies an algorithm for unitary estimation, by using this state tomography algorithm for standard process tomography (see Propositions 2.3 and 2.2) which uses applications of and and space overhead. This algorithm is based on quantum singular value transformation, so the use of and and the nonzero space overhead appear to be inherent limitations of their approach. Similarly, other standard primitives in quantum algorithms like amplitude estimation, which we might ordinarily reach for when aiming for a quadratic improvement in error, have similar limitations in terms of requiring stronger access to or space overhead. Our algorithm is more direct and so does not lose anything in query complexity or space complexity, and only uses queries to , and not or . The gate complexity of both algorithms is similar.
Comparison of techniques to prior work.
Existing algorithms for estimating a parametrized class of gates with Heisenberg scaling, like metrology with a GHZ stateĀ [JWD*+*08] and robust phase estimationĀ [KLY15], proceed by applying the black box many times in parallel or in series (respectively). Our algorithm uses the same principles (see the warmup in Fig.Ā 1), but as discussed in the next section, naive generalizations of these approaches cannot work. We introduce a novel āshift to identityā step to avoid areas in the space of unitary channels where applying the black box in series fails; this adaptivity allows us to extend robust phase estimation, which can estimate a parameter on the complex unit circle, to work for elements in a much trickier space.
The process tomography protocol most similar to the algorithm presented in our work is that of gate set tomographyĀ [NGR*+*21], which can achieve Heisenberg scaling, , in regimes of practical interest by interleaving long sequences of gates. This is qualitatively similar to what our algorithm does, though we are not aware of work giving theoretical bounds on such protocols.
Prior work on lower bounds.
Lower bounds on channel estimation and discrimination are just as well-studied as upper bounds. Some prior work has tried to characterize the optimal strategy for various problems, such as showing they are sequential or parallelĀ [DFY07, BMQ22]. As mentioned previously, though these results describe optimal strategies for these problems, it is unclear if they imply lower bounds better than . Up to log factors, this lower bound is an immediate consequence of parameter counting (e.g.Ā [NC12, Chapter 8.4.2], [BKD14], which one can make rigorous with a Holevo boundĀ [CvDNT99]) and the optimality of Heisenberg scaling. However, these two arguments do not combine naturally to get the bound we would expect.
To our knowledge, we give the first jointly optimal lower bound, in the sense that for any decaying function our lower bound matches the query complexity of our algorithm up to a universal constant. Our lower bound combines prior work on unitary channel discriminationĀ [BMQ22] with a reduction technique from quantum query complexityĀ [CGM*+*09, BCC*+*17]. Though quantum query complexity focuses on diagonal unitaries, we observe that roughly the same technique can be applied to this general unitary estimation setting with some modification.
1.4 Techniques
An algorithm with suboptimal scaling.
A standard idea in process tomography is to estimate by doing state tomography on , , and so on, and then somehow collate these estimates into a full estimate of . Since each state tomography requires samples, this whole procedure can be done with applications of (TheoremĀ 2.1). There are two minor issues to address when formalizing this. First, an error in each state tomography could cascade to a error in the final estimate of . We address this by ensuring that the state tomography algorithm produces Haar-random error, so that with high probability the errors do not compound. Second, we can only estimate each column up to a phase, so we need to additionally deduce the relative phases between columns. We address this by showing that learning the columns up to phase of both and , being the discrete Fourier transform, suffices to deduce these relative phases (PropositionĀ 2.3).
Achieving the optimal Heisenberg scaling.
We wish to improve the dependence of process tomography to . Our strategy will be to use process tomography as a subroutine, but boost the error some other way. Specifically, we give a bootstrapping procedure (AlgorithmĀ 1) that estimates a unitary to error by making calls to a ābaseā algorithm that can only output unitary estimates to error , paying an factor of overhead in the query complexity (TheoremĀ 3.3). Using the unitary estimation algorithm with , as the base, this gives the desired query complexity.
A good first try is to get constant-error estimates for for going from 1 to , with the hope that the estimate of will refine the estimate of to error. Taking powers of an unknown gate with exponentially increasing degree is an existing method in quantum metrology for achieving the Heisenberg limit. In particular, we can view this as a non-coherent version of phase estimationĀ [KLY15], where the application of to an eigenvector of extracts the th bit of its corresponding eigenvalue. This argument can be shown to work in simple cases, such as when is a rotation in two dimensions (Fig.Ā 1), but if all the eigenvalues of are , then any power is eitherĀ orĀ so it is impossible to gain any better information about the eigenvectors ofĀ from its powers.
Nonetheless, we find that if the powers of are always close to the identity for all sufficiently high exponents (and hence no eigenvalue is close to ), then this idea works out. Specifically, we show in LemmaĀ 3.1 that if unitaries and are -close in diamond-norm distance, then and are -close, provided that and are -close to the identity. We can use this lemma to bootstrap a constant-error estimate to an -error one, provided we always apply the base algorithm to matrix powers that are close to the identity. Thus, we always bring the unknown unitary close to the identity: instead of running the base unitary estimation algorithm on , we run it on , where is our best estimate to so far in the algorithm. This recenters the unitary at the identity so that we can power it up even further. We note here that, upon formulating the right technical lemma to use (LemmaĀ 3.1), the analysis of the resulting algorithm (AlgorithmĀ 1) becomes surprisingly simple.
Lower bound.
To achieve our lower bound of , we consider a hard instance of the unitary estimation problem, in which one is asked to identify one of candidate unitary channels that are -apart from one another and are all -close to the identity in the diamond norm. Specifically, we choose the ensemble to be th-power of a net of reflections.
Simple arguments give a lower bound of , with being a generic lower bound for identifying one of candidate unitary channelsĀ [BMQ22] and being the number of applications of a channel necessary to discriminate two channels that are -close in diamond-norm distance. To improve this, we wish to argue that, since a unitaryĀ taken from the hard instance applies an th power of a reflection, the task of distinguishing is a factor of harder than the task of distinguishing the net of reflections. Intuitively, this holds because is essentially the identity for all but an fraction of the time it is applied.
This heuristic picture can be made precise by showing a reduction from āfractional-queryā algorithms to ādiscrete-queryā algorithmsĀ [CGM*+*09, BCC*+*17]. This reduction converts a circuit that calls , the th power of a reflection, into one that calls that reflection conditioned on some ancilla. So, hardness of a problem using queries to the reflection lifts to hardness of the problem using queries to . Translating to our setting, directly applying this reduction shows that a to distinguishing a net of reflections translates to a lower bound for the hard instance. This reduction is tight, though, so removing the term requires additional insight.
We consider again the reduction of [CGM*+*09, BCC*+*17] converting a fractional-query circuit to a discrete-query circuit. This reduction adds one ancilla qubit per oracle call toĀ and the joint state of all the ancillas is a superposition of some bitstrings. The average weight of the bitstring is an effective number of the oracle calls to the reflection, which is smaller than the naive query complexity forĀ by a factor ofĀ . Without much change in the output distribution of the overall algorithm, one can modify the algorithm to monitor the weight of the ancilla bitstring and actually reduce the number of oracle calls by the factor ofĀ . This reduction however comes at a price: the modified algorithm now requires postselection whose success probability is exponentially small in the query number. Addressing this small success probability is where one picks up the sub-logarithmic term. However, for this particular lower bound we need not address it: an exponentially small success probability is still hard to achieve for unitary discrimination. Bavaresco, Murao, and QuintinoĀ [BMQ22] gives a useful relation between the query complexity of a unitary discrimination problem and success probability, and we observe that small success probability is still meaningful as long as the algorithm is appreciably better than merely guessing the answer. This gives the optimal lower bound.
1.5 Discussion
Improving gate complexity.
We give an algorithm for unitary channel estimation that is query-efficient and space-efficient. However, we still need to run quantum circuits with depth, in addition to the oracle calls. If this could be improved to , it would make this algorithm significantly more practical. However, the depth is bottlenecked by the āshifting to identityā step, which requires a unitary synthesis and so is high depth. As discussed in SectionĀ 1.4, this step appears to be necessary to avoid the hard case of unitaries with eigenvalues.
A simpler question is whether the algorithm can be made gate-efficient. In view of this goal, we give a gate-efficient version of pure state tomography with optimal sample complexity in AppendixĀ C; prior gate-efficient algorithms lose log factors in the sample complexity. This should be able to give a query complexity algorithm with the desired gate complexity and optimal space complexity, though we do not prove this.
Estimating the eigenvalues of a unitary channel.
We show in AppendixĀ B that shifting to the identity is not necessary when we merely want to learn the eigenvalues of the unitary without the eigenvectors. In this Appendix, we describe how to use a (gate-efficient) control-free version of phase estimation to achieve this result (though notably, the estimates we achieve do not distinguish between eigenvalues of different multiplicity). This algorithm appears to be folklore. If one can reduce unitary estimation to instances of eigenvalue estimation, then this result would give gate-efficient unitary channel tomography with query complexity .
Generalizing beyond unitary channels.
Finally, we note that our results do not extend to general channels. For example, the channel that destroys the input and outputs the outcome of a biased coin requires queries to estimate. However, we leave open the question of analyzing this algorithmās tolerance to noise, and the related question of whether process tomography is possible for āclose-to-unitaryā channels.
2 Base tomography using applications
In this section we show a base tomography algorithm that has a quadratic dependence on the desired precision. We will use the āoperator norm up to phaseā distance for this section, defined in DefinitionĀ 1.4, which is equivalent to diamond norm up to constants.
Theorem 2.1**.**
There is a tomography algorithm that, given access to an unknown unitary , as well as parameters , applies at most times and outputs an estimateĀ satisfying except with probability at mostĀ . The algorithm uses only a -dimensional Hilbert space (in particular, only Ā qubits when ).
In fact, by standard methods (explicitly shown in PropositionĀ 2.4 below) it suffices to prove this theorem for a fixed confidence value less thanĀ , say .
Our algorithm will essentially learn the unknownĀ column by column, using the below pure state tomography result, PropositionĀ 2.2. This result was essentially previously known, but we will take care of a few minor details in SectionĀ 2.1.
Proposition 2.2**.**
There is a pure state tomography algorithm with the following behavior. Given access to copies of a pure state , it sequentially and nonadaptively makes von Neumann measurements onĀ copies of (using only -dimensional Hilbert space). Then, after classically collating and processing the measurement outcomes, it outputs (a classical description of) an estimate pure state
[TABLE]
such that: (i)Ā is a complex phase; (ii)Ā the infidelityĀ is at most except with probability at mostĀ ; (iii)Ā the vectorĀ is distributed Haar-randomly999Perfect Haar-randomness is only possible by making idealized assumptions about the algorithmās hardware; such technical issues of algorithmic complexity are deferred to FootnoteĀ 8. among all states orthogonal toĀ .
This column-by-column technique has the minor downside that each column estimate can be off by a different complex phaseĀ . However, there are a few simple ways to work around this flaw; in particular, the following result (proven in SectionĀ 2.1 below) gives a completely black-box method:
Proposition 2.3**.**
Let be a tomography algorithm as in TheoremĀ 2.1, except with the following weaker guarantee aboutĀ :
[TABLE]
Then there is a tomography algorithm (usingĀ twice) that achieves TheoremĀ 2.1, withĀ and in place of andĀ , and additional classical time complexity.
Putting all of the above together, we can establish our base tomography result:
Proof of TheoremĀ 2.1.
From the preceding discussion, it suffices to obtain a unitary tomography routine achieving EquationĀ 21 with failure probability at mostĀ . We apply the pure state tomography routine from PropositionĀ 2.2, with to be chosen later, on each of , where . This indeed uses at most times, within a Hilbert space of dimension onlyĀ , and produces estimates . Let denote the (possibly nonunitary) matrix with the ās as columns; we will show that
[TABLE]
except with probability at mostĀ . Then if we output any unitaryĀ satisfying (for example, , where is a singular value decomposition), then will satisfy EquationĀ 21, as desired.
To establish EquationĀ 22, write
[TABLE]
for as in EquationĀ 20. Then with and the matrix with ās as columns, we have
[TABLE]
We have
[TABLE]
except with probability at mostĀ , by a union bound. We will also shortly show:
[TABLE]
(Here is a universal constant.) Combining the above, we conclude
[TABLE]
except with probability at mostĀ , and this norm bound is at mostĀ (as needed for EquationĀ 22) provided we take the constant in small enough.
It remains to prove the claim from EquationĀ 26. Recall thatĀ has independent unit columns , with Haar-random orthogonal toĀ . Introduce i.i.d.Ā real random variables , whereĀ is distributed as for a Haar-random unit vector inĀ . If we further introduce i.i.d.Ā uniformly random complex phases , then the unit vectors
[TABLE]
are in fact Haar-random and independent. LettingĀ denote the matrix with the ās as columns, it is a standard fact in random matrix theory101010For example, ifĀ ās columns were independent Haar-random unit vectors inĀ (as opposed toĀ ) then [Ver18, TheoremsĀ 3.4.6,Ā 4.6.1] would directly yield that except with probability at mostĀ , for some constant . The generalization to the complex case is very minor. that, for some universal constantĀ , we have
[TABLE]
We can now rewrite EquationĀ 28 as
[TABLE]
from which we can conclude
[TABLE]
except with probability at most . Thus to complete the proof of the claim in EquationĀ 26, it suffices to show
[TABLE]
for some constantĀ . Note that for each constant value ofĀ , the random variableĀ has a continuous probability density onĀ ; from this observation, itās easy to deduce that it suffices to prove EquationĀ 32 under the assumption for some constantĀ . But this is easy: Ā has meanĀ and is sub-exponentially distributed with parameterĀ [Ver18, PropositionĀ 2.7.1, LemmaĀ 2.7.6, TheoremĀ 3.4.6]; hence andāwith a union boundāthis is more than sufficient for EquationĀ 32, onceĀ is sufficiently large. ā
2.1 Ancillary results for base tomography
We begin by giving a proof of the pure state tomography result we needed:
Proof of PropositionĀ 2.2.
This result was essentially proven inĀ [CL14, KRT17]. To be precise, we will refer to the analysis fromĀ [GKKT20, TheoremĀ 2]. The result therein is exactly what we need, except for the following distinctions:
- ā¢
Rather than making von Neumann measurements, [GKKT20] refer to performing the āuniform POVMā on each copy ofĀ ; this is the continuous POVM with elements labeled by unit vectors , in which the -element has density with respect to Haar measure onĀ . However, this is mathematically equivalent to first using classical randomness to choose a Haar-randomĀ , and then projectively measuring in the basis ofĀ ās columns.
- ā¢
The classical post-processing algorithms in [KRT17, GKKT20] do not necessarily output a pure (rank-one) hypothesis; they output a possibly mixed stateĀ , counting it as a success (in the case of [GKKT20]) when . This also means they do not explicitly confirm conditionĀ (iii) in PropositionĀ 2.2, concerning the Haar-randomness ofĀ .
But inspection of the actual algorithm in [GKKT20] shows that this second issue is easily fixed. The algorithm first forms , where the ās are the measurement outcomes. With measurements, this matrix is shown to satisfy except with probability at mostĀ . The authors of [GKKT20] then āroundāĀ to a quantum stateĀ by first diagonalizing it as for and , and then taking , where is the nearest probability vector toĀ . With this adjustment, they show that as needed.
Note that this last inequality implies that the closest rank- matrixĀ toĀ must satisfy . On the other hand, it is well known thatĀ is simply given by , whereĀ is formed fromĀ by zeroing out all entries except the largest. It follows that the zeroed-out entries sum to at mostĀ , and hence . If we now form , then is a rank- state with and hence and . Thus we may express and outputĀ (after slightly adjusting the constant onĀ ).
It only remains to observe that the process of formingĀ is completely symmetric with respect to the subspace orthogonal toĀ , and hence the āerror vectorāĀ is indeed distributed Haar-randomly. ā
Next, we give the algorithm for PropositionĀ 2.3, which fixes the column phases for our unitary tomography algorithm:
Proof of PropositionĀ 2.3.
Let denote the discrete Fourier transform, so . (The only property we use aboutĀ is that each entry has the same magnitude; if , one may use the Hadamard transform instead.) GivenĀ , our algorithm applies once to and once toĀ ; call the results , respectively. Except with probability at mostĀ , we get that there exist diagonal unitaries Ā andĀ such that
[TABLE]
Given this, we conclude that is close to , up to phases on rows and columns.
[TABLE]
We can then deduce by computing in time, and then essentially reading off the relative column phases. First, notice that by pigeonhole principle applied to Eq.Ā 34,
[TABLE]
Let denote the inequality in Eq.Ā 35. Since has magnitude , we can define to be , and it follows that
[TABLE]
By Eq.Ā 35, and hold for at least half of ās. Let denote the coordinate-wise (real and imaginary) median of . Then by Eq.Ā 37,
[TABLE]
Let be the matrix with on the diagonal. Then . Combining this with from EquationĀ 33 easily yields
[TABLE]
and so our algorithm may output . ā
Finally, we give the (completely standard) trick for boosting confidence:
Proposition 2.4**.**
Let be a learning algorithm for objects in a metric space with efficiently computable distance . Assume that on inputĀ , the algorithm outputs satisfying except with probability at mostĀ . Then there is an algorithmĀ that takes an additional inputĀ , uses at most times, and guarantees except with probability at mostĀ .
Proof.
For times, runĀ independently, obtaining . By a standard Chernoff bound, except with probability at mostĀ , there are at least āgoodā estimates, where we say is āgoodā if . By the triangle inequality, every good estimate is also ācentralā, where we say estimate is ācentralā if it has the following property: for at least estimates . Let now select and output any central estimateĀ ; the method of brute-force checking central-ness has time complexity times the cost of computing a distance, which is for matrices. Since , the Pigeonhole Principle implies that at least one of the estimates for which is also good. Thus the triangle inequality implies , as desired. ā
3 Bootstrap of precision to applications
3.1 Key lemma: Geometry of unitary groups
The purpose of this subsection is to recall some well-known metric notions on unitary groups and projective unitary groups; see [Sza97] and [KSV02, §8.3.3]. Only Lemma 3.1 will be used in later sections. Readers who are familiar with the intrinsic metric induced by operator norm may quickly proceed to the next subsection.
Consider the operator norm on the Lie algebraĀ of all -by- antihermitian matrices. By demanding left- and right-invariance we obtain a metric on a Lie groupĀ , as defined in DefinitionĀ 1.5. Recalling this definition, the length of a smooth path is given by
[TABLE]
where . The distance between two points of is the infimum of the lengths of all smooth paths connecting the two:
[TABLE]
This definition makes it obvious that is a metric, obeying the triangle inequality. Though is equivalent to the āextrinsicā metric , we will use the intrinsic metric in this section because it leads us to think in terms of the Lie algebra.
For any and any real number , we define an open metric ball of radiusĀ centered atĀ by
[TABLE]
As the notation suggests, due to the right invariance of the metric, the ballĀ is a shift of . We will writeĀ forĀ .
Every unitary quantum channel is defined by a unitary, but specifies the unitary only up to a global phase factor. This motivates us to consider projective unitary groupsĀ where is the center ofĀ consisting of phase factors. The dimension of a matrix in the centerĀ is implicit. The metric induces a metric on the projective unitary group for which we use the same notation. In analogy with , we write
[TABLE]
Strictly speaking, this is not a metric ball of the projective unitary group since is a subset ofĀ , but this will hardly cause any confusion below.
We consider the fractional power of a unitary in a small neighborhood ofĀ for any realĀ . The small neighborhood is actuallyĀ and we define
[TABLE]
for anyĀ withĀ , which we remark is a strict inequality. This is a proper definition because the exponential map is injective in the corresponding neighborhood ofĀ ; see LemmaĀ 3.2 below. We will use the following lemma in our accuracy boosting algorithm in the next subsection.
Lemma 3.1**.**
For any and ,
[TABLE]
The proof of this lemma will use the following.
Lemma 3.2** (Following Eq.Ā 5, Lemma 3, and Lemma 4 of [Sza97]).**
For anyĀ , there isĀ withĀ such thatĀ . IfĀ withĀ , thenĀ . Further, the following hold:
- (a)
For anyĀ such thatĀ , . 2. (b)
For anyĀ , . 3. (c)
For any , . 4. (d)
For any such that , .
We did not optimize the constantĀ inĀ ItemĀ 3.2(d).
Proof ofĀ LemmaĀ 3.1.
We first prove the non-projective version of the statement: consider and . Write and with . Then
[TABLE]
The inequalities follow from ItemĀ 3.2(b), ItemĀ 3.2(c), ItemĀ 3.2(d), and ItemĀ 3.2(b), respectively.
Now, to prove the lemma, without loss of generality let be their representatives in , and let be the global phase minimizing . Then
[TABLE]
So, , so we can use Eq.Ā 46.
[TABLE]
Proof ofĀ LemmaĀ 3.2.
The eigenvalues of a unitary are on the complex unit circle, each of which can be specified uniquely by an angle inĀ . So, the exponential map is injective on the domain where . This proves the first two statements.
ItemĀ 3.2(a): Let be the eigenvalues ofĀ . Without loss of generality, suppose for allĀ . Then the path taking goes fromĀ toĀ and has length . This shows that .
To show that , consider a smooth path joiningĀ andĀ . The first column of a unitary matrix is a -complex-dimensional unit vector, which corresponds to a point in the standard unit sphere . So, the first column of the matrixĀ , denoted , defines a smooth curveĀ . Since for any matrixĀ , we conclude that the length ofĀ is at least the length of the pathĀ under the standard Euclidean metric:
[TABLE]
The pathĀ connects the two points . It is well known that the length ofĀ is at least the (smaller) angle between and in the complex plane, which isĀ .111111 Since the standard sphere is a closed Riemannian manifold, one can appeal to the HopfāRinow theorem to obtain a geodesic realizing the distance between any pair of points, and characterize geodesicsĀ by the geodesic equationĀ , to conclude that a path of minimum length between any pair of points must be on a great circle.
ItemĀ 3.2(b): By unitary invariance of the two metrics, it suffices to prove the statement for . ForĀ , we observe that , implying that It follows byĀ ItemĀ 3.2(a) that .
[TABLE]
ItemĀ 3.2(d): Note that for any , we have
[TABLE]
This is a well-known inequality; see e.g. Eq.Ā (143) in arXiv version ofĀ [CST*+*21]. To prove it, notice that , so therefore,
[TABLE]
Then,
[TABLE]
Rearranging, we complete the proof. ā
3.2 Bootstrap algorithm
Using LemmaĀ 3.1, we can show that we can bootstrap a base tomography algorithm that achieves constant error to an algorithm that gets query complexity. Following SectionĀ 3.1, for this section we use the distance metric between two unitaries of (DefinitionĀ 1.5). By PropositionĀ 1.6, this is equivalent to diamond-norm distance up to universal constants.
Theorem 3.3**.**
Suppose we have an oracle capable of applying an unknown unitary channel . Further suppose we have an algorithm that, given such an oracle, can output a unitary such that with probability . Then, given error parameters , AlgorithmĀ 1 outputs a unitary such that
- (a)
* with probability ;* 2. (b)
and .
AlgorithmĀ 1* has the further property that, if uses queries to , then AlgorithmĀ 1 requires only queries to .*
By plugging in the base tomography algorithm from TheoremĀ 2.1 into this bootstrap algorithm, we obtain our main resultĀ TheoremĀ 1.1. We restate the theorem now.
See 1.1
Proof of TheoremĀ 1.1.
It is clear that our bootstrap algorithm needs as many quantum registers as the base tomography algorithm does. The base tomography algorithm combines pure state tomography outcomes, each of which uses projective measurements in computational basis on a pure state of form where are unitaries known at the moment of the state tomography. SeeĀ TheoremĀ 2.1.
The query complexity and the accuracy guarantees follow fromĀ TheoremĀ 3.3, with TheoremĀ 2.1 as the base algorithm.
The number of one- and two-qubit gates used in the overall algorithm beyond the oracle calls, is determined by the complexity of implementing interspersing unitaries in the preparation of statesĀ . Those unitaries are on a -dimensional qudit, so it can be compiled to accuracyĀ in operator norm by elementary gatesĀ [KSV02]. We only need . The algorithm prepares states of form in total.
The classical time complexity is bounded by those of matrix multiplications, matrix diagonalizations, and finding diagonal phase factors in PropositionĀ 2.3. All of these take time.
The bound on follows from TheoremĀ 3.3, which shows that AlgorithmĀ 1 gives , and PropositionĀ 1.18. Rescaling if necessary, we complete the proof. ā
The bootstrap algorithm (AlgorithmĀ 1) is essentially four lines long: we begin with as an initial estimate for , and with every iteration, we refine this estimate. In iteration , we take the current estimate , and consider the āresidualā : the residual is close to the identity when is close to . We run to get an estimate of . Though this estimate is only good to constant error, by LemmaĀ 3.1, its th root will be -close to . So, we can use this root to form the next estimate , which is -close to . After iterations, this estimate is good enough to output. We have just given nearly a full proof of the first part of TheoremĀ 3.3. We swept two details under the rug: first, we need that is close enough to the identity to satisfy the conditions for LemmaĀ 3.1, which follows from induction; and we need to account for failure. We deal with these details below.
Proof of TheoremĀ 3.3.
Using the trick in PropositionĀ 2.4, we assume that our oracle can boost its success probability to with only overhead to output a unitary thatās correct up to error.
We analyze the output ofĀ AlgorithmĀ 1 for arbitraryĀ , first assuming that AlgorithmĀ 1 is successful for all . Let
[TABLE]
be the error after iteration of the while-loop. We will prove . For iteration [math], we have so by our assumption on ,
[TABLE]
We proceed by induction inĀ . First, notice that the input and output of the call to on the th iteration are in :
[TABLE]
So, we can apply LemmaĀ 3.1 to conclude that
[TABLE]
Therefore, the final accuracy guarantee is
[TABLE]
The query complexity is sum of those at AlgorithmĀ 1 for all : For any given , this call queries the oracle for for times, which can be performed using queries toĀ . Summing over all the iterations, we get the desired total query complexity,
[TABLE]
Notice that is chosen so the query complexity does not incur the factor that comes from a naive union bound. The failure probability of AlgorithmĀ 1 for eachĀ is , so the probability that any of them fail is bounded by , proving ItemĀ 3.3(a):
[TABLE]
Now, we prove ItemĀ 3.3(b). ItemĀ 3.3(a) only implies a bound of , since when the algorithm fails, the output unitary could be as bad as possible, away from . We prove now that AlgorithmĀ 1 fails gracefully: namely, if the algorithm first fails in iteration , then the output only has error . Let be the least index of the iteration where the base tomography algorithm fails, so that , and for . If the algorithm never fails, we take . The argument we described above still applies up to indexĀ , so that
[TABLE]
We can further conclude that the output of the algorithm will be about as good of an estimate to as , since the size of the adjustment in each iteration decays exponentially regardless of failure.121212 In the following computation, we use that , which holds because the eigenvalues of are in for all with eigenvalues in . However, there is a technical issue: AlgorithmĀ 1 cannot be performed if has as an eigenvalue, since in this case, the -th root of this arbitrary output is not defined. One can and should ignore such nongeneric behavior, but strictly speaking our algorithm is not defined for such cases. Formally, one should address this by changing the choice of discretization (FootnoteĀ 8) to ensure that the algorithm never outputs an estimate with as an eigenvalue. This can be done since it only excludes a measure-zero set.
[TABLE]
This suffices to give a bound on the quality of the output in expectation.
[TABLE]
Remark 3.4**.**
Our bootstrap crucially depends on the continuous group structure of the (projective) unitary group since we take fractional roots of a result from base tomography and the the fractional root needs to in the set of candidate outputs. It is not important whether the set of candidate outputs is a unitary group or a projective unitary group. Conversely, our bootstrap works for any closed Lie subgroup of the (projective) unitary group (e.g., an orthogonal group ) and an obvious analog ofĀ TheoremĀ 3.3 is proved verbatim. Note that if a subgroup is not closed, then the intrinsic metric and the extrinsic one are not equivalent and our bootstrap will not work. Also note that some care would be needed in the base tomography; the Fourier or Hadamard transform in the proof ofĀ PropositionĀ 2.3 might not belong to the Lie subgroup in consideration.
4 Lower bounds
The goal of this section is to prove TheoremĀ 1.2. The first subsection will prove a lower bound using existing results on unitary discriminationĀ [BMQ22], which we lift to a lower bound in the following section through a variant of an argument proving the equivalence of the fractional and discrete query modelsĀ [CGM*+*09, BCC*+*17].
4.1 Bound for constant
As a warm up, we first prove a query complexity lower bound for constant . We begin by constructing a hard instance of the estimation problem, which is a discrete set of unitaries in which each pair of unitaries is at least distance apart. For this instance, the task of estimating each unitary to error is equivalent to the task of simply identifying the unknown unitary.
Proposition 4.1**.**
There exists a set with of Hermitian unitaries (i.e., the eigenvalues are ) such that for any .
The constants 64 and are not sharp. A construction of some packing net of unitaries goes back at least toĀ [Sza83]. Here we construct a special net of unitaries that share the same real eigenspectrum.
Proof.
Considering -by- Pauli matrices embedded into a higher dimensional unitary group, we see that the claim is true for . Let if is even, or if is odd.
We first show the existence of a set of far apart unitaries in operator norm. LemmaĀ 8 ofĀ [HHJ*+*17] shows that there are density matrices of rankĀ in dimensionĀ where exactly half the eigenvalues are and the other half are zero, and any two different density matrices in this set are at least -apart in trace distance. We may write this set as with where is a Hermitian unitary of trace zero. If , we have
[TABLE]
Therefore, the set consists of Hermitian unitaries that are far apart in operator norm. We now need to define unitaries that are far apart in the norm used in the statement of the proposition.
Now, embed into as , where or . We claim that is a desired set. Consider and for . Then, for all ,
[TABLE]
Minimizing over all and using Eq.Ā 7, we get that . Finally, for . ā
The following result upper bounds the average success probability of any unitary identification algorithm that makes at most uses of the unknown unitary:
Proposition 4.2** (TheoremĀ 5 ofĀ [BMQ22]).**
Let by any unitary channels. For any unitary estimation algorithm that queries an input unitary channel times and outputs an indexĀ given with probability where , it holds that
[TABLE]
Ref.Ā [BMQ22] contains a proof that relies on results from several references therein. For the readersā convenience, we present an overview of the proof inĀ AppendixĀ D.
From this, it straightforwardly follows that if we use the set we just constructed, if is much smaller than , then we cannot solve the identification problem with high probability. But we can also prove something stronger:
Lemma 4.3**.**
If where , and for some constants , thenĀ for some constant that depends only on .
In combination withĀ PropositionĀ 4.2, this immediately implies a query lower bound for any unitary estimation algorithm that estimates to small constant distance with constant success probability, since one can use it to distinguish between the unitaries in the net from PropositionĀ 4.1 with constant success probability. We do not use the full power of this lemma to draw this conclusion, but we will need it in its full form in the next subsection.
Proof.
Suppose for a contradiction for some . It is easy to show from Stirlingās approximation that for all where is the binary entropy function. Then, . Taking logs and dividing byĀ , we get
[TABLE]
For , the left-hand side is at least . This completes the proof. ā
4.2 Bound for general
Weāre now ready to prove the stronger -dependent lower bound. Our high-level strategy is as follows. First, we construct a special set of unitaries that are pairwise apart using our constructed set . This constructed set brings all the unitaries in closer to the identity (and hence each other) by raising each unitary to a small power , which will be of the order of . Then assuming there is an that uses queries to solve the unitary estimation problem on this set, there exists another quantum algorithm for identifying unitaries from the set with query complexity, but there is a catch: The query complexity stated is in the fractional query model, a model introduced by Cleve, Gottesman, Mosca, Somma, and Yonge-MalloĀ [CGM*+*09]. (In this model, it is cheaper to query a small power of a unitary compared to querying the unitary itself.) But we want to show our lower bound in the standard model assumed in TheoremĀ 1.2. So we use a modified proof of the equivalence of the fractional and discrete query models from Berry, Childs, Cleve, Kothari, and SommaĀ [BCC*+*17] to give an algorithm using queries in the usual sense, except with failure probability instead of . This extremely low success probability is not a problem, since PropositionĀ 4.2 and LemmaĀ 4.3 yield strong lower bounds even with small success probability. Thus any such algorithm, which used queries, must use queries, which gives as desired.
We start by defining some notation for the fractional power of a reflection. Following convention fromĀ [BCC*+*17], this is the power in the negative angle direction: with this definition, .
Definition 4.4**.**
For a reflection (i.e., ) and , we define
[TABLE]
We can now state our main conversion lemma, which takes an algorithm that queries a fractional power of and converts it to an algorithm that only queries and makes fewer queries, at the cost of having low success probability.
Lemma 4.5**.**
Let be implemented by a quantum circuit that uses queries to or , with for a reflection and .
Then there is a circuit with ancilla qubits, which uses queries to and such that, for all ,
[TABLE]
A similar conversion was also stated in [BCC*+*17, LemmaĀ 3.8], but their lemma converts the circuit that uses oracle calls to and to a circuit that uses oracle calls to and incurs -norm errorĀ in the output state. Here, is the fractional query complexity of the circuit, since in that model each application of and incurs cost. So their lemma is similar to ours, but incomparable: It can achieve much smaller error, but the query complexity has a log factor which would weaken our lower bound if used directly.
To eliminate this log factor, we will modify their argument and approximate the complete circuit instead of each individual segment. This gives a circuit with calls to , but with the significantly worse success probability of . This exponentially decaying success probability is generally undesirable, but will turn out not to affect the lower bound argument, which merely needs that the success probability of the algorithm is not significantly better than guessing, which succeeds with probability .
Proof.
We closely follow the fractional query to discrete query reduction in [BCC*+*17]. We begin by writing a circuit to implement or from an application of . Consider the following circuit.
[TABLE]
Let us calculate the evolution of the state vector along this circuit:
[TABLE]
Using the circuit Eq.Ā 76 with either or , we can apply or , respectively, with one use of and postselection of the ancilla onĀ . This reproves [BCC*+*17, LemmaĀ 3.3] in the slightly more general setting where we may implement and the reflectionĀ is arbitrary, not necessarily diagonal.
Let denote the circuit augmented so that every instance of and is replaced with the circuit inĀ Eq.Ā 76 (where we suppose ) as shown in Fig.Ā 2. By composing Eq.Ā 77, we get that
[TABLE]
where , and the amplitude on the desired state satisfies
[TABLE]
Now it seems like we have made no progress, since the augmented algorithm naively uses queries to , which is the same number of queries used by the original algorithm. We would like to reduce this query complexity to . If , the is what we want. So, assume .
The observation that lets us drastically reduce the query complexity of this algorithm is that the initial state of the ancilla has very low weight on -bit strings with a large number of ās. In other words, it is mostly supported on low Hamming weight strings. Observe that the state is
[TABLE]
Here, we think of as being small, making . Indeed,
[TABLE]
If were replaced by an approximate state that is a superposition over only the bit strings of Hamming weight , if is large enough (roughly ), then the approximate state will be very close to the original state, and hence will not affect the output too much.
Then notice what the resulting circuit looks like: It has c gates, but the control register never holds a string of Hamming weight larger than . In other words, in no branch of the superposition do we ever apply more than times, although the circuit formally has copies of c. So morally, this circuit should only require queries to , not queries.
The argument for this is given more explicitly in [BCC*+*17, Page 28], but briefly, a bit string in the superposition can be thought of as encoding a partition of the circuit, dictating how long the circuit should be run before a query from is responded with instead of ; in this way, one can run the circuit with oracle queries, by conditioning the intermediate unitaries on the ancilla bit string.
Let denote the subnormalized state of obtained by discarding all computational basis components of Hamming weight . We consider a cutoff , where is the āmeanā bitstring weight, and is a parameter to be chosen. Then
[TABLE]
where we used a Chernoff bound since the error can be written as the probability that a binomial random variable takes a large value. With this, we can compute the error incurred by replacing with a normalized state .
[TABLE]
We take , so , and
[TABLE]
So, let denote the circuit of , modified so that upon being given , the circuit prepares the truncated instead of . This will be our final choice of circuit , so our goal now is to establish the claimed properties of the resulting circuit
[TABLE]
where is some state satisfying .
There is some phase ambiguity in definitions, since could have any phase as long as compensates for it so that the product of the phases is correct. So we will choose to have the same phase as , so that . If the ancilla state changes by in distance, since the rest of the circuit is unitary, this changes the output state of by error. Using this, we can conclude
[TABLE]
A consequence of this is that . Similarly, we can show
[TABLE]
This gives the desired properties. The number of queries made is . ā
We are now ready to prove the final lower bound, which we restate for the readerās convenience:
See 1.2
Proof.
Consider a quantum circuit with the assumed properties and that uses queries to the oracle, where is a function of and . Consider the net of reflections as described in PropositionĀ 4.1. We let . If we run a process tomography algorithm with the properties assumed in the statement on , then is promised to output an estimateĀ such that with probability using queries of and . Since , these and queries can be thought of as fractional queries to . We first show that the assumed output guarantee, run on an element of the net, suffices to determine which element is, upon choosing ; then, we show how to perform this algorithm, only using queries to , the doubly controlledĀ .
For the first step, suppose we have access to an unknown , and wish to identify which element it is. We can do this by applying to for (where we assume that so that ). On a successful run, which occurs with probability , the output satisfies . In this case, we can identify which element in the net it is by computing , and picking the element of the net it is closest to. The correctness guarantee implies that and are close in diamond norm:
[TABLE]
where above we used the triangle inequality and unitary invariance of the diamond norm. By the property of the net , only one element is ever within of any particular , so the closest net element to it must be the correct element .
Now, we show that can still be run given appropriate access to only. Applying LemmaĀ 4.5 on with queries to , we get a circuit that uses queries to . Further, if we run , the ancilla qubits will be measured inĀ with probability , and the resulting pure state will have -norm error from the true output ofĀ . We will perform , measure all of the ancilla in the computational basis, and if it isĀ (postselection), we output the output ofĀ (which we assume to be the measurement of the rest of the qubits in the computational basis); otherwise, we declare failure. Since produces a correct estimate with probability , and since closeness (in norm) of states implies closeness (in total variation distance) between the probability distributions resulting from measuring in the computational basis, Ā [BV97, LemmaĀ 3.6], we have
[TABLE]
To summarize, we have an algorithm that, with queries to , can output a classical description of such that with probability .
If we are promised that with the net from PropositionĀ 4.1, then choosing , the output of this algorithm can distinguish an element in the net with probability . It can also distinguish elements of the net , since identifying is equivalent to identifying . ByĀ PropositionĀ 4.2 andĀ LemmaĀ 4.3, any algorithm that can distinguish between all the elements in with probability , where are constants, must use queries. From this we can conclude the bound:
[TABLE]
Appendix A Deferred proofs for SectionĀ 1.1
See 1.6
Proof.
We can get Eq.Ā 6 from minimizing the terms in the inequality ItemĀ 3.2(b) over a global phase. So, it suffices to prove Eq.Ā 7, which relates diamond-norm distance to the other distances.
Let denote the eigenvalues of , and let denote their spread, i.e.Ā the length of the shortest arc (of the complex unit circle) containing the spectrum of :
[TABLE]
Then, examining the geometry, we can rewrite all of the distances in terms of this spread .131313 This reflects that all these norms are worst-case norms, in that they in some sense maximize error over all possible directions.
For diamond-norm distance, we get a simple formula using special properties for when the two channels are isometries.
[TABLE]
To get Eq.Ā 93, if the spread is , then the convex hull must contain the origin, and so the minimum is [math]; otherwise, the minimum is achieved at the midpoint of the extremal eigenvalues, according to the notation in Eq.Ā 92.
We can also see
[TABLE]
Using the following basic inequalities,
[TABLE]
we can conclude from Eqs.Ā 94 andĀ 95 that Eq.Ā 7 holds:
[TABLE]
See 1.9
Proof.
For , there is only one unitary channel (the identity channel), so all distances are zero and the equality holds trivially. So, suppose and, as in the proof above, let denote the eigenvalues of . Let be the length of a shortest arc on the unit complex circle that covers all the eigenvalues. By rotating if necessary, we may set and . Recall the following expressions.
[TABLE]
Since is a point in the convex hull, the first two lines here give the left-hand side of the proposition. For the right-hand side, if , then
[TABLE]
If , then and there must be some eigenvalue, say, where is in the interval , since otherwise there would be a shorter arc containing all the eigenvalues. It is easy to see by drawing a parallelogram on the complex plane that the magnitude of for varyingĀ assumes the largest valueĀ when is either end point of the intervalĀ . So,
[TABLE]
Therefore, if we set in all cases, we have
[TABLE]
where the second inequality in the second line uses and .
The tightness of the left-hand side is seen by setting where is even. The tightness of the right-hand side is seen by setting , with smallĀ and for all otherĀ . ā
See 1.18
The inequality follows from [YRC20, Lem.Ā 2], but we give another short proof (with worse constants) using the idea of āMixing Unitaryā Lemma ofĀ [Has16, Cam17].
Proof of Eq.Ā 19.
Write where . Since the distribution ofĀ is conjugation invariant,
[TABLE]
Then, for any stateĀ that may be entangled with some ancilla, we see, using nested commutators for and ,
[TABLE]
The result follows using norm equivalence (PropositionĀ 1.6). ā
Appendix B Estimating eigenvalues
In this section we describe an algorithm that, given access to an unknown unitaryĀ , outputs an approximation to the eigenvalues ofĀ that is close in ā-distance up to a global phaseā. Let us make this precise:
Definition B.1**.**
Let be two finite sets of āeigenphasesā. We write
[TABLE]
where denotes Hausdorff distance inĀ . In other words, iff, for some , the sets and have the following property (when all numbers are taken modĀ ): For every there is with , and vice versa (interchanging the roles of Ā andĀ ).
Our goal is to prove the following theorem:
Theorem B.2**.**
There is an eigenvalue estimation algorithm that, given and access to an unknown unitary , applies at most times and outputs the classical description of a setĀ such that except with probability at mostĀ , where is the set of ās eigenphases. (That is, .) Moreover, the quantum space requirement for the algorithm is only Ā qudits plus -qubit, and the gate complexity beyond the uses ofĀ is only .
The overall algorithm is similar to Phase Estimation. We begin as follows:
Proposition B.3**.**
There is an algorithm that, given , access to an unknown , and two eigenvectors , with eigenphases (respectively), has the following behavior: The algorithm uses one additional qubit, applies at most times, and outputs an estimateĀ that is withinĀ of , except with probability at mostĀ . In addition, at the end of the algorithm still holds , unentangled with its results.
Proof.
The algorithm strongly resembles Quantum Phase Estimation. Given in āregisters andĀ ā, suppose we:
- ā¢
Adjoin a qubit in a new āregisterĀ [math]ā.
- ā¢
Performed controlled-SWAP on registers , controlled on registerĀ [math].
- ā¢
Apply to registerĀ .
- ā¢
Perform controlled-SWAP again.
- ā¢
Detach the qubit in registerĀ [math].
It is easy to calculate that this procedure leaves registers Ā andĀ in the state , with the detached qubit in state . Up to a global phase, this is . We can then repeat this procedure but with , , , etc.Ā in place ofĀ , yielding qubits for all , at a cost of uses ofĀ . This is precisely the scenario arising within textbook Quantum Phase EstimationĀ [CEMM98], prior to its QFT. If we were to finish with QFT, we would essentially complete the proof, except that it would use qubits to achieve errorĀ with probabilityĀ . Instead, if we use Iterative Quantum Phase EstimationĀ [DJSW07], we only need one additional qubit to get the same guarantee. Finally, repeating the whole algorithm times and taking the median result completes the proof. ā
Proposition B.4**.**
There is an algorithm that, given , access to an unknown , and an eigenvector , has the following behavior: The algorithm uses one additional qudit and one additional qubit, applies at most times, and outputs the classical description of a setĀ such that except with probability at mostĀ , where is the set of ās eigenphases.
Proof.
Write for the eigenphase ofĀ on . The algorithm repeatedly does the following: Adjoin toĀ a second qudit in the maximally mixed state. Then use the routine from PropositionĀ B.3, with for some large constantĀ . The result is that except with probability at mostĀ , the routine uses applications ofĀ , and ends up holding the following: together with an -accurate estimate of , where is a uniformly random eigenvector ofĀ and is the associated eigenvalue. At this point, the difference may be recorded, and the register containing discarded.
By repeating this procedure times, the Coupon Collector analysis ensures that the algorithm will record -accurate values of for the at mostĀ distinct valuesĀ . (Except with probability at mostĀ , having chosenĀ appropriately.) It follows that the collection of recorded values satisfies (independent of whatĀ is), as required. ā
Remark B.5**.**
If then the log factors in the preceding analysis may be slightly improved. In this case, since the number of possible values for is only , one can use analysis of Non-Uniform Coupon Collecting to show that only repetitions are required. We omit further details.
Finally, TheoremĀ B.2 now follows by applying the algorithm from PropositionĀ B.4 with the maximally mixed state in place ofĀ .
Appendix C Gate-efficient pure state tomography with optimal sample complexity
Although it doesnāt seem to significantly improve the gate complexity of our algorithm, we record the following result:
Theorem C.1**.**
There is a tomography algorithm for -dimensional pure states, , with the following behavior. Given and copies of an unknown -qubit pure state , the algorithm sequentially measures the copies using von Neumann measurements and collects the classical results. The measurements are nonadaptively chosen using classical randomness, and each is implemented with gate complexity on the minimal number of qubits,Ā . Finally, after the classical results are processed in time, an estimateĀ is output, and except with probability at mostĀ it satisfies .
We were unable to find this theorem in the literature, although it follows from relatively standard ideas. It would seem that all previous works on gate-efficient pure state tomography (e.g.,Ā [GLF*+*10, CL14, KRT17, GKKT20]) have sample complexity no better than .
Proof.
Recall the standard pure state tomography algorithm from PropositionĀ 2.2 and its analysis from SectionĀ 2.1. Slightly more strongly we use that, with copies, it can be made to satisfy ([GKKT20, TheoremĀ 5])
[TABLE]
simultaneously for all constants , from which
[TABLE]
easily follows (for some constantĀ ). Hence for any positive even integerĀ ,
[TABLE]
Recall that
[TABLE]
where are independent Haar-random unitaries and denotes the column index of the th measurement outcome. For a given , let us define a ā-entryā to be an expression of the form , where are standard basis vectors.
Now
[TABLE]
which in turn is a sum of āmonomialsā, each of the form
[TABLE]
for a constant with and nonnegative integersĀ summing to at mostĀ . Note that the expected value of a monomial as in EquationĀ 109 is equal to
[TABLE]
since are independent.
Let us now introduce independent but pseudorandom unitaries . We writeĀ and and repeat the above development. Now suppose the following holds:
[TABLE]
Thenāusing that any -entry or -entry has magnitude at mostĀ āitās not hard to conclude that
[TABLE]
Thus for some the difference is at mostĀ and from EquationĀ 107 we get
[TABLE]
But when we have
[TABLE]
provided .
In summary, provided:
- ā¢
achieve EquationĀ 111 for and ,
- ā¢
each can be implemented with gate complexity ,
we get that except with probability at mostĀ , from which the result follows as in SectionĀ 2.1.
Let us now consider a generic product of -entries. Dropping the Ā subscript for notational simplicity, it looks like
[TABLE]
Since , the expectation of the above quantity is
[TABLE]
Since is a unit vector, it follows that the above expectation changes by no more thanĀ in magnitude if is replaced byĀ distributed as a quantum -tensor-product-expander (see DefinitionĀ 1 and equations (4)ā(6) ofĀ [BaHH16]). By the work ofĀ [BaHH16] (see also the latest strengthening fromĀ [Haf22]), we know that a -TPE can be generated by certain simple probability distribution on -qubit unitary circuits with gates (provided ). With and , this is gates, as needed to complete the proof of the theorem. ā
Appendix D Compressed proof ofĀ PropositionĀ 4.2
Here we give a summary of the proof of [BMQ22, Thm.Ā 5]. The content in this appendix is an excerpt of results inĀ [CDP08, CE16, HHHH10, BMQ22]. Our exposition will omit much of general discussion in those references, but present necessary pieces.
StepĀ 1: Quantum Testers.
It is well known that a quantum channelĀ (unitary or not) corresponds to a Choi operatorĀ , which is nothing but a density operator obtained by applyingĀ to an unnormalized pure density matrix that is maximally entangled with an ancilla of dimension equal to that of the input ofĀ . The Choi operator of the composition of quantum channels has a formula in terms of the Choi operators of the component channels, using so-called link product denoted byĀ Ā [CDP08]. Although the link product involves partial transposes (that generally do not preserve positivity) whenever an output of a channel is plugged in to another channel, the result of a link product is always a positive semidefinite (PSD) operator.
Let us call a sequence of quantum channels a quantum network, which is not necessarily a unitary. It has been observedĀ [CDP08] that even if some part of a quantum network is unspecified, one can associate a Choi operator to a quantum network by introducing a Hilbert space for each of input and output of an unspecified channel, giving a blank slot that will be filled by a quantum channel. Given a Choi operatorĀ for a quantum network with slots unfilled and with an outcomeĀ postselected, if we have Choi operatorsĀ of quantum channels to fill the slots, we can express the probability of obtainingĀ as
[TABLE]
where and is the link productĀ [CE16, §2.4]. This encompasses the scenario where the channels are not parallel but causally ordered since can include e.g. identity channels in between the slots. Here, is the full, rather than a partial, transpose ofĀ . Normally, a link product of two Choi operators takes partial transpose on the input Hilbert space of the causally succeeding factor, but since each channelĀ takes input from an open ālegā ofĀ and some other leg ofĀ takes input from the output ofĀ , we equivalently take full transpose ofĀ while keepingĀ intact. We may think of this as a consequence of the fact that is a one-dimensional operator, a number. Note that is a Choi operator of a quantum channel; in particular, if is that of a unitary channel, so is . The collection is called a quantum testerĀ [CDP09].
StepĀ 2: Semidefinite programming.
Now we consider the problem in the statement ofĀ PropositionĀ 4.2. There are candidate channels, labeled by . We are given access to uses of one of the candidates. Letting a quantum network to outputĀ , we have a tester that has blank slots to which we plug in identical channels, chosen uniformly at random from the candidates. Given uses of a candidate channel labeled byĀ , the probability of outputtingĀ is byĀ Eq.Ā 117 where is the corresponding Choi operator. The average probability of outputting a correct label is given by
[TABLE]
Now, each operatorĀ is a link product of some PSD operators, and hence is PSD. In addition, since probabilities must add up toĀ , we have for eachĀ .
Actually, the blank slots of a tester may be filled with arbitrary channels and must still produce a probability distribution. In particular, we must have
[TABLE]
for every that is the Choi operator of a unitary channel. Here we do not need to take transpose because the transpose of the Choi operatorĀ of a unitary channelĀ is still the Choi operatorĀ of the complex conjugate unitary channelĀ . This condition consists of infinitely many equations; however, one observesĀ [CE16] that this condition is equivalent to demanding for any affine combinationĀ in the affine span
[TABLE]
This seemingly more complicated condition simplifies the situation because any affine space in a finite dimensional vector space has a finite basis. So, the condition transcribes to a finite number of equationsĀ , one for each affine basis elementĀ ofĀ .
A set of PSD operators with the affine constraint may go beyond what a physical quantum network gives and may not qualify as a tester; we do not claim that physically realizable testers are fully characterized byĀ . Nonetheless, we consider a semidefinite programĀ [CE16] specified byĀ :
[TABLE]
Since any physically realizable tester is a feasible solution, the optimal value of this program is an upper bound onĀ . Suppose that is a feasible solution. Observe that
[TABLE]
because, if with , then
[TABLE]
It is an instance of the weak duality of semidefinite programming. The larger the classĀ is, the stronger our bound will be.
StepĀ 3: Some representation theory.
By construction, contains
[TABLE]
where is the Haar measure onĀ and is the Choi operator of the unitary channelĀ . We claimĀ [BMQ22] that for any ,
[TABLE]
which is in the form ofĀ Eq.Ā 122 and will therefore complete the proof ofĀ PropositionĀ 4.2. Now, Eq.Ā 125 is equivalent to sayingĀ [HHHH10] that there is with which for any vector where is a pure unnormalized state corresponding toĀ and is a unitary representation of a compact Lie groupĀ . It is standard to work in a basis where is block-diagonal, so where ranges over all inequivalent irreps occurring inĀ . The natural numberĀ is the multiplicity ofĀ withinĀ . Correspondingly we have and where each orthogonal summand, or , can be viewed as an entangled state across the irrep and the āmultiplicity space.ā LetĀ and . We may write where each is normalized. By CauchyāSchwarz,
[TABLE]
On the other hand, The average of conjugation by an irrepĀ is to trace it out and replace it by the identity, scaled to preserve the trace (Schurās lemma). Hence,
[TABLE]
Therefore, we have proved Ā [HHHH10].
It remains to show that . Because by definition, we aim to show
[TABLE]
As noted inĀ [BMQ22], this is a formula appearing in Schurās thesisĀ [Sch01, Eq.Ā (57)]. We note the following pointers to modern textbooks (e.g.Ā [FH04, App.Ā A]) to prove this identity. Obviously, an irrep appears inĀ if and only if it does inĀ . It is well known that the irreps ofĀ in the -fold tensor representation correspond to Young diagramsĀ withĀ Ā boxes and at mostĀ Ā rows (English notation). The characterĀ is given by the Schur polynomialĀ of degreeĀ evaluated at the eigenvaluesĀ ofĀ . An identity that is useful for us is the Cauchy identity
[TABLE]
where ranges over all Young diagrams with at mostĀ Ā nonzero rows but unlimited number of boxes. Here all the inverse polynomials should be understood as a formal power series. The dimensionĀ isĀ , so if we take the degreeĀ part of the right-hand side ofĀ Eq.Ā 129 and evaluate it at for allĀ , then we obtainĀ . This is equivalent to reading off the coefficient ofĀ in the series expansion ofĀ . Applying , the identityĀ Eq.Ā 128 follows.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AJV 01] A AcĆn, E JanĆ©, and G Vidal. Optimal estimation of quantum dynamics. Physical Review A , 64(5):050302, 2001. doi:10.1103/Phys Rev A.64.050302 . Ā· doiĀ ā
- 2[Ba HH 16] Fernando BrandĆ£o, Aram Harrow, and MichaÅ Horodecki. Local random quantum circuits are approximate polynomial-designs. Communications in Mathematical Physics , 346(2):397ā434, 2016. doi:10.1007/s 00220-016-2706-8 . Ā· doiĀ ā
- 3[BBMT 04a] Emilio Bagan, Mariano Baig, and Ramon MuƱoz-Tapia. Entanglement-assisted alignment of reference frames using a dense covariant coding. Physical Review A , 69(5):050303, 2004. doi:10.1103/Phys Rev A.69.050303 . Ā· doiĀ ā
- 4[BBMT 04b] Emilio Bagan, Mariano Baig, and Ramon MuƱoz-Tapia. Quantum reverse engineering and reference-frame alignment without nonlocal correlations. Physical Review A , 70(3):030301, 2004. doi:10.1103/Phys Rev A.70.030301 . Ā· doiĀ ā
- 5[BCC + 17] Dominic W. Berry, Andrew M. Childs, Richard Cleve, Robin Kothari, and Rolando D. Somma. Exponential improvement in precision for simulating sparse hamiltonians. Forum of Mathematics, Sigma , 5:e 8, 2017. doi:10.1017/fms.2017.2 . Ā· doiĀ ā
- 6[BCD + 10] A. Bisio, G. Chiribella, G. M. DāAriano, S. Facchini, and P. Perinotti. Optimal quantum learning of a unitary transformation. Physical Review A , 81:032324, 2010. ar Xiv:0903.0543 , doi:10.1103/Phys Rev A.81.032324 . Ā· doiĀ ā
- 7[BKD 14] Charles H. Baldwin, Amir Kalev, and Ivan H. Deutsch. Quantum process tomography of unitary and near-unitary maps. Physical Review A , 90:012110, Jul 2014. doi:10.1103/Phys Rev A.90.012110 . Ā· doiĀ ā
- 8[BMQ 22] Jessica Bavaresco, Mio Murao, and Marco TĆŗlio Quintino. Unitary channel discrimination beyond group structures: Advantages of sequential and indefinite-causal-order strategies. Journal of Mathematical Physics , 63:042203, 2022. ar Xiv:2105.13369 , doi:10.1063/5.0075919 . Ā· doiĀ ā
