Intrinsic and Measured Information in Separable Quantum Processes
David Gier, James P. Crutchfield

TL;DR
This paper explores how quantum processes can be measured and how classical information emerges from quantum sources.
Contribution
The paper introduces quantum information properties that bound classical information outcomes and synchronization methods for hidden Markov sources.
Findings
Quantum information properties of separable qudit sequences bound classical information outcomes.
Adaptive measurements enable synchronization to hidden Markov sources.
Tomographic reconstruction approximates quantum sources with classical models.
Abstract
Stationary quantum information sources emit sequences of correlated qudits—that is, structured quantum stochastic processes. If an observer performs identical measurements on a qudit sequence, the outcomes are a realization of a classical stochastic process. We introduce quantum-information-theoretic properties for separable qudit sequences that serve as bounds on the classical information properties of subsequent measured processes. For sources driven by hidden Markov dynamics, we describe how an observer can temporarily or permanently synchronize to the source’s internal state using specific positive operator-valued measures or adaptive measurement protocols. We introduce a method for approximating an information source with an independent and identically distributed, Markov, or larger memory model through tomographic reconstruction. We identify broad classes of separable processes…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15
Figure 16
Figure 17
Figure 18
Figure 19
Figure 20
Figure 21
Figure 22
Figure 23
Figure 24
Figure 25|
|
|
|
|
| |
|---|---|---|---|---|---|
| ( | ( | ( | ( | ||
| I.I.D. Qubit Process |
|
| 0 | 0 | 0 |
| Period-3 Process ( | 0 | 1 |
|
| 3 |
| Period-3 Process ( | 0 | 1 |
|
|
|
| Quantum Golden Mean ( |
|
|
|
| 1 |
| Quantum Golden Mean ( |
| 0.5505 |
|
|
|
| 3-Symbol Quantum Golden Mean | 0.6667 | 0.3333 |
|
|
|
| Unifilar Qubit Source ( | 0.9184 | 0.0816 |
|
|
|
| Nonunifilar Qubit Source ( | 0.7306 | 0.2614 |
|
|
|
| Unifilar Qutrit Source | 0.8002 | 0.7848 |
|
|
|
- —Foundational Questions Institute and Fetzer Franklin Fund (a donor-advised fund of the Silicon Valley Community Foundation)
- —U.S. Army Research Laboratory and the U.S. Army Research Office
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Mechanics and Applications · Quantum Information and Cryptography · Statistical Mechanics and Entropy
1. Introduction
Determining a quantum system’s state requires grappling with multiple sources of uncertainty, including several that do not arise in classical physics. Irreducible limits on measurement, in particular, have been a hallmark of quantum physics since Heisenberg introduced the position–momentum uncertainty principle in 1927 [1]. Similar incompatible measurements exist for generic pure quantum states [2].
For a 2-level quantum system or qubit, it is impossible to simultaneously measure the value of a spin in the x, y, and z directions. (Stated mathematically, the Pauli matrices , , and do not commute). Additionally, a single measurement in each basis is insufficient. One must measure many copies in each basis to specify the distribution of outcomes. As a result, determining an unknown qubit state through quantum state tomography requires measuring a large ensemble of identical copies with a set of mutually unbiased bases [3] or a single informationally complete positive operator-valued measure (POVM) [4].
These sources of uncertainty are familiar in quantum physics. Contrast them with when an observer receives a sequence of correlated qubits. Measuring them one by one, what will they see? And, what then can they infer about the resources necessary to generate these qubit strings? The following answers these questions by teasing apart the sources of apparent randomness and correlation in measured quantum processes.
1.1. Quantum and Classical Randomness
Also in 1927, von Neumann formulated quantum mechanics in terms of statistical ensembles and quantified the entropy of these mixed quantum states. In doing so, he extended Gibbs’ work on statistical ensembles and classical thermodynamic entropies to the quantum domain [5]. A mixed quantum state has an entropy , which is now known as the von Neumann entropy. ( is the trace operator). if and only if is a pure (nonmixed) quantum state. On the one hand, the von Neumann entropy is key to understanding quantum systems, particularly those with entangled subsystems that exhibit nonclassical correlations. On the other, the uncertainty quantifies a generic feature of statistical ensembles. It does not correspond to any particular quantum mechanical effect.
These two forms of uncertainty—due to ensembles and to quantum indeterminacy—are combined within the framework of quantum information theory, which generalizes classical information theory to quantum observables [6]. One notable example is noiseless coding. Shannon quantified the information content produced by a noiseless classical independent and identically distributed (i.i.d.) information source—one that emits a state drawn from the same distribution at each timestep [7]. Schumacher’s quantum noiseless coding theorem generalized this to quantum information sources. This gave a new physical interpretation of the von Neumann entropy: For an i.i.d. quantum source emitting state , is the number of qubits required for a reliable compression scheme [8].
1.2. Sources with Memory
Non-i.i.d. stationary information sources inject additional forms of uncertainty. For example, a source may have an internal memory that induces correlations between sequential qubits and therefore between measurement outcomes. Such correlations may be purely classical or uniquely quantal in nature. As we will show, an experimenter who assumes (incorrectly) that such a source is i.i.d. and then applies existing tomographic methods will not detect these correlations and so will overestimate the source’s randomness and underestimate its compressibility.
Classical memoryful sources are described within the framework of computational mechanics, in which stationary dynamical systems serve as information sources with their own internal states and dynamic [9]. Sequential finite-precision measurements of a dynamical system form a discrete-time stochastic process. The resulting process’ statistics allow one to construct a model of the source and calculate its asymptotic entropy rate, internal memory requirements, and other physically relevant properties [10,11]. Importantly, the uncertainty associated with sequential measurements of a classical information source can be reduced, sometimes substantially, by an observer capable of synchronizing to the source’s internal states [12,13].
Subjecting an open quantum system to sequential qudit probes presents a similar but more general challenge, as the amount of information an observer can glean from an individual qudit through measurement is limited. Recent results established that applying particular measuring instruments to qubits induces complex behavior in measurement sequences [14]. Here, we extend these results by studying the properties of the quantum states themselves in addition to particular sequences of measurement outcomes.
The following introduces novel quantum-information-theoretic properties for sequences of separable—i.e., nonentangled—qudits. We build on previous results that focused on entropy rates, compression limits, and optimal coding strategies for stationary quantum information sources [15,16,17], as well as on results for specific experimentally motivated deviations from the i.i.d. assumption [18]. The approach is distinct from but complements recent efforts on quantum stochastic processes in which an observer measures a quantum system directly. This is complicated due to the latter’s interaction with an inaccessible environment that induces memory effects in sequential measurement outcomes [19,20,21,22,23].
Section 2 introduces classical processes, separable qudit processes, and methods of transforming from one to the other via classical-quantum channels and measurement channels. Then, Section 3, in concert with Appendix A, defines the entropies associated with quantum and classical processes, respectively. Adapting Ref. [11]’s entropy hierarchy, we employ discrete-time derivatives and integrals to obtain a family of distinct quantitative measures of quantum process randomness and correlation. We prove that, for projective or informationally complete measurements, the sequences of measurement outcomes form classical processes whose information properties are bounded by those of the quantum process being measured.
Section 4 then surveys examples of increasingly structured separable qubit and qutrit processes. Section 5 discusses how an observer can synchronize to a memoryful source—i.e., determine its internal state—through sequential measurement. Section 6 uses the resulting catalog of possible process behaviors to answer practical questions for an observer of a quantum process attempting to perform tomography. Finally, Section 7 draws out lessons and proposes future directions and applications, most notably extending the results to the experimentally realizable generation of arbitrary entangled qudit states [24,25] and using correlations as a resource to perform thermodynamic quantum information processing [26,27].
2. Stochastic Processes
We consider the output of an information source to be a discrete-time, stationary stochastic process. If the source output is a classical random variable— for each timestep t—we can directly apply the methods of computational mechanics [11]. Our goal is to extend these methods to describe separable sequences of qudits, each represented by a pure state in d-dimensional Hilbert space: at each timestep t. Given such a qudit sequence, one can perform repeated, identical measurements such that the outcomes form a classical stochastic process. Since one can choose to measure qudit states in many different bases, the properties of the classical measured process are determined by both the state of the correlated qudits and the measurement choice. Thus, the relationship between a quantum process and classical measured processes is one-to-many. Figure 1 illustrates this setup.
2.1. Classical Processes
A classical stochastic process is defined by a probability measure over a chain of random variables:
with each taking on values drawn from a finite alphabet . A block of ℓ consecutive random variables is denoted as . The indexing is left-inclusive and right-exclusive. A particular bi-infinite process realization is denoted as , with events taking values in a discrete set . Realizations of a block of length ℓ are known as words and denoted as . The set of all words of length ℓ is .
We consider processes that are stationary, meaning that word probabilities are time-independent:
for all and . A stationary process’s statistics are fully described by the set of length-ℓ word distributions . A block of length ℓ has at most possible realizations (words).
One important subclass of processes are independently and identically distributed (i.i.d.) processes. The joint block probabilities of an i.i.d. process take the form
for all . This factoring of the block probabilities results in no statistical correlations between any random variables due to stationarity for all t for an i.i.d. process.
Another commonly studied subclass consists of the Markov processes for which the distribution for each depends only on the immediately preceding random variable . For Markov processes, the joint probabilities for finite-length blocks factor as
where is the probability distribution of random variable X conditioned on random variable Y.
Finally, there is the markedly larger subclass of hidden Markov processes that have an internal Markov dynamic that is not directly observable. Though the joint probabilities do not factor as in Equation (2), the internal Markov dynamic restricts the process statistics, as we describe next.
2.2. Presentations
A presentation of a process is a model consisting of a set of internal states and a transition dynamic between those states that together reproduce the process’s statistics exactly. A given process may have many presentations. We focus on those depicted with state transition diagrams (directed graphs) that generate stationary, discrete-time stochastic processes in a natural way.
A Markov chain is a process presentation defined by the pair :
- A finite alphabet of m symbols .
- A transition matrix T. That is, if the source emits symbol , with probability , it emits symbol next.
The stationary distribution for a Markov chain is denoted as , and it is a distribution over internal states in that satisfies . For a Markov chain, the set of internal states is exactly the set of emitted symbols, since the probability distribution for the next symbol is completely determined by the previous symbol. We represent each state as a node in a graph and each transition as a directed edge between nodes labeled by the associated probability.
Markov chains are sufficient to represent Markov processes, but we can describe the more general class of hidden Markov processes by allowing for internal states that are not directly observable. These processes are generated by hidden Markov chains (HMCs), which are defined by the triple :
- A finite set of internal states.
- A finite alphabet of m symbols .
- A set of m symbol-labeled transition matrices. That is, if the source is in state , with probability , it emits symbol x while transitioning to state .
We represent each possible transition between states as an edge between their nodes labeled with the emitted symbol and the transition probability. An HMC’s stationary distribution over uniquely satisfies .
Any HMC that exactly reproduces a process’s statistical features is a generative HMC. This is an important distinction, since only some of those also belong to the more restrictive class of predictive HMCs. An HMC is predictive if its state at time is completely determined by the state at time t and the emitted symbol. This property is known as unifilarity.
At this point, we must emphasize the difference between a process and a particular presentation of that process. This distinction is critical when designating processes and models to be ‘classical’ or ‘quantum.’ A discrete-time classical stochastic processes is classical because it consists of a chain of classical random variables. Markov chains and HMCs are classical models because their internal states and dynamics are both classical. One may instead construct a presentation of a classical process with a set of quantum states that the model transitions between via some quantum dynamic. An observer can recover the classical process’ statistics by taking sequential measurements on either the system or on ancilla qudits that interact with the system at each timestep. The simulation of classical stochastic processes with quantum resources is the objective of quantum computational mechanics. There, a class of quantum models (q-simulators) shows advantage in terms of memory requirements over provably minimal classical predictive models ( -machines) [28,29,30,31,32]. Likewise, different presentations of a quantum processes may have an underlying dynamic that is either classical or quantum. We turn now to quantum processes and their presentations.
2.3. Quantum Processes
Discrete-time classical stochastic processes consist of one classical random variable for each timestep. Likewise, discrete-time quantum stochastic processes consist of one quantum state at each timestep. We first describe an i.i.d. quantum information source and then generalize to sources with memory.
2.3.1. Memoryless
A discrete-time quantum information source emits a d-level quantum system or qudit at each timestep. The statistical mixture of the infinite qudit sequences emitted by a source is a quantum process. As in the classical setting, different classes of quantum processes are distinguished by their temporal correlations. Now, however, for quantum sources we must use quantum information theory to account for both classical and quantal correlations.
First, consider the output of an i.i.d. (memoryless) quantum information source. Let be a d-dimensional Hilbert space with pure states . A d-level i.i.d. quantum information source consists of a set of pure qudit states and a probability distribution over those states such that for all . We refer to as a quantum alphabet and consider only quantum alphabets with a finite number of pure states.
At each discrete timestep t, the source emits state with probability . The resulting ensemble is described by the density matrix:
This particular pure-state decomposition of is not unique. Moreover, an observer cannot determine through observations—unless consists of only one state—since many pure-state ensembles correspond to the same density matrix.
If an i.i.d. source emits at each timestep, then the quantum process generated by the source is simply the infinite tensor product state:
2.3.2. Memoryful
We cannot describe non-i.i.d. sources using a single probability distribution over but must introduce a probability distribution over sequences of states drawn from . We do this by associating each element of with an element in the symbol alphabet of an underlying classical stochastic process . Infinite qudit sequences then inherit probabilities from . This construction results in qudit sequences that are separable—i.e., not entangled.
We express the relationship between symbols and pure quantum states via a memoryless classical-quantum channel , taking . This is also known as a preparation channel (or encoder); see Appendix B for more.
Preparation channels are dual to measurement channels, described later, that map quantum states to classical probability distributions and, via sampling, to particular symbols.
For the classical process whose realizations consist of symbols , the associated quantum alphabet is constructed by passing each element of through such that . Thus, is completely determined by and . For example, in the i.i.d. case, Equation (3) can also be written as , and each possible pure-state decomposition of can now be interpreted as a different combination of classical random variable X and preparation channel .
In a slight abuse of notation, we write to indicate that quantum process is formed by passing each random variable of through the classical quantum channel .
Note that an infinite qudit sequence (separable or entangled) can be viewed as a one-dimensional lattice of qudits indexed by . These possibly entangled states can be described in full generality using an operator algebraic approach. We ground our formal definition of quantum processes in this mathematical setting. (Reference [17] provides a more detailed treatment of observable algebras for entangled qudit sequences over ).
Let be the d-dimensional matrix algebra describing all possible observables on lattice site t. (For , the space of observables is spanned by the identity and the complex Pauli (Hermitian, unitary) matrices). The state of the qudit at site t can be described by the density matrix acting on of dimension d. For a block of ℓ consecutive qudits, all observables can be described by the joint algebra over ℓ sites of the lattice, , and the state of this block is acting on , a Hilbert space of dimension . Combining all local algebras allows one to define an algebra over the infinite lattice. A quantum process is a particular state over the infinite lattice and can be written as .
As a necessary first step and to more readily adapt information-theoretic tools from classical processes, we return to the more restricted case: separable sequences of qudits drawn from a finite alphabet of pure qudit states. Given a classical word and a preparation channel , a qudit sequence takes the form
where ℓ is the length of the sequence, and . Note that , , and the number of possible qudit sequences of length ℓ is or—assuming all are distinguishable— .
A separable quantum process is then defined by . Different preparations—i.e., different combinations of and —may produce the same quantum process.
The set of length-ℓ-block density matrices for a quantum process is given by
where are the separable vectors given in Equation (5). Conveniently, their probabilities are determined by those of the underlying classical stochastic process : . Each is a finite subsystem of the pure quantum state over the infinite lattice. We use left-inclusive/right-exclusive indexing for density matrices as well.
For a given , one can also obtain a purification in a finite-dimensional Hilbert space [33]. It is important to note that, since does not have a unique pure-state decomposition, one cannot generally reconstruct the probabilities from it. Rather, contains only information accessible to an observer. And, if contains nonorthogonal qudit states ( for some , ), then an observer cannot unambiguously distinguish them.
In addition to separability, we also focus on stationary quantum processes, meaning
for all and . If is stationary, then will be stationary by construction.
For an i.i.d. quantum process, the joint probabilities of factor as in Equation (1), giving the quantum process the form of Equation (4). The length-ℓ-block density matrix is represented by a product state as follows:
with taking the form in Equation (3).
For an underlying classical process that is Markov, there are additional subtleties. Joint probabilities of factor as in Equation (2), so the joint probabilities of also factor so that
However, an observer cannot reliably distinguish between different states when measuring a quantum process, and the underlying Markov dynamic is hidden from observation. Thus, the general setting for memoryful quantum processes is that of hidden Markov processes. These are best introduced using concrete models that directly represent a process’s structure.
2.4. Presentations of Quantum Processes
A presentation for a quantum process is a model with internal states and a transition dynamic between them that emits pure quantum states rather than classical symbols. As for presentations of classical processes, we depict them with state transition diagrams. When is a Markov or hidden Markov process, can be represented with an extension of Ref. [14]’s classically controlled qubit sources (cCQSs) as follows.
A hidden Markov chain quantum source (HMCQS) is a triple consisting of
A finite set of internal states.A finite alphabet of pure qudit states, with each .A set of m transition matrices. That is, if the source is in state , with probability , it emits qudit while transitioning to internal state .
As with HMCs, the stationary distribution for an HMCQS satisfies .
Any HMCQS that exactly reproduces a quantum process is a generative HMCQS or generator of the process. Though quantum models cannot be predictive in the same sense as classical models, we can define an analog to classical unifilarity. An HMCQS is quantum unifilar if, for every state at time t, there exists at most a single measurement that determines the internal state at time . This closely parallels the definition of unifilarity in classical stochastic processes. We discuss several implications of quantum unifilarity later.
We call an HMCQS a classical controller of a quantum process, since there is nothing quantal about its internal states or transition dynamic. This is in contrast to related classes of quantum models that evolve a finite quantum system according to a quantum operation (defined via a set of Kraus operators) at each timestep. These include Quantum Markov Chains (QMCs) [34] and Hidden Quantum Markov Models (HQMMs) [35,36,37]. While HMCQSs emit separable quantum states, QMCs and HQMMs generate sequences of measurement outcomes (each corresponding to a particular Kraus operator) that form classical stochastic processes.
Anticipating future effort, we consider it worthwhile to draw out several observations on entanglement between successive qudits at this point. Entanglement means that finite-length qudit sequences are not separable and so are not described by Equation (5). Moreover, their sequence probabilities cannot be straightforwardly defined with reference to an underlying classical stochastic process.
That said, there are systematic ways of defining stationary such that the set of marginals describes all measurements over blocks of ℓ qudits. For example, if the source’s internal structure consists of a D-dimensional quantum system interacting unitarily with one qudit per time step, it generates a matrix product state (MPS) with a maximum bond dimension of D [38]. If the source operates stochastically (rather than unitarily), then many different MPSs can be emitted with varying probabilities. The collection is then described by matrix product density operators (MPDOs) [39]. We refer to these as entangled qudit processes. Their dynamical and informational analyses are left for elsewhere. The present goal is to layout the basics for those efforts.
2.5. Measured Processes
An agent observing a quantum process has many ways to measure it. Let M represent a measurement applied to the qudit in state . In general, M is a positive operator-valued measure (POVM) described by a set of positive semi-definite Hermitian operators on the Hilbert space of dimension d. Each corresponds to a possible measurement outcome y, and POVM elements must sum to the following identity:
Projection-valued measures (PVMs) are an important subclass of POVMs with an additional constraint: operators must be orthogonal projectors. PVMs have at most d elements. A PVM consisting only of rank-one projectors on is a von Neumann measurement and has exactly d elements [5].
A set of measurements applied to a block of ℓ qudits is described by some block POVM with elements on the Hilbert space of dimension . may include measurements in the joint basis of multiple qudits—measurements essential for fully characterizing entangled processes.
For separable processes we focus on “local” measurements—operators on a single qudit. The measurement operator for a block of ℓ qudits then takes a tensor product structure:
where each is a POVM on .
If we apply the same local POVM M to each qudit, then
We refer to this as a repeated POVM measurement.
An observer can also have multiple POVMs at their disposal and apply different measurements at different time steps according to some measurement protocols. We describe measurement protocols in more detail shortly.
For simplicity, the following ignores ’s post-measurement state and considers only the measurement outcomes . Thus, we take to be a stochastic map , with the random variables representing measurement outcomes.
When applying a measurement of the form of Equation (11) to a finite block of ℓ qudits, the outcomes factor into a block of ℓ classical random variables:
where the possible values of each are the POVM measurement outcomes . There are possible realizations of . We write a realization (word) of length ℓ as .
The probability of any particular measurement outcome for a block of ℓ qudits in state is
For with the separable form of Equation (6) and identical POVM measurements on each qudit as in Equation (11), we can decompose into ℓ local operators as follows:
For a separable qudit process, a sequence of local measurement outcomes can also be interpreted as the result of sending random variables from over the same memoryless noisy channel . decomposes into the deterministic preparation and our stochastic measurement :
(Appendix B presents a more thorough description of the classical-quantum channels and ).
This construction makes it clear that measurement outcomes correspond to classical random variables that take values and form a classical process , with probabilities defined by Equation (12). To express the relationship between a quantum process and a measured classical process, we write
where is a repeated, local POVM. If the qudit process is separable, we can also write
2.6. Adaptive Measurement Protocols
An observer does not need to repeat the same measurement on every qudit but may apply different POVMs at different time steps according to some algorithm. If the observer uses past measurement outcomes to inform their choice of POVM, we say that they are using an adaptive measurement protocol.
The following limits discussion to measurement protocols that have a deterministic finite automata (DFA) as their underlying controller. Similar constructions combining quantum measurement and DFAs have appeared in the context of quantum grammars [40,41,42].
A deterministic quantum measurement protocol (DQMP) is defined by the quintuple :
- A finite set of internal states.
- A unique start state .
- A set of POVMs , one for each internal state.
- An alphabet of m symbols corresponding to different measurement outcomes.
- A deterministic transition map .
If consists of only , then the DQMP is a repeated POVM measurement for POVM . When has more than one internal state, the POVMs corresponding to different states may have the same or a different number of elements. Likewise, the symbol sets corresponding to their measurement outcomes may be disjoint, or symbols may be repeated.
We can place the following bounds on the size of the set : , where is the set of operators corresponding to POVM , and is the size of the POVM with the most elements.
For DQMP and qudit process , obtaining a measured process is generically more difficult than for the case of repeated POVM measurements. When an observer begins using protocol at , they experience two distinct operating regimes: first the transient dynamic and then the recurrent dynamic. We briefly outline this process and return to the subject when we describe synchronization—a task deeply related to the transient dynamic—in Section 5.
begins in state at . For a given (stationary, ergodic) input , as , the DQMP approaches a stationary distribution over a subset of its internal states , where is the set of recurrent states. This distribution (and even which states are in ) depends on . The recurrent dynamic is determined by this stationary distribution and the transition probabilities between states in . Any state not in is in the transient state set .
The transient dynamic describes how goes from at to its recurrent dynamic over , which may occur at a finite time or only asymptotically as . In general, two dynamics produce two distinct measured processes: , which is stationary and ergodic by construction, and , which is not. The final measured process has two components—i.e., .
2.7. Discussion
These nested layers of complication suggest working through a concrete example and restating the overall goals.
Imagine that an observer measures a single qubit from a quantum source that emitted state using a projective measurement in the computational basis . The possible measurement outcomes and occur with the following probabilities:
respectively. These two values determine the distribution for the random variable and, by applying the same projective measurement to , we completely determine the statistics of the measured block. Continuing this procedure for defines the measured process .
Naturally, the observer can also choose to apply measurements in another basis, e.g., , where . This typically results in a measured process with radically different statistical features.
Finally, an observer could use an adaptive measurement protocol . It starts in state and measures with . If , it stays in and continues using . If , it transitions to a new internal state and uses on the next qubit. Regardless of the outcome of , it returns to and measures the next qubit with . The measured process will be distinct from both and and may consist of both a transient and recurrent component.
With this setting laid out, we can now more precisely state the questions the following development answers:
- Given the density matrices describing sequences of ℓ separable qudits, what are the general properties of sequences of measurement outcomes? This is Section 3’s focus. There, ’s quantum information properties bound the classical information properties of measurement sequences for certain classes of measurements.
- Given a hidden Markov chain quantum source, when is an observer with knowledge of the source able to determine the internal state (synchronize)? Can the observer remain synchronized at later times? Section 5 addresses this.
- If an observer encounters an unknown qudit source, how accurately can the observer estimate the informational properties of the emitted process through tomography with limited resources? How can they build approximate models of the source if they reconstruct for some finite ℓ? This is Section 6’s subject.
Additionally, Section 4 illustrates these general results and the required analysis methods using specific examples of qudit processes.
3. Information in Quantum Processes
We wish to develop an information-theoretic analysis of quantum processes for which the observed sequences depend on the observer’s choice of measurement. (Much of this parallels the classical information measures reviewed in Appendix A). This requires a more general approach using density matrices that contain all the information necessary to describe the outcome of any measurement performed on ℓ-qudit blocks. We use quantum information theory to study properties of the set of and then relate them to classical properties of measurement sequences described in Appendix A. We begin by briefly reviewing several basic quantities in quantum information theory. References [6,33] give a more complete picture of the subject.
3.1. von Neumann Entropy
In quantum information theory, the von Neumann entropy plays a role similar to that of the Shannon entropy in classical information theory. Given a mixed quantum state , the von Neumann entropy is
where the s are the eigenvalues of the density matrix . if and only if is a pure state. We use ; therefore, the units of the von Neumann entropy will be bits.
From Equation (13), the von Neumann entropy is the Shannon entropy of the eigenvalue distribution of density matrix . Therefore,
where is the Shannon entropy, and the minimum is taken over the set of all rank-one POVMs. The minimum will always be a PVM with projectors that compose ’s eigenbasis [6]. We use brackets to indicate that is a classical probability distribution over the measurement outcomes.
To monitor correlations between two quantum systems, we use the quantum relative entropy:
where and are the density operators of the two systems. The quantum relative entropy is non-negative and defined as follows:
with equality if and only if , a result known as Klein’s Inequality [33].
The joint quantum entropy for a state of a bipartite system is
We can further define a conditional quantum entropy of system A conditioned on system B as
where . Note that .
In contrast to the classical case, the conditional quantum entropy may be negative—a phenomenon leveraged in super-dense coding protocols [6]. Equivalently, the conditional quantum entropy can be written using the quantum relative entropy as
where is the identity operator on Hilbert space with dimension .
The quantum mutual information between quantum subsystems A and B is given by
The quantum mutual information is symmetric and non-negative. If the joint system is in a pure state, then will be zero, and . It can also be expressed as a quantum relative entropy as
A few additional well-known properties of the von Neumann entropy facilitate later results. First, each for separable qudit sequences is a finite mixture of states formed from length-ℓ words of an underlying classical process, so the following will be useful.
Lemma 1. Consider a random variable X that takes values with corresponding probabilities . Given a set of density matrices , the following inequality holds [33]:
with equality if and only if the all have support on orthogonal subspaces.
Second, since we use quantum channels to both prepare and measure a qudit process, we make use of the fact that the quantum relative entropy is monotonic [6]:
where is any quantum channel. This inequality becomes an equality if and only if there exists a recovery map such that and [43].
3.2. Quantum Block Entropy
Since stationary qudit processes are correlated across time, we explore how the von Neumann entropy for qudit blocks scales with block size. The following gives bounds on the possible measurement sequences one can observe from a quantum information source. As the von Neumann entropy generalizes the Shannon entropy, the results here (and many of the proofs) are natural generalizations of those in Appendix A. We also note that exactly determining becomes practically infeasible for large ℓ. And so, Section 6 addresses how to approximate properties and models for qudit processes when restricted to measurements of finite length blocks.
For a qudit process, we define the quantum block entropy as the von Neumann entropy of a block of ℓ consecutive qudits:
If is a pure state, . By the same logic as the classical case, .
Many properties of the classical block entropy hold for .
Proposition 1. For a stationary qudit process, is a nondecreasing function of ℓ.
Proof. As a consequence of the strong subadditivity of the von Neumann entropy [33],
Let , where , , and .Incorporating qudit process stationarity, we rewrite Equation (22) as follows:
where Equation (23) follows from stationarity.Thus, , for all , and is a nondecreasing function of ℓ. □
Proposition 2. For a stationary qudit process, is concave.
Proof. The von Neumann entropy is strongly subadditive [33], meaning that
For , let , where , , and We can rewrite Equation (24) by incorporating the stationarity of qudit processes:
where Equation (25) follows from stationarity.Thus, is concave. □
For a separable qudit process formed by passing a classical process through a classical-quantum channel, its block entropies are related in the following way:
Proposition 3. Let . The block entropies of and obey
for all ℓ, with equality if and only if consists of orthogonal pure states in of dimension .
Proof. Recall from Equation (6) that for separable qudit processes,
with each taking the separable form of Equation (5) and .We first note that, for all symbols in to be associated with orthogonal qudit states, the minimum dimension of the Hilbert space is .With written as a mixture of separable qudit words, we apply Lemma 1 to obtain
where . The second term evaluates to zero in the final line, since each is a pure state. That is, for all .Equality occurs if and only if the states have support on orthogonal subspaces, which requires that and all elements of are orthogonal. □
We cannot use to bound the block entropy of a measured process for general POVM measurements. For the case where the measurement consists only of rank-one POVMs (including all PVMs) however, the following holds:
with equality if and only if the measurement is performed in the minimum-entropy (eigen)basis of . This follows directly from Equation (14).
Proposition 4. Let , where is a repeated rank-one POVM measurement. The block entropies of and then obey
for all ℓ, with equality if and only if is a separable process with an orthogonal alphabet and uses a POVM whose operators include one projector for each element in .
Proof. The bound follows directly from Equation (26) because repeated rank-one POVM measurements are a subclass of the more general measurement sequence .The condition for equality also follows from Equation (26) but requires more justification. First, we consider measuring the single-qubit marginal with POVM M. For equality, each element of the eigenbasis of ( ) must have a corresponding operator in M that is a projector on that eigenspace ( ).We can write . Since we apply the same POVM to each qudit, all blocks of length ℓ must have eigenstates of the form —i.e., they must take the separable form of Equation (5)—making a separable process with a quantum alphabet of orthogonal states. The M consisting of projectors onto the states in is then the minimum-entropy measurement over blocks . Note that there may be other elements of the POVM that are not projectors if the probability of those measurement outcomes is 0 when applied to . (If the process does not make use of that part of Hilbert space, it does not matter how it is measured). □
To summarize, in the case of separable qudit processes, is upper-bounded by the underlying classical process’s block entropy . For repeated measurement with rank-one POVMs, serves as a lower bound on the block entropy of all classical measured processes . There is no direct relationship between and . Rather, it depends on the specifics of and .
3.3. von Neumann Entropy Rate
The von Neumann entropy rate of a qudit process is
The units of s are bits per time step. This quantity is equivalent to the mean entropy, first introduced in the context of quantum statistical mechanics [44]. The limit exists for all stationary processes [45]. Operationally, s is the optimal coding rate for a stationary quantum source [17].
Proposition 5. For a stationary qudit process, we can equivalently write the von Neumann entropy rate as
Proof. This proof closely follows the proof for the classical entropy rate in Ref. [46]. We begin by showing that exists and then that it is equivalent to the limit in Equation (27). The limit exists if is a decreasing, non-negative function of ℓ:
where Equation (29) follows from stationarity, and Equation (30) follows from the nondecreasing nature of . This, combined with the fact that is concave, means that exists.Now, we establish that . Through repeated application of Equation (17) to a block of length ℓ, we obtain the following chain rule for the von Neumann entropy:
We can modify indices (due to stationarity) and divide both sides by ℓ to obtain
The final steps require the following result, known as the Cesáro mean [46].
Lemma 2. If and , then . Taking the limit of both sides of Equation (32) and applying Lemma 2, we find
Using stationarity, . When combined with Equation (33), this proves that our two definitions of s are equivalent. □
To motivate a number of the following results, it is important to appreciate that simply because a process has a von Neumann entropy rate given by Equation (27) does not imply that an observer is able to perform a measurement on any qudit such that the uncertainty in that individual measurement is s. Rather, obtaining s corresponds to the measurement basis over the entire chain of qudits for which the distribution of outcomes has the minimal Shannon entropy. This basis is highly nonlocal or otherwise experimentally infeasible for many nontrivial examples. As in the classical case, s appears graphically as the slope of as , as shown in Figure 2.
For separable qudit processes, we can also relate s to the classical entropy rate of the underlying process as follows.
Proposition 6. Let . The von Neumann entropy rate s of then obeys the bound
where is the Shannon entropy rate of . Equality occurs if and only if consists of orthogonal pure states in of dimension .
Proof. Divide both sides of Proposition 3 by ℓ and take the limit of both sides as to obtain
From Equation (27), the left side is s, and from Equation (A1), the right side is . The condition for equality is inherited from Proposition 3, concluding the proof. □
Restricting once again to repeated measurements with rank-one POVMs, we can prove the following bound for measured processes.
Proposition 7. Let , and let be a repeated rank-one POVM. The measured entropy rate then obeys
where s is the von Neumann entropy rate of , with equality if and only if is a separable process with an orthogonal alphabet and uses a POVM whose operators include a projector for each element in .
Proof. Divide both sides of Proposition 4 by ℓ and take the limit of both sides as to obtain
The left side is s, and the right side . The conditions for equality are inherited from Proposition 4, concluding the proof. □
3.4. Quantum Redundancy
Unlike a classical process, the maximum entropy rate for a qudit process depends on the size of the Hilbert space rather than on the size of the alphabet . For Hilbert space of dimension d, the largest possible value of s is , corresponding to an i.i.d. sequence of qudits, each in a maximally mixed state .
A qudit process can be compressed down to its von Neumann entropy rate s. The amount that it can be compressed is the quantum redundancy:
Statistical biases in individual qudits and temporal correlations between them offer opportunities for compression. includes the effects of both.
For separable qudit processes, we can bound the quantum redundancy using properties of the underlying classical process:
Proposition 8. Let be a classical process with redundancy , symbol alphabet , and entropy rate , and let be a qudit process with redundancy , Hilbert space of dimension d, and entropy rate s.
For ,
with equality if and only if and consist of orthogonal pure states.
For ,
where the term is always positive, as indicated by Proposition (6).
Proof. First, consider :
The final line comes from Proposition 6, as does the condition for equality.For ,
There is no opportunity for equality. In this case, will not span , naturally leading to more redundancy.Finally, for ,
concluding the proof. □
We can also compare the classical redundancy of a measured process (obtained through repeated use of a rank-one POVM) to the quantum redundancy of the qudit process being measured.
Proposition 9. Let be a measured process such that and be a repeated rank-one POVM. Let have redundancy , and let have quantum redundancy . Then,
with equality if and only if , is a separable process with an orthogonal alphabet and uses a POVM whose operators include a projector for each element in .
Proof. A rank-one POVM on must have at least d elements; therefore, for ,
Going from the first line to the second provides a condition for equality: . Proposition 7 is used in the final line and provides the other conditions for equality. □
3.5. Quantum Entropy Gain
We can take discrete-time derivatives of , as was done for in [11]. This process is summarized in Appendix A. We call the first derivative of the quantum entropy gain as
for . The units for the quantum entropy gain are bits per time step, and we set the boundary condition at length to , where d is the Hilbert space dimension. Since is monotonically increasing, .
The quantum entropy gain is the amount of additional uncertainty introduced by including the qudit in a block, where that uncertainty is quantified by the von Neumann entropy.
By combining Equations (17) and (36), we can write as
This allows for relating the quantum entropy gain and the von Neumann entropy rate as follows:
Thus, paralleling the classical case, the quantum entropy gain serves as a finite-ℓ approximation of the von Neumann entropy rate:
serves as the best estimate for the entropy rate of a qudit process that can be made by an observer who only has access to measurement statistics for length-ℓ blocks of qudits.
The way in which the entropy rate estimate converges and its relationship to other information properties of a qudit process are summarized in Figure 3.
3.6. Quantum Predictability Gain
We call the second derivative of the quantum predictability gain, which is given by
for . The units of are bits per time step^2^. Since is concave, then . quantifies how much an observer’s estimate of the von Neumann entropy rate s improves if they enlarge their observations from blocks of to blocks of ℓ qudits. The generic convergence behavior of is shown in Figure 4.
For all higher-order discrete derivatives of (as with the classical block entropy),
This follows directly from the existence of the limit in Equation (27) for stationary quantum states.
3.7. Total Quantum Predictability
Up to this point, introducing new information-theoretic characteristics of separable quantum processes proceeded by taking discrete-time derivatives of the von Neumann block entropy. We can likewise integrate the functions , as is done for the classical case with Equation (A3). While this starts off straightforwardly, a number of interesting new informational quantities emerge.
These properties of qudit processes take the following general form:
where is the first value of ℓ for which is defined.
, the first of these, monitors the convergence of the quantum predictability gain to its limit of 0 for . We use to get the total quantum predictability :
The units of are bits per time step. Note that because for all ℓ.
can be interpreted by relating it to a previously established property of qudit processes: quantum redundancy.
Proposition 10. For a stationary qudit process,
Proof. Applying Equation (A3) to Equation (39), we find that
Here, the second line follows from Equation (37) and the third from Equation (34). □
Thus, is the total amount of predictable information per time step for a qudit process.
Rather immediately, one sees that the amount of information in an individual qudit decomposes into
is the amount of quantum information within a qudit that is predictable, whereas s is the amount of information that is irreducibly random.
The relation can be combined with Propositions 8 and 9 to prove two corollaries.
Corollary 1. Let be a classical process with total predictability , symbol alphabet , and entropy rate , and let be a qudit process with total quantum predictability , Hilbert space of dimension d, and entropy rate s.
For ,
with equality if and only if and consist of orthogonal pure states.
For ,
where the term is always positive, as derived from Proposition (6).
Proof. This follows immediately from combining Propositions 8 and 10. □
Corollary 2. Let be a measured process such that , and is a repeated rank-one POVM. Let have redundancy , and let have quantum redundancy . Then,
with equality if and only if , is a separable process with an orthogonal alphabet and uses a POVM whose operators include a projector for each element in .
Proof. This follows immediately from combining Propositions 9 and 10. □
Graphically, the total quantum predictability is the area between the predictability gain curve and its linear asymptote of 0, as seen in Figure 4. The von Neumann entropy rate and total predictability lend insight into compression limits for stationary sources. They do not indicate, however, whether that compression is achievable due to bias within individual qudit states or correlations between qudits. For that, we must continue our way back up the entropy hierarchy.
3.8. Quantum Excess Entropy
The convergence of to the true von Neumann entropy rate s is quantified with the quantum excess entropy:
The units for are bits. Paralleling the classical case, we refer to any qudit process with finite as finitary and those with infinite as infinitary.
We can further express in terms of the asymptotic behavior of .
Proposition 11. The quantum excess entropy can be written as
Proof. We evaluate the discrete integral in Equation (40) with Equation (A3) using partial sums:
since by definition. □
For finitary quantum processes, is the area between the entropy gain curve and its asymptote s, as seen in Figure 3. It also appears in Figure 2 as the vertical offset of the linear asymptote to the curve.
This leads to a natural scaling of the quantum block entropy as
as .
A clearer interpretation of as a quantum mutual information is provided by the following proposition.
Proposition 12. The quantum excess entropy can be written as
where and are two blocks of ℓ consecutive qudits with a shared boundary.
Proof. The quantum mutual information, from Equation (19), between two neighboring blocks of ℓ qudits can be expressed as
where the final line is obtained through repeated application of Equation (17).Taking ,
where the final line follows from Equation (32) and stationarity.This final expression is equivalent to the form of derived in Proposition 11, concluding the proof. □
is therefore a measure of all the correlations between two halves of the infinite sequence of qudits. if and only if a source is i.i.d. (with ).
We can relate for a separable qudit process to of the underlying classical process.
Proposition 13. Let be a classical process with alphabet and excess entropy , and let be a qudit process with alphabet and quantum excess entropy .
Then,
with equality if and only if consists of orthogonal states.
Proof. Consider , a block of length of the classical process . We can write realizations of into a classical register to form the following state:
where all are orthogonal. Then, we pass each symbol through the preparation channel to obtain blocks of our qudit process , where .We can express the quantum mutual information as a quantum relative entropy, Equation (20), giving the following relation:
where Equation (43) comes from the monotonicity of the quantum relative entropy in Equation (21). The condition for equality comes from Equation (21) as well. The set of states for which the recovery map must exist is , and this is only possible if all are distinguishable—i.e., orthogonal.Using Equation (A9) to write the excess entropy of as a limit, we see that
□
A similar bound appears when we apply a repeated POVM measurement to the quantum process to obtain a classical process.
Proposition 14. Let be a measured process such that , and let be a repeated rank-one POVM. Let have excess entropy , and let have quantum excess entropy . Then,
with equality if and only if is a separable process with an orthogonal alphabet and uses a POVM whose operators include a projector for each element in .
Proof. Consider —a block of length of the quantum process —and let be a repeated measurement of rank-one POVM M with elements so that , , and all are orthogonal. The repeated measurement applied over a block of length ℓ is then .We express the quantum mutual information as a quantum relative entropy (using Equation (20)) and apply Equation (21) to obtain
The condition for equality comes from Equation (21) as well. The set of states for which the recovery map must exist is , and this is only possible if all are orthogonal and M contains a projector for each . By the same argument as Proposition 4, must be a separable process, and must consist of orthogonal states.Taking the limit , we see that
□
Combining the above proofs, we obtain the following corollary relating the excess entropies of the underlying classical process and the measured process .
Corollary 3. Let be a classical process with excess entropy and alphabet , let be a separable qudit process with alphabet , and let be a measured process with excess entropy such that , where is a repeated rank-one POVM.
Then,
with equality if and only if consists of orthogonal states and uses a POVM whose operators include a projector for each element in .
Proof. This follows immediately from combining Propositions 13 and 14. □
An exact value of typically requires characterizing infinite-length sequences of qudits. However, we can write a finite-ℓ estimate of using Equation (41):
which generally underestimates ’s true value.
3.9. Quantum Transient Information
We now turn to look at how the quantum block entropy curve converges to its linear asymptote . We define the quantum transient information as
The units of are bits × time steps.
is represented graphically as the area between the curve and its linear asymptote for , as seen in Figure 2. We will see that distinguishes between periodic qudit processes that cannot be distinguished with previous information quantities such as and s.
Proposition 15. The transient quantum information can be written as
Proof. The proof reduces to the straightforward proof for transient information of a classical stochastic process, as defined in Ref. [11]. It depends only upon Equations (46) and (A3), which have the same form in the quantum case as the classical case. □
This expression allows us to estimate for a given quantum process as
which generally underestimates the true value of .
is related to the minimal amount of information necessary for an observer to synchronize to an HMCQS. We say an observer is synchronized when they are able to determine a source’s internal state. If converges to its linear asymptote at finite ℓ, then there exists an optimal POVM on (in the eigenbasis of ) that exactly determines the HMCQS’s internal state. Note that this is not guaranteed to be a repeated POVM or even consist of local POVMs. The information within that measurement that is useful for synchronization is quantified by . In contrast, if does not converge for finite ℓ, then no such POVM over any finite block of qudits exists, and an observer can (at best) only converge asymptotically to the source’s internal state. We will see in Section 5 that even this is not possible for most sources when we restrict ourselves to local measurements.
3.10. Quantum Markov Order
The quantum Markov order corresponds to the number of previous qudits on which the next qudit is conditionally dependent. A process has quantum Markov order if is the smallest value for which the following property holds:
A graphical interpretation is that when the block size reaches , levels off and has a constant slope—which is s, as seen in Figure 2. As a consequence, is the value of ℓ for which and converge to their asymptotic values, as seen in Figure 3 and Figure 4, respectively.
Note that a classical process is referred to as ‘Markov’ if . If a separable qudit process has an underlying classical process that is Markov, then it obeys the property in Equation (9) but does not generally have .
Consider a separable quantum process , where has Markov order , and has quantum Markov order . can be equal to, less than, or greater than . We will give a simple example of each case:
- trivially if consists of orthogonal states, in which case they have identical block entropy curves via Proposition 3.
- if and all symbols in are mapped to the same pure state . In this case, , as the resulting process is i.i.d.
- when , , and consists of nonorthogonal states. Frequently, , since arbitrary sequences of nonorthogonal states cannot reliably be distinguished with a finite POVM.
Similar rules apply when comparing to the Markov order of measured process :
- if is a separable process with an orthogonal alphabet and uses a POVM whose operators include a projector for each element in via Proposition 4.
- if consists of orthogonal states, , and is a repeated rank-one POVM that does not include projectors onto the states in .
- if and is a repeated POVM measurement with the one-element POVM, . Note that this is not a rank-one POVM.
Now, with a toolbox of quantum information properties in hand, the following section calculates (or estimates) their values for paradigmatic examples of qudit processes.
4. Qudit Processes
We now present a variety of separable qudit processes organized roughly by increasing structural complexity and demonstrate the extent to which their behavior can be quantified with Section 3’s informational measures. Their properties are summarized in Table 1 near the end when we have their informational measures in hand. Note that the following results on qudits do not markedly change if one simplifies to qubits.
4.1. I.I.D. Processes
Recall that an i.i.d. (independent and identically distributed) qudit process has no classical or quantum correlation between any of the qudits, and its length-ℓ density matrices are in a product state such that
where is the density matrix for a single time step.
The quantum block entropy takes the form ; thus, the von Neumann entropy rate is . This implies that a repeated projective measurement exists for which the measured process has a classical entropy rate of . The measurement that realizes this bound consists of orthogonal projectors , each of which is constructed from an eigenvector of . In the special case where is the maximally mixed state, , and any set of orthogonal projectors on a block gives a uniform distribution over all measurement outcomes , since there are no correlations between qudits , , and trivially. The output of a single-qudit source with uncorrelated noise can be considered an i.i.d. qudit process.
4.2. Quantum Presentations of Classical Processes
Any classical process with alphabet can be represented by a qudit process with orthogonal alphabet , where each symbol corresponds to a pure state . In this case, the encoding is trivial, and one can recover the underlying process via repeated measurement with orthogonal projectors . This requires that .
The information measures for the underlying process, the qudit process, and measurement outcomes obey
with equality for repeated measurement with a rank-one POVM whose elements include the projector set .
From this relation, we can see that many quantum information quantities such as s, , , and are equal to the classical properties of . Exceptions include quantities that depend on the relationship between d and , such as quantum redundancy.
Since the quantum-classical channel is trivial, the output process is the result of passing through a noisy classical channel, where the level of noise depends on the particular measurement scheme .
For a repeated rank-one POVM, for all ℓ from Propositions 3 and 4. Similar inequalities explicitly relate other classical process properties such as the entropy rates ( ), the predictabilities ( ), and the excess entropies (see Corollary 3).
4.3. Periodic Processes
A classical stochastic process is periodic with period p if it consists of repetitions of a template sequence—a length-p block of symbols. A periodic separable qudit process is one for which the underlying classical process is periodic.
For classical periodic processes, the block entropy curve reaches a maximal value at its Markov order and thereafter remains constant with increasing ℓ ( ). That maximal value is the excess entropy, which is entirely determined by the period according to the formula . Finally, an observer attempting to synchronize to different length-p templates may encounter more or less uncertainty in the process depending on the template itself, which is a feature captured by the transient information [11].
Periodic qudit processes share many of these properties (as we show via Section 3’s results) but also exhibit richer behavior, since they can consist of nonorthogonal qudit states. Figure 5 shows the quantum block entropies for the period-3 process consisting of the repeated quantum word , where . For = , we recover the block entropy for the classical period-3 word ‘001’. As decreases, becomes less distinguishable from .
From Propositions 6 and 13, it follows that periodic qudit processes with period p have and . (And, unless two classical symbols in are sent to the same pure state in in a way that reduces the effective period of the qudit process to less than p). However, the quantum block entropy curve does not necessarily reach its maximal value of for if contains nonorthogonal states. In this case, , since an observer cannot unambiguously distinguish where one length-p block begins and another one ends with any finite measurement.
Though all period-p qudit processes have the same von Neumann entropy rate and quantum excess entropy, they may be distinguished by their values for the quantum transient information in two different ways.
First, different quantum alphabets give different values of . Figure 5 demonstrates that, for a two-state qubit alphabet, as the states become more or less distinguishable, the quantum transient information increases. For (orthogonal alphabet), bits × time steps, whereas for , bits × time steps. These values of (and others in this section) are numerically approximated using Equation (47), with .
Second, can distinguish between different length-p words. Reference [11] shows that can distinguish between different period-5 classical words (‘00001’, ‘00011’, and ‘00101’), and generalizes this behavior. Whereas all period-3 words are equivalent to ‘001’ under global bit swap and translations, the same is not true for period-5 words. Section 5 discusses synchronizing to period-5 qudit sources in more detail and relates that task to the value of .
4.4. Quantum Golden Mean Processes
The classical Golden Mean process consists of all binary strings with no consecutive ‘1’s. It is a Markov process ( ), since the joint probabilities for blocks factor as in Equation (2), with , , , and . For the classical Golden Mean, bits per symbol and bits.
Replacing the classical symbol alphabet with the quantum alphabet , where , gives the - Quantum Golden Mean process introduced in Ref. [14]. Figure 6 shows its generator.
We can further generalize this process to the - Quantum Golden Mean process with quantum alphabet . This process’s quantum entropy rate is shown in Figure 7 for different values of . For , , and we recover the classical Golden Mean process. As decreases to 0, the states in become less distinguishable and s decreases, as is expected from Proposition 6.
Also in Figure 7, we see the entropy rate of the measured processes obtained by applying a repeated PVM to the - Quantum Golden Mean process. This PVM consists of projectors parametrized by the angle , where . As mandated by Proposition 7, for all and .
Figure 8 demonstrates how another of Section 3’s quantum information properties, the quantum excess entropy , relates to the excess entropy of the classical measured process. For all and , the bound from Proposition 14 ( ) holds, has maxima at , and when and . These quantities were estimated using .
Underlying the - Quantum Golden Mean is the classical Golden Mean process, which is Markov. Thus, it obeys the quantum Markov property of Equation (9) despite the fact that it has an infinite quantum Markov order for most values of . This has implications for an observer’s ability to synchronize to a Quantum Golden Mean source, which Section 5 explores.
4.5. 3-Symbol Quantum Golden Mean
Figure 9 shows the generator of the 3-Symbol Quantum Golden Mean process with alphabet . Though its generator shares the same internal states and transition probabilities as the - Quantum Golden Mean, the 3-Symbol Quantum Golden Mean does not have a one-to-one correspondence between the quantum alphabet and the generator states ( and ). Instead, , and these three states cannot all be orthogonal with .
However, unlike the - Quantum Golden Mean, we can calculate the quantum entropy rate directly from the generator because (1) it has the property of quantum unifilarity, and (2) it is possible to synchronize to it. Both are discussed at length in Section 5. For now, an HMCQS is quantum unifilar if and only if, for every , there exists some POVM such that an observer knowing and the outcome of can uniquely determine the internal state to which the HMCQS transitioned. The generator in Figure 9 meets this criterion with . can be any POVM.
For a classical, unifilar HMC, can be calculated as
where is the stationary state distribution. This result dates back to the foundations of information theory [7].
Similarly, for a quantum unifilar HMCQS to which one can synchronize, we can write
Let us walk through this logic for the 3-Symbol Quantum Golden Mean. In state A ( ), the generator emits a qubit either in state or , each with probability . The density matrix describing this qubit is a maximally mixed state, and any measurement performed on it involves 1 bit of irreducible randomness. If it is in state B ( ), it emits a qubit in state deterministically. Thus, averaging over the states, the von Neumann entropy rate is bits/time step.
Despite this, when restricted to measuring with a repeated rank-one POVM, the entropy rate of the observed classical process cannot reach the lower bound of bits per symbol because the minimum entropy basis when in state A is , while the minimum entropy basis when in state B is . However, an experimenter using the adaptive measurement protocol in Figure 10 would observe symbol sequences with an entropy rate of bits per symbol once they have synchronized.
They start by measuring in the basis and use the outcome of the initial measurement to select a new basis. If the outcome is ‘0’, they are able to synchronize to the process generator, which is necessarily in state A. They continue in state A measuring with POVM until they observe a ‘1’, at which point they transition to B and measure with , which is a zero-entropy measurement. They observe a ‘+’ and return to state A.
In this way, when using an adaptive measurement protocol, the recurrent part of the measurement sequence may have a lower entropy rate than for any repeated rank-one POVM measurement, even if the associated DQMP uses only rank-one PVMs, as in this example. These ideas are formalized and expanded upon in the next section.
4.6. Unifilar and Nonunifilar Qubit Sources
The last two classes of qubit processes to discuss are generated by the unifilar and nonunifilar qubit sources shown in Figure 11 and Figure 12, respectively. Both of these generators consist of two internal states ( ) and emit four possible pure-qubit states ( ) that form two orthogonal pairs. However, for each internal state of the unifilar qubit source, the two possible emitted states are orthogonal, and one can unambiguously determine the next internal state. In other words, it has the property of quantum unifilarity. The same is not true of nonunifilar qubit sources. The next section illustrates how this difference strongly affects an observer’s ability to synchronize.
By varying the parameter p, one can interpolate between one of the several simpler processes already analyzed. Starting with the unifilar qubit source in Figure 11, for , the generator becomes a period-2 source that emits the word . For , we obtain a two-state model that generates the i.i.d. maximally mixed process. And, as , the generator emits longer strings of either only or only qubits depending on whether the source is in A or B. At , the process becomes nonergodic (here demonstrated by the disconnection between internal states), and the source will only emit either or deterministically.
Similarly, the nonunifilar qubit source in Figure 12 simplifies for certain p’s. For , we obtain a period-2 source, this time with sequence , and for , it also generates the i.i.d. maximally mixed process. As , it emits long sequences of either all or all and becomes nonergodic for .
4.7. Unifilar Qutrit Source
Expanding beyond processes over qubits, we consider a single example of a qutrit ( ) process whose generator is shown in Figure 13 and that employs a five-qubit alphabet . Using a higher-dimensional Hilbert space makes more measurements available to an observer for synchronization. As a consequence, this example process exhibits behavior that is impossible with qubit processes alone: A subspace of Hilbert space (occupied by ) is always distinguishable from all other states in and can be reserved for synchronization. The other subspace (including , , , and ) consists of states that cannot be reliably distinguished. Note that this process is quantum unifilar, but one cannot remain synchronized by measuring in a single basis. The next section discusses multiple adaptive measurement protocols that can do so.
4.8. Discussion
Table 1 summarizes information properties for the above examples with either analytic results or numerical estimates. Together, these examples illustrate a range of different features of separable qudit processes. They demonstrate how the information properties defined and characterized in Section 3 are both indicative of underlying structural features of quantum information sources and strongly influenced by the distinguishability of states in a process’s quantum alphabet. Our analysis of the convergence of the quantum block entropy to its linear asymptote thus gives meaningful and interpretable ways of quantifying the randomness and correlation in a separable quantum process.
We continue by discussing two tasks that an observer might wish to perform when faced with a quantum information source emitting a separable quantum process. First, if they have prior knowledge of the internal structure of the source, they may want to determine the internal state it occupies during a given time step. This task is synchronization, and we discuss it in the next section. Second, if they have no knowledge of the source, they may want to measure the process it produces to infer its internal structure. This task is system identification, and we demonstrate how an observer can use a tomographic protocol to perform it in Section 6.
5. Synchronizing to a Quantum Source
How does an observer of a process with knowledge of its quantum generator determine its internal state? When an observer is certain about the internal state, the observer is synchronized to the quantum source. The following explores synchronizing to quantum processes—both the manner in which observations lead to inferring the source’s state and quantitative measures of partial and full synchronization.
Quantum measurement adds subtlety to this task in comparison to the task of synchronizing to a classical process given knowledge of its minimal unifilar model—the -machine—as described in Refs. [12,47].
5.1. States of Knowledge
Recall that a hidden Markov chain quantum source (HMCQS) consists of a set of internal states ( ), a pure-state alphabet ( ), and a set of labeled transition matrices ( ). We assume an observer has complete knowledge of the HMCQS that generates a process but can only infer the internal state at time by applying block measurement and observing outcome . They have no access to the qudits that were emitted before .
An observer’s best guess for the internal state of a source given different sequences of observations can be represented as distributions over the source’s internal states, which are known as mixed states (not to be confused with mixed quantum states, which are represented by density matrices). Classically, after observing a particular length-ℓ word , the observer is in the mixed state . After the next observation , they will transition to one of a set of new mixed states depending on the outcome , , and so on. The word corresponding to the new mixed state is a concatenation of w and the new observation . The set of mixed states for a classical process and the dynamic between them define a process’s mixed state presentation (MSP), which is unifilar by construction [48].
Using a classical process’s MSP rather than a nonunifilar generator of the process has many computational advantages. Two of interest are that it allows one to calculate the entropy rate for processes without finite unifilar presentations [49] and that it allows one to calculate the uncertainty an observer experiences while attempting to synchronize to a process’s generator [50].
For quantum processes, there is no unique MSP but rather a multiplicity of possible MSPs, each corresponding to a different choice of measurement protocol. For a given source and given measurement protocol , we can define a set of mixed states with each corresponding to the possible measurement sequence one can observe.
Consider the mixed states corresponding to length-ℓ sequences of observations. We restrict to consist of local POVMs, allowing for adaptive measurement. Given that an observer has applied measurement and seen measurement outcomes , their best guess about the generator’s internal state is represented by the conditional distribution .
For , an observer has no measurement outcomes with which to inform their prediction about the source’s internal state. However, they do know the stationary state distribution of the model (since it can be calculated directly from ). This serves as a ‘best guess’ of the source’s internal state absent any measurements. If the initial state distribution is not and the observer is aware of this fact, the mixed states for that process are . We omit the conditioning on if .
For most repeated PVM measurements of qubit processes, there are an uncountably infinite number of mixed states [14]. This measurement-induced complexity appears even for . The examples in this section step back from this complexity to focus on measurements that are sufficiently informative to allow an observer to synchronize with finite observation sequence and result in only finite or (at most) a countably infinite set of recurrent mixed states.
5.2. Average State Uncertainty and Synchronization Information
To compare synchronization behavior for different measurement protocols, it is useful to use entropic quantities rather than working with the set of mixed states and their dynamic directly. In particular, we look at the entropy of the possible mixed states for length-ℓ sequences of measurements.
An observer’s uncertainty about the source’s internal state after applying measurement sequence (according to some protocol ) and observing outcome is given by the Shannon entropy of the corresponding mixed-state distribution:
One can average over all measurement outcomes on a block of ℓ qudits to find the average state uncertainty:
For a given measurement protocol , this quantity generally converges to a finite value in the limit. This limit does not necessarily exist if the dynamics are not ergodic, for example, if there is some underlying periodicity. If such a limit exists, the asymptotic state uncertainty is
Both and are measured in bits.
If , then is a synchronizing observation, and an observer who sees outcome can precisely identify the source’s internal state. If , then any measurement outcome seen when applying protocol to ℓ qudits allows the observer to synchronize to the source. For classical and quantum processes, the Markov order is the first value of ℓ for which vanishes (for some in the quantum case). This generally does not occur for HMCQS with nonorthogonal states in except in the limit, since is generally infinite.
Being synchronized after measuring ℓ qudits does not guarantee that the observer remains synchronized after measuring qudit . In classical processes for which this occurs, persistent synchronization requires that the HMC of the process that the observer uses satisfies the additional condition of unifilarity. In the quantum setting, synchronization persists if and only if the underlying HMCQS is quantum unifilar, and when the observer is in internal state , their measurement protocol ensures they apply the measurement for which the internal state at time is completely determined.
This is equivalent to the statement ‘There exists some protocol with measurements and such that
for all ’.
The measurement that maintains synchronization when the source is in one internal state ( ) does not need to be the same as the measurement that maintains synchronization when the source is in another internal state ( ). When the measurement required to maintain synchronization depends upon the HMCQS’s current state, then adaptive measurement protocols are capable of maintaining synchronization even when no fixed-basis measurement can, as the following demonstrates with multiple examples.
Finally, the total amount of state uncertainty that an observer encounters while synchronizing to a process using a given measurement protocol is the synchronization information, which is given by
Note that, if the asymptotic state uncertainty is greater than 0, is infinite. When , we can estimate by terminating the sum at some finite ℓ.
For classical processes, the synchronization information is closely tied to the transient information . We will similarly connect to the quantum transient information and use it as a way to compare synchronization via different measurement protocols.
The remainder of this section explores synchronization to various models from Section 4. We pair each with a variety of measurements, both repeated POVMs and adaptive protocols, to demonstrate the way is affected by this choice.
5.3. Synchronizing to Quantum Presentations of Classical Processes
An HMCQS with a qudit alphabet consisting entirely of orthogonal states ( , for all and ), is a quantum presentation of the underlying classical process . An observer can perform a measurement using orthogonal projectors for all that unambiguously discriminates between the pure qudit states the source emits.
The task of synchronizing to such an HMCQS is equivalent to synchronizing to a source emitting classical symbols. If it is quantum unifilar, then the underlying classical HMC is unifilar, and synchronization is exponentially fast (on average) [12,47]. Absent unifilarity, an observer may not be able to synchronize even asymptotically—i.e., for all —despite measuring with orthogonal projectors.
5.4. Synchronizing to Periodic Processes
All periodic quantum processes have a vanishing von Neumann entropy rate—i.e., —and each state only has one incoming and one outgoing transition. Due to these simplifications, an observer simply determines the source’s phase to synchronize; i.e., which of the p internal states the source occupies at time t. Once this phase is determined for any t, it is also determined for all other t. The initial phase uncertainty is equivalent to the initial state uncertainty , since is uniform.
Any measurement protocol that distinguishes between states in gives information about the phase and allows an observer to synchronize to the source asymptotically. If two states in cannot be distinguished by , then synchronization may not be possible, even with an infinite number of measurements.
For classical periodic processes, an observer synchronizes at the Markov order , and the synchronization information and the transient information are equal: [11]. In contrast, quantum periodic processes generically have infinite Markov order, if there are nonorthogonal states in , and only asymptotically synchronize. Thus, . The condition for equality is that is the optimal measurement over the process. For , this means that must be a global measurement over the bi-infinite chain of qudits.
Figure 14 shows the average state uncertainty for an observer measuring three different period-5 qubit processes consisting of two alphabet states: and . All other period-5 sequences with this qubit alphabet are equivalent to these three under shift and swap symmetries. As expected, decreases from to for all combinations of sequence and measurement basis. That noted, the rate at which synchronization occurs—and the total amount of uncertainty seen by an observer, the synchronization information—depends on both the particular sequence and the particular repeated measurement applied to it.
For period-5 process with word ‘ ’, bits × time steps, using Equation (47), with . When measuring it in basis ( ), we find that bits ( bits). These are also estimated by calculating up to . Similarly, the bound between and holds for the other period-5 words. For word ‘ ’, bits × time steps, bits, and bits. For word ‘ ’, bits × time steps, bits, and bits.
At this point, we wish to emphasize that s, , and are equal for generic period-5 processes with nonorthogonal alphabets . Despite this, and identify physically relevant differences between these processes for the task of synchronization. These differences are intrinsic to the quantum process itself ( ) and also appear within the measurement outcomes an observer obtains ( ).
As a final comment on periodic processes, we note that while for all periodic quantum process, many measurement protocols (even those for which ) give a measured classical process with a nonzero entropy rate . In fact, no fixed-basis measurement of a periodic process results in an observed process with unless (i) all states in are orthogonal and (ii) the measurement is in an orthogonal basis which includes one projector for each state in .
In contrast, an observer using an adaptive measurement protocol can easily have zero uncertainty in the measurement outcomes once synchronized. For example, Figure 15 shows the recurrent states for a DQMP that swaps between two different measurements— and —depending on the internal state of the source. The sequence of measurement outcomes observed is deterministic, and the recurrent measured process is a period-5 process with word ‘ ’ and .
5.5. Synchronizing with PVMs
The - Quantum Golden Mean process has multiple synchronizing observations; see the generator in Figure 6. Measuring with and seeing a ‘1’ synchronizes the observer to internal state B; measuring with and seeing a ‘−’ synchronizes the observer to internal state A. Let us consider the mixed states produced by these two repeated PVMs in turn, starting with .
Figure 16 shows the mixed states for the measured process obtained by repeatedly applying . Observing synchronizing measurement means that the source just transitioned to state B while emitting a qubit, i.e., . Additionally, the source in state B can only transition to state A while emitting a qubit, so the observer will see a ‘0’, and . However, once the source is in state A, an observer easily desynchronizes from the source if they observe another ‘0’, as this is consistent with either of the two transitions to the source. As an observer sees more ‘0’s, they transition to mixed states further to the right of Figure 16. To summarize, measuring with makes use of synchronizing observations ‘1’ and ‘10’, but its MSP has a countably infinite number of recurrent states corresponding to sequences of n ‘0’s.
This measured process is an infinite-state classical renewal process whose information properties can be estimated using methods from Ref. [51]. After seeing n ‘0’s in a row, the next observation will be ‘1’ with the following probability:
We find bits per symbol, bits, and a single-symbol entropy of bits. ’s small value and the fact that is not significantly lower than indicate that the infinite-state renewal process presentation provides only a small predictive advantage over using a biased coin with . This is further evidenced by how converges exponentially quickly to its asymptotic value of .
Measuring with , symbol ‘−’ is a synchronizing observation and indicates the source is in state A. The recurrent mixed states for this observed process are shown in Figure 17, and they also form a classical renewal process. Observing n ‘+’s since the last ‘−’, the probability that the generator is in state A is . The measured process obtained with has bits per symbol, bits, and bits. Again, the infinite-state MSP provides only a small predictive advantage over a biased coin with . Also, converges exponentially quickly to .
Applying either of these two fixed-basis measurements gives an average state uncertainty that decreases monotonically with ℓ, as shown in Figure 18. Since this source is not quantum unifilar, an observer repeatedly synchronizes and desynchronizes while measuring this process regardless of basis, and does not approach 0 as . We find that bits and bits. Measuring with not only results in less state uncertainty asymptotically but also results in less average uncertainty for all values of ℓ.
Figure 18 also displays the average state uncertainty for two other relevant PVMs: for and . Recall that is the PVM consisting of projectors onto orthogonal states and . For comparison, corresponds to , and corresponds to .
For , the symbol states and give the observer the exact same distribution of measurement outcomes, and the observer cannot gain any information about the state of the source beyond the stationary state distribution. This can also been seen as the maximum of the asymptotic state uncertainty in Figure 19.
For —and for the majority of values of — has no synchronizing observations. Despite this, Figure 18 demonstrates that is lower for all ℓ than both bases that can exactly synchronize.
A measured process for generic does not have the renewal process structure of Figure 16 and Figure 17. Instead, in the absence of synchronizing observations, the number of mixed states generically grows exponentially with ℓ. If we approximate the MSP using length-ℓ observations, then there will be mixed states—one for each possible sequence of measurements—each corresponding to a different distribution over the source’s internal states. These infinite-state MSPs can be characterized by their statistical complexity dimension [52].
Despite the explosive complexity, many of these PVMs give lower average state uncertainties than and . is a representative example. The asymptotic state uncertainty when applying is shown in Figure 19. Note that the maximum value of occurs when , as discussed. The minimum asymptotic state uncertainty for this set of PVMs is bits, which occurs for . An observer may choose to use a basis with an exponential set of mixed states to lower their uncertainty in the source’s internal state at the cost of having to track the probability of a larger set of mixed states.
5.6. Maintaining Synchrony with Adaptive Measurement
Adaptive measurement protocols are also capable of maintaining synchronization when no fixed-basis measurement can. To appreciate this, consider Figure 11’s unifilar qubit source.
For , there are no synchronizing observations; however, almost any measurement reduces the average state uncertainty below . Additionally, if an observer comes to know the source’s state by other means (perhaps the source is initialized in state A at time ), then they are able to maintain synchronization with a simple adaptive measurement protocol .
This protocol, defined here only for recurrent states, is as follows. If the source is in state A, apply measurement , and if the source is in state B, apply measurement . It is only possible to define it this simply and maintain synchronization because the source is quantum unifilar.
Figure 20 shows the behavior of as the observer desynchronizes from the source initialized in state A. For each repeated PVM measurement, it eventually reaches an asymptotic value, though may be nonmonotonic.
5.7. Synchronizing to a Qutrit Source
A qubit source ( ) presents limited opportunities for unambiguous state discrimination and, hence, for synchronization. Qutrits allow for more general behavior and have been suggested for applications in quantum communication networks where one state is reserved specifically for synchronization [53].
Let us apply this logic to the unifilar qutrit source in Figure 13. Synchronizing observation sequences for this process are ‘2’, ‘ ’, and ‘ ’ that definitively place the source in states A, B, and C, respectively. Once synchronized, an observer may remain synchronized by measuring with when in state A, when in state B, and either of the above when in state C.
We explored synchronizing to this source with five different measurement protocols and compare them in Figure 21. The first two are repeated PVMs in the basis and the basis. The other three are adaptive measurement protocols that share a recurrent dynamic but have different transient states. Consider measuring in the ( ) basis until observing a ‘2’ and therefore synchronizing to source state A. Then, use when the source is in state A and when the source is in state B or C. We refer to this protocol as ( ). The fifth protocol is defined by the DQMP in Figure 22 that uses an adaptive protocol over three transient mixed states in addition to the protocol just defined for the recurrent states.
The average state uncertainties for an observer implementing these five measurement protocols are shown in Figure 21. The fixed-basis measurements do not lead to persistent synchronization and have a nonzero asymptotic state uncertainty ( bits and bits. If one measures in a fixed basis until synchronizing (by observing a ‘2’) and then takes advantage of the quantum unifilarity of the source to stay synchronized, then the asymptotic state uncertainty vanishes. This is the case for and , which have synchronization information values of bits and bits. The extra complexity of the measurement protocol admits additional synchronizing words (‘ ’ and ‘ ’) and lower synchronization information ( bits) than the simpler strategy of waiting to see a ‘2’. Note that these three synchronization information values are all greater than our estimate of the quantum transient information for this process ( bits × symbols). These values values of were estimated using .
5.8. Discussion
This section has detailed the process of synchronizing to a known qudit source. In contrast to sources of classical random variables, an observer may attempt to synchronize to a qudit source using a variety of measurement protocols. We can compare different protocols with two informational quantities introduced above: the asymptotic state uncertainty and the synchronization information .
For sources that are not quantum unifilar, no protocol can remain synchronized to the source. Nevertheless, for different measurement protocols and , we can compare and . The protocol with a lower value is better at the task of synchronization in the sense that an observer’s uncertainty in the source’s internal state will be lower on average.
This leads to a natural question: What is the best measurement protocol for synchronizing to a source? To answer it, we introduce a new protocol-independent property of a qudit process, the minimal asymptotic state uncertainty:
where the minimum is taken over all possible measurement protocols defined via DQMP. In practice, determining for a quantum process requires a proof that no measurement protocol can achieve a lower value. Fully exploring the space of measurement protocols is beyond the present scope. Nevertheless, we introduced candidates for in the above examples and found the minimal asymptotic state uncertainty for a restricted class of repeated PVMs numerically; recall Figure 19.
For sources that are quantum unifilar, we explored several protocols that are capable of persistent synchronization. For such processes, bits. We can compare different synchronizing measurement protocols and through their synchronization information values and . A lower value means that the observer experiences less state uncertainty while synchronizing. What is the minimal amount of state uncertainty an observer can experience? We define the minimal synchronization information for a quantum process as
This minimum is also taken over all DQMPs, establishing that a given protocol is minimal and nontrivial.
One open question prompted by this work is ‘Is it always possible to synchronize to a source that is quantum unifilar?’ Equivalently, ‘Does for all processes generated by quantum unifilar sources?’ Answering these questions will also require a greater understanding of the space of DQMPs. Progress may involve proving the existence or nonexistence of a protocol that is able to synchronize to the unifilar qubit source in Figure 11.
Finally, we recount several reasons why synchronization is an important task not only for determining a source’s internal state but also for improving predictions of future measurement outcomes. Generally, HMCQSs have inherent stochasticity. Periodic sources are an exception. Through synchronization, we may substantially reduce the uncertainty in measurement outcomes. In the extreme case where a source’s internal state has only one possible transition, this uncertainty vanishes, and we can measure the next qudit with a PVM which has a deterministic outcome.
For example, if we know a - Quantum Golden Mean generator is in state B, the next qudit is , and we will always see ‘0’ if we apply the measurement . Thus, we consider synchronization as a form of dynamical inference where an observer uses knowledge of both a source’s internal structure and a sequence of measurement outcomes to inform a more accurate prediction of future measurement outcomes. Each mixed state corresponds to a different prediction.
6. Quantum Process System Identification
Synchronization is a task that is performed when an observer has an accurate model (here, a HMCQS) of the quantum information source. The next natural question is the following: How does an observer create an accurate model of an unknown quantum information source? This section begins to answer this question. It starts by briefly reviewing how one infers a classical information source from data. It then discusses how to identify the state of a single qudit with quantum state tomography. By combining these two ideas we arrive at an inference method for stationary qudit sequences.
6.1. Classical System Identification
An observer of an unknown stationary classical process is limited to use only the available data—distributions of words over symbol alphabet —to infer the source’s structure. We denote the distribution of length-ℓ words as .
For , we obtain a distribution over symbols in . With , we can reconstruct a memoryless (i.i.d.) model of the source with one internal state, and each is obtained directly from .
For , we use to create conditional probability distributions for the next symbol conditioned on the previous one; i.e., , for all , where . From these conditional distributions, we may construct a Markov approximation of the source with states, one corresponding to each symbol. The conditional distributions set the transition matrices for Markov approximation of the source, i.e., .
Similarly, for , we obtain a length-ℓ HMM approximation of the source by conditioning on length words, each of which corresponds to a different internal state. In this simplified picture, the number of internal states of the model grows exponentially with ℓ, as the number of possible words of length ℓ is . This leads to numerical problems when inferring processes with correlations over long periods of time. That said, there are methods for both combining states that reflect identical predictions of future symbols and for performing inference over machine topologies with a certain number of states—known as Bayesian Structural Inference [54]—to determine the most likely -machine for the source.
6.2. Tomography of a Qudit
We cannot directly apply the procedure for inferring classical source dynamics from word distributions to qudit processes, since observations depend on the measurement basis/protocol one uses. To fully characterize a stationary quantum source, we must instead take many measurements in different bases to reconstruct the qudit density matrices. This task is known as quantum state tomography. We begin by inferring an individual qudit state before introducing a general method for quantum system identification using separable sequences of qudits.
Tomographic reconstruction of a single unknown qudit density matrix through measurement is challenging for two main reasons:
- ’s complete description requires a number of parameters that scales exponentially with the state’s Hilbert space dimension.
- Quantum measurement is probabilistic, so one must prepare and measure many copies of to estimate a single parameter.
Specific combinations of measurements are particularly useful for this task. For example, can be inferred by measuring with a set of mutually unbiased bases (MUBs) [3] or a single informationally complete POVM (IC-POVM) [4]. For a qubit, one possible MUB consists of the x, y, and z bases. By measuring many copies of the qubit in each of these three bases, one obtains three probabilities— , , and —that uniquely determine the density matrix . We do not discuss the necessary number of measurements to determine these parameters to within a desired tolerance for general qudit tomography, a question which is well studied [33,55].
An example of an IC-POVM for a qubit consists of the projectors onto states:
This is also a symmetric IC-POVM, or SIC-POVM, because any combination of two projectors has the same inner product.
By measuring many identical copies of with the same IC-POVM, one obtains a probability distribution over the possible measurement outcomes. This provides parameters (due to normalization) that uniquely determine the density matrix .
The existence and properties of SIC-POVMs in higher-dimensional Hilbert spaces is an active area of research [56].
6.3. Tomography of a Qudit Process
Now that we know how to estimate the density matrix for an individual qudit, we can begin analyzing length-ℓ density matrices . We will measure each qudit in turn rather than perform a joint measurement over the entire length-ℓ block of qudits. This is a luxury afforded to us because we are focusing on separable qudit sequences—the generic case of entangled qudit processes requires measuring in nonlocal bases.
For , we reconstruct as described above and obtain a memoryless (i.i.d.) estimate of the source that emits qudit at every time step, following Equation (4). Unless is a pure state, there are many single-state HMCQSs that generate this process, since many different pure-state ensembles correspond to the same density matrix. A unique memoryless model of the source may be obtained by diagonalizing and having the source emit each pure eigenstate with probability equal to the corresponding eigenvalue .
For , we must reconstruct the two-qudit density matrix . (Recall that our indexing is left-inclusive and right-exclusive; therefore, is the joint state of the qudits for ). Due to stationarity, (the joint state of the two qudits) must be consistent with the one-qudit marginals, i.e., .
We will describe the iterative procedure for reconstructing for qubits in detail. When , has 15 real parameters that must be determined via tomography. Tomography on the one-qubit marginals determines three parameters, and the condition of stationarity fixes three more. The state can be reconstructed fully by considering combinations of the set of mutually unbiased measurements. For two qubits, this means the 16 combinations of Pauli matrices ( , , , and so on) [57]. To fully characterize , nine of these values must be determined—those not involving the identity operators and which are fixed by the one-qubit marginals. For , this procedure can be modified by using a set of mutually unbiased bases in that higher-dimensional Hilbert space.
After determining , one can continue on to determine (63 real parameters for qubits). Many of these parameters are fixed by the previous tomography on the one-qubit marginals (3 parameters), the two-qubit marginals (15 parameters), and their stationarity conditions (6 and 15 parameters, respectively). Combinations of three one-site Pauli matrices are sufficient for full reconstruction. One may continue this procedure for larger ℓ, typically until the number of measurements becomes experimentally infeasible.
6.4. Cost of I.I.D.
Quantum information sources are often assumed to be i.i.d. [6,33]. If an observer performing quantum state tomography assumes that an unknown quantum information source is i.i.d., then they will not go beyond determining the one-qudit marginal . If the qudits are instead correlated, this will lead to an overestimate of the source’s randomness. They will erroneously conclude that each qudit is in state and that the entropy rate is bits per time step. The latter overestimates the true entropy rate by a factor of . An observer can obtain better estimates for the entropy rate and other informational quantities by following the above procedure and tomographically reconstructing blocks of qudits of length ℓ.
To demonstrate the degree to which this assumption may mislead an observer, consider the process generated by a nonunifilar qubit source in Figure 12. Assume that the source begins in its stationary state distribution: . For any p such that , is the maximally mixed state, and an observer assuming an i.i.d. process estimates that bit per time step. This is only an accurate description of the source for , whereas, for many values of p, this source has significant correlations between subsequent qubits.
Let us take a closer look at the extreme values of p. For , the process is period-2 with bits per time step and bit. This pattern can be easily detected by measuring in the basis, where a measurement of 0 (1) immediately synchronizes an observer to state B (A). As , the two source states become increasingly disconnected, , and . An observer measuring in the basis observes a + (−) and is likely to measure another + (−). This source’s rich and varied behavior at different values of p will go entirely unappreciated when considering only the one-qubit density matrix, .
6.5. Finite-Length Estimation of Information Properties
We just saw an example of how, if an observer assumes a source is i.i.d., they will generally underestimate the structure and correlation of the quantum process and will overestimate its entropy rate. The same is true if they only perform tomography on blocks of qudits up to finite length ℓ. Consider now an experimenter who tomographically reconstructs the density matrix and then assumes there are no additional (longer-range) correlations within the qudit process.
Their estimates for the quantum entropy rate, quantum excess entropy, and quantum transient information are given by Equations (38), (45), and (47), respectively.
The difference between the length-ℓ estimate of an information property and its true value depends on the process’s internal structure and quantum alphabet. The estimates for the nonunifilar and unifilar qubit sources in Figure 23 and Figure 24 represent two extremes in this respect. As previously discussed, the single-qubit density matrix for the nonunifilar qubit source is the maximally mixed state. If an observer instead reconstructs , they significantly improve their estimate of s, , and . However, for , these estimates do not improve dramatically. There is always a tradeoff between the number of experiments necessary to reconstruct the process tomographically and the accuracy of the estimates obtained from that reconstruction. For this source, strikes a balance between those two resources.
In contrast, for the unifilar qubit source, the estimates of s, , and improve steadily as ℓ increases. Longer-range correlations are captured by increasing the length of the reconstructed density matrices . When observing this source, it is likely worth finding for the largest ℓ that is experimentally feasible.
When faced with an unknown quantum source, how does one pick an appropriate value of ℓ? One strategy is to increase ℓ until the correction made to the relevant information quantities by going to is below some threshold. Stationarity (and the resulting concavity of the quantum block entropy) ensures that future corrections will also be below that threshold.
6.6. Tomography with a Known Quantum Alphabet
If an observer has additional knowledge of what possible pure states an HMCQS may emit (i.e., the quantum alphabet ), they can leverage this knowledge to simplify the task of system identification by inferring the word probabilities of the underlying classical process rather than performing full tomographic reconstruction. For qubits, we can represent this simplification geometrically via the Bloch sphere; see Figure 25. Each point on the surface of the Bloch sphere represents a pure qubit state in —such as and on the poles of the z axis—and each interior point represents a possible qubit density matrix. The following assumes that each element of is unique.
6.6.1. l=1
The length-1 density matrix must satisfy the equation
After finding via tomography, one may rearrange this equation to infer the length-1 word distributions of , given that . The feasibility of this task depends on the relationship between and d. For the qubit case ( ), one can uniquely infer if , assuming no degenerate states in .
When is known, it may not be necessary to fully reconstruct . We first present several simple examples before giving a general algorithm for finding without performing full-state tomography on .
For and , the possible values of are restricted to a chord within the Bloch sphere defined by with . One needs only to determine the parameter p rather than reconstruct in its entirety. An observer can also pick a uniquely informative measurement to determine p. The optimal PVM for doing so is one whose antipodal projectors can be connected with the diameter of the Bloch sphere that runs parallel to the line of possible values of . For example, if , then the set of possible density matrices lies on the line segment in Figure 25a. The best PVM is then with and measurement outcomes (corresponding to a projection on pure-state ) and (corresponding to the orthogonal projector). In this case, it can easily be shown that and that . and would also be able to determine p but require more samples to determine p to within some desired tolerance.
For and , the possible values of are confined to a simplex in the Bloch sphere defined by , with and . One may determine the parameters and rather than the three parameters usually required to characterize a qubit mixed state. There are two simple choices of measurements to do so: using a IC-POVM or using two different PVMs.
For the first case, consider measuring with a SIC-POVM with elements , with each described by Equation (52). The probability of observing measurement y can be written as
One can rearrange the system of 4 equations (one for each POVM element) to obtain a unique set of ’s.
Alternatively, one uses two PVMs whose projectors can be connected by (ideally orthogonal) diameters of the Bloch sphere that are parallel to the simplex of possible values. This will yield two parameters that uniquely determine a point on the simplex. An example with is shown in Figure 25b, for which the possible values of are confined to the interior of a triangle in the Bloch sphere. One can determine and by measuring with orthogonal PVMs and (among many other combinations), in which case ‘+’ , and ‘0’ .
For and , the possible values of are confined to a tetrahedron in the Bloch sphere whose vertices are the elements of , and one cannot uniquely infer the classical symbol distribution from a fully reconstructed . For example, if , and is the maximally mixed state, this could correspond to any mixture of the form with . Thus, for and , the tomographic advantage to knowing is reduced but not eliminated, as an observer can immediately exclude any value of that lies outside the convex polyhedron defined by the elements of . This is shown in Figure 25c, where the region of possible values is confined to less than of the volume of the Bloch sphere.
Similar simplifications apply for (qutrits) when is known. Full tomography of an arbitrary mixed qutrit state requires the determination of eight parameters, whereas determining the classical distribution given requires parameters. This presents an advantage in general when . We do not explicitly construct measurements that can realize this advantage, as a geometric understanding of mixed states over the eight-dimensional space is significantly more involved than the three-dimensional Bloch sphere describing mixed states over .
For a generic qudit, the number of parameters required for full tomography is . And so, we expect that knowledge of gives a clear tomographic advantage (fewer parameters must be determined) when .
We are now prepared to give a general protocol for and arbitrary d. We wish to recover the underlying distribution of alphabet states ( , ) from measurement statistics alone. First, we construct a POVM with elements: for each in , and . Each is a parameter which can be varied to ensure that is positive semi-definite. By applying this POVM to , we obtain a distribution ( ) over the possible measurements. The first n are related to our desired distribution by
with one equation for each . If a set of ’s is a solution to this system of linear equations, it is consistent with the observed measurements. The solution will be unique for .
6.6.2. l=2
Knowing provides further advantage when considering the tomography of multiple qudits. The distribution over classical words of length ℓ has parameters, whereas full tomography of ℓ qudits requires the determination of parameters.
For ,
Once again, we consider the case of and explicitly. There are four length-2 classical word probabilities, but there are three constraints imposed by (i) normalization, (ii) stationarity, and (iii) consistency with the one-qubit marginal. Thus, one only needs to determine a single parameter to reconstruct .
Consider the task of reconstructing the length-2 density matrix produced by the - Quantum Golden Mean generator in Figure 6 with the knowledge that . One would first analyze the one-qubit density matrix to find that and .
The word probability is the only necessary additional information to find . The following simple measurement protocol can determine : Measure two consecutive qubits with . If the qubit is in state , one will never see outcome ‘−.’ If the qubit is in state , one will see outcome ‘−,’ with probability . And so, ‘ ’ .
This procedure can be generalized to arbitrary two-element alphabets . First, measure two consecutive qudits with a PVM , where one element is a projector onto with outcome ‘1’ and the orthogonal projector corresponds to measurement outcome ‘ ’. Second,
This provides a clear advantage over the usual nine parameters necessary to reconstruct , as it takes into account that the one-qubit marginals and stationarity each impose three constraints.
For and , we must determine additional parameters of the underlying classical distribution over . We do so by repeatedly applying the SIC-POVM with elements , with each described by Equation (52).
The probability of observing the length-2 measurement sequence can be written as
There are s. If a set of s is a solution to this system of equations, it is consistent with the measurement statistics, and the solution will be unique for .
For and arbitrary d, we may construct a POVM from , as we did for . It is then possible to infer a set of that solves the resulting system of equations from measurement statistics. For length-2 words, the equations are
There are equations, one for each length-2 measurement sequence corresponding to the POVM elements . Calculating the other possible outcomes (corresponding to ) is redundant due to normalization.
6.6.3. l≥3
Extending this analysis to a length-ℓ density matrix takes the form
where each has the form of Equation (5). One can determine the length-ℓ word distributions uniquely for the general case where .
The various measurement strategies explored above for can be extended to arbitrary values of ℓ. We will explicitly describe two: using a SIC-POVM for and using a POVM constructed from for arbitrary d.
Consider repeatedly applying the SIC-POVM with elements , with each described by Equation (52). Taking ℓ measurements, one observes a length-ℓ word . is the four-element alphabet of measured symbols.
The probability of observing the length-ℓ measurement sequence can be written as
where , and the factor of comes from the POVM elements. Each of the sequences has a probability that can be estimated from measurement. This system of equations can be solved to find underlying length-ℓ word probabilities that are consistent with measurements. If , this solution is unique.
We can also measure ℓ times with a POVM constructed directly from . In this case, the resulting equations are
As before, one infers the underlying word probabilities ( ’s) uniquely in the case where .
6.7. Source Reconstruction
After observing a separable qudit process and finding the length-ℓ density matrix , can one infer the HMCQS that generated it? The following reconstructs the source that generates the of an unknown process for different values of ℓ. Note that a source which generates may fail to generate .
6.7.1. l=1
After determining , an observer may construct an i.i.d. approximation of the quantum information source. Given any decomposition of into pure states —as in Equation (3)—the corresponding HMCQS consists of one internal state , and the single transition probability for each is its corresponding probability in the decomposition of . A unique model may be obtained by taking ’s eigendecomposition. In this case, the model emits each eigenstate with a probability equal to the corresponding eigenvalue: .
6.7.2. l=2
With , an observer begins to model a source that generates correlations between qudits. Doing so requires finding a separable decomposition of of Equation (53)’s form. If is known, multiple procedures for finding such a separable decomposition have been introduced above. For a generic two-qudit density matrix, determining whether it is separable or entangled is generally NP-hard [58]. There are also many necessary and sufficient conditions for separability; for example, there is the Positive Partial Transpose (PPT) criterion [59].
The following assumes that the observer has a separable decomposition of into alphabet states that may or may not be the source’s alphabet . In general, the sets of basis states in the decomposition of a two-qudit density matrix may differ, but we require a symmetric decomposition such that the basis states of both qudits are . From this decomposition, they can construct an HMCQS with an underlying Markov dynamic described by Equation (9). This HMCQS has internal states and transition probabilities that can be calculated from . Different separable decompositions of yield different HMCQSs, whose statistics over longer-length sequences may differ. Determining which is a more accurate model of the source requires performing tomography on to refine the model.
6.7.3. l≥3
Finding a separable decomposition becomes more computationally expensive as the number of qudits increases [60]. Nevertheless, if one obtains a separable decomposition of where the basis states for all 3 qudits are , then one can create a length-2 Markov approximation of the source. Each length-2 sequence of qudits in corresponds to a different internal HMCQS state. Thus, it has internal states, unless some length-2 sequences are forbidden. The transition probabilities take the form . Only transition probabilities that obey concatenation are nonzero.
One can continue in this manner, approximating the source given a separable decomposition of , to obtain an HMCQS with states or fewer if some sequences are forbidden. Each state corresponds to a word of length , where the pure states composing the word are drawn from . The number of internal states in the model grows exponentially with ℓ, but not all of these states may lead to unique future predictions. If so, they can be combined without a loss of predictivity, as is done in classical computational mechanics. A general algorithm for doing so is beyond the present scope.
6.8. Discussion
The preceding section detailed many aspects of identifying an unknown quantum process by tomographically reconstructing the length-ℓ density matrices . When correlations exist between qudits, this provides a predictive advantage over the common assumption that sources are i.i.d.. Starting with the method for reconstructing classical processes, we developed a variety of measurement protocols for different values of d, , and ℓ to find a process’s statistics when is known. We then introduced effective models for quantum sources derived entirely from separable decompositions of the density matrices reconstructed via tomography.
Since our aim is to harness correlations to improve predictions of future measurement outcomes, it is worth asking, how accurately can future measurement outcomes be predicted? To begin to answer this, the following now defines maximally predictive measurements for the special case of and for arbitrary ℓ.
For a correlated qudit process, an observer with knowledge of the length-2 density matrix may perform a measurement on the first qudit and condition upon the outcome to reduce their uncertainty when measuring the second qudit. They apply (with possible outcomes ) on the first qubit . This leaves the joint system in the classical quantum state:
where the are the qudit density matrices conditioned on outcome .
The conditional von Neumann entropy of the second qubit is then
The rank-one measurement with the minimal uncertainty in measurement outcomes is in the eigenbasis of . Different values generally correspond to different minimal-entropy measurements on the second qubit.
For two qubits, we can find the PVM for which the conditional von Neumann entropy of the second qubit is minimized. This leads to a basis-independent property of the process:
where the minimum is taken over all PVMs on .
If an experimenter reconstructs , a measurement on all but the last qudit in the block with a measurement protocol with possible outcomes leaves the block in the classical-quantum state:
To minimize the conditional von Neumann entropy of the ℓth qubit, we calculate
where the minimum is taken over all local measurement protocols on .
Further development necessitates exploring the space of measurement protocols to find those with the greatest predictive advantage over i.i.d. models for arbitrary separable qudit processes.
Finally, we note that our procedure for source reconstruction requires a number of model states that grows exponentially with ℓ. Future work on finding minimal models from density matrices will also require combining states with identical predictions and performing inference over possible model topologies, as with classical Bayesian Structural Inference.
7. Conclusions
Inspired by prior information-theoretic studies of classical stochastic processes, we introduced methods to quantify structure and information production for stationary quantum information sources. We identified properties related to the quantum block entropy that allow one to determine the amount of randomness and structure within a given qudit process. We gave bounds on informational properties of the resulting measured classical processes. In particular, we showed that they cannot have a lower entropy rate or block entropy (at any ℓ) than the original quantum process.
We analyzed a number of hidden Markov chain quantum sources (HMCQSs), explaining how an observer synchronizes to a source’s internal states via measuring the emitted qudits. If the source allows for synchronizing observations, then we showed that adaptive measurement protocols are capable of synchronizing and maintaining synchronization when fixed-basis measurements cannot.
Sequels will extend these methods and results in a number of ways. Despite focusing here on separable quantum sequences for simplicity, entangled qudit sequences can similarly be studied by combining an HMCQS and a D-dimensional quantum system capable of sequentially generating matrix product states [24]. Doing so will open up the study of entropy convergence of matrix product operators [61].
Many results exist for classical stochastic processes that may be extended to quantum processes. For example, there exist closed-form expressions for informational measures for nondiagonalizable classical dynamics [50,62,63,64]. Extending these to quantum dynamics would allow for more accurate determination of the quantum information properties introduced here. Similarly, the preceding lays the groundwork for fluctuation theorems and large deviation theories of separable quantum processes. Finally, it will be worthwhile to develop a causal equivalence relation for quantum stochastic processes and develop quantum -machines by extending classical results [65].
Separable quantum sequences also serve as a resource for information processing by finite-state quantum information transducers that transform one quantum process to another. Beyond interest in their own right, such operations have thermodynamic consequences, either requiring work to operate (as overcoming dissipation induced by Landauer erasure [66]) or acting as a quantum version of information-powered engines capable of leveraging environmental correlations to perform useful work [26,67,68]. This behavior has already been demonstrated for certain quantum processes [69].
Finally, spatially extended, ground, and thermal states of spin chains under various Hamiltonians are quantum processes. And so, the quantum information measures introduced here can serve to classify these states according to the source complexity required to generate them.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Heisenberg W. Über den anschaulichen inhalt der quantentheoretischen kinematik und mechanik Z. Phys.19274317219810.1007/BF 01397280 · doi ↗
- 2Peres A. Two simple proofs of the Kochen-Specker theorem J. Phys. A 199124 L 17510.1088/0305-4470/24/4/003 · doi ↗
- 3Wootters W.K. Fields B.D. Optimal state-determination by mutually unbiased measurements Ann. Phys.198919136338110.1016/0003-4916(89)90322-9 · doi ↗
- 4Renes J.M. Blume-Kohout R. Scott A.J. Caves C.M. Symmetric informationally complete quantum measurements J. Math. Phys.2004452171218010.1063/1.1737053 · doi ↗
- 5von Neumann J. Mathematical Foundations of Quantum Mechanics: New Edition Princeton University Press Princeton, NJ, USA 2018
- 6Wilde M. Quantum Information Theory 2nd ed.Cambridge University Press Cambridge, UK 2017
- 7Shannon C.E. A mathematical theory of communication Bell Sys. Tech. J.194827379656379–423+623–65610.1002/j.1538-7305.1948.tb 01338.x · doi ↗
- 8Schumacher B. Quantum coding Phys. Rev. A 1995512738274710.1103/Phys Rev A.51.27389911903 · doi ↗ · pubmed ↗
