Quantum Shannon Information Theory—Design of Communication, Ciphers, and Sensors
Osamu Hirota

TL;DR
This paper explores quantum Shannon information theory and its potential to improve communication systems, ciphers, and sensors beyond classical limits.
Contribution
The paper introduces a theoretical framework for overcoming limitations in quantum Shannon information theory and presents a perfect secure cipher and sensors as examples.
Findings
Quantum Shannon information theory can surpass classical communication performance if properly applied.
There are significant limitations in utilizing quantum effects for real-world communication systems.
A perfect secure cipher is proposed that does not degrade communication performance.
Abstract
One of the key aspects of Shannon theory is that it provides guidance for designing the most efficient systems, such as minimizing errors and clarifying the limits of coding. This theory has seen great developments in the 50 years since 1948. It has played a vital role in enabling the development of modern ultra-fast, stable, and highly dependable information and communication systems. Shannon theory is supported by statistical communication theories such as detection and estimation theory. The theory of communication systems that transmit Shannon information using quantum media is called quantum Shannon information theory, and research began in the 1960s. The theoretical formulation comparable to conventional Shannon theory has been completed. Its important role is to suggest that application of quantum effects will surpass existing communication performance. It would be meaningless if…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Information and Cryptography · Quantum Mechanics and Applications · Molecular Communication and Nanonetworks
1. Introduction
In the 1970s, while upholding the MIT tradition, R.S. Kennedy and H.A. Haus of MIT shifted communications science toward a research direction that incorporates fundamental theories of communications that take quantum effects into account (Appendix A). Kennedy took the lead in this direction and began nurturing young researchers, which has resulted in research that has had a global impact in a variety of fields. The current boom in quantum information science can be traced back to their leadership.
In this field, theory for systems that transmit conventional information using quantum states is called quantum Shannon information theory. In order to construct such a theory, probability theory is needed to describe quantum mechanical phenomena in an information-theoretic way. It is formulated as quantum probability theory, which has a different mathematical system from conventional probability theory and can faithfully describe quantum mechanical phenomena. For these reasons, it is necessary to build communication theory for the above communication systems based on quantum probability theory. It is called quantum communication theory (or quantum detection and estimation theory) and was developed by Helstrom [1], Holevo [2], Yuen [3], and other researchers.
Like conventional Shannon theory, the construction of quantum Shannon information theory started based on quantum communication theory, the studying of the properties of Shannon mutual information in systems with quantum effects. The most important achievement of the evolution from quantum communication theory to quantum Shannon information theory is the proof of Shannon’s second theorem (channel coding theorem) for quantum communication systems. First, Hausladen et al. of the Jozsa group proved it by skillfully using the properties of random coding and quantum measurement on a typical subspace of the Hilbert space spanned by pure signal quantum states for the case where the quantum state communication channel is noiseless [4].
Following from this result, Holevo [5] and Schumacher-Westmoreland [6] proved that the discrete channel capacity of a general quantum channel with noise is given by Holevo information. This is now called the Holevo–Schumacher-Westmoreland theorem.
It is well known that the theorems on the coding limit are the most important ones in the application of information theory among the major theorems in information theory. In particular, the channel coding theorem shows that there exists a coding scheme that is error-free for an infinitely long typical sequence and that the information rate is ultimately equal to the channel capacity.
In response to such a limiting existence theorem, Gallager completed a theory that does not use a typical sequence and gives a limit to the average error probability of code words for a finite-length code system. This allows us to know the limiting characteristics of errors for a finite-length code system for any information rate up to the channel capacity when a channel matrix is given. This is called the theory of reliability functions, and it is well known that it made a major contribution to the process of constructing Shannon theory [7].
Thus, as a natural evolution of research, Holevo’s group introduced a reliability function corresponding to quantum communication channels following Gallager’s concept [8,9]. In addition, Helstrom, C. Bendjaballa et al. [10,11,12], and M. Ban et al. [13,14] formulated the cut-off rate of quantum systems. These trials are applications of Gallager theory to quantum systems. Just as Gallager’s reliability function theory was effective in analyzing communication systems following classical physics, the quantum reliability function and cut-off rate are also considered to be effective in analyzing communication systems with quantum effects. Based on this perspective, Holevo has completed a unified mathematical foundation for quantum systems [15].
In this paper, we summarize the development from quantum communication theory to quantum Shannon information theory in Section 2, Section 3, Section 4 and Section 5 and explain its applications to real-world communication technologies, showing quantum advantage for communication, ciphers, and sensors. The most dramatic result of this theory is the discovery of quantum stream ciphers, which solve the Shannon impossibility theorem for symmetric key ciphers and realize perfectly secure ciphers with short keys.
2. General Quantum Communication Channel Model
In this section, we denote the theoretical structure of general quantum systems and provide an explanation of the mathematical theory of quantum communication channels. In quantum theory for transmission of the information defined by Shannon, Shannon information as a finite set of alphabets is mapped to a quantum state. Therefore, the overall communication channel consists of a quantum state-to-quantum state communication channel and a quantum measurement channel that converts the received quantum state into a classical signal via measurement (Figure 1).
In order to analyze them as information theory, it is necessary to set up a communication channel model. A communication channel that transmits a quantum state onto which the signal is mapped is called a quantum state transmission channel. This channel is described by a completely positive map in the general mathematical sense [15].
Definition 1. *Let be two -algebras and be an operator system. A linear map is called completely positive if is positive for all and for all . The space of all such maps is denoted by .
The key issue is the physical reality of such abstract descriptions. In Shannon theory, a channel is considered noiseless when the relationship between the input and output symbols is injective. From this perspective, it is important that the quantum state carrying Shannon information does not lose its quantum properties along the channel. Fortunately, the following theorem has been shown for real-world channels by Helstrom [16].
Theorem 1( ). In the transmission of quantum states through an optical communication channel with energy loss, the quantum state that can maintain a pure state is a coherent state which is defined by , where is a photon annihilation operator.
This theorem has an affinity with modern communication technology. That is, the maximum capacity of reliable information transmission in general optical channels such as fiber and free space is achieved by a coherent state, considering the channel characteristics, the bandwidth of the light source, and the receiver.
In fact, in optical fiber communication, energy loss is the main issue. When the transmitting quantum state is a coherent state, which is the quantum state of general laser light, it maintains a pure state even if there is any loss, so it is treated as noiseless in the sense of quantum state transmission.
In free-space optical communication, in addition to energy loss, external noise is added, so the quantum state transmission channel becomes a noisier channel (Figure 2). But still, the coherent state provides the maximum capacity, which is discussed in Section 5.
Remark 1. When a single photon (number state), a squeezed state, quantum entangled states, etc., are used as the transmitter state, energy loss or decoherence transforms these states into a mixed state, and the channel becomes non-injective. Thus, light sources of quantum states with strong quantum properties cannot meet the performance requirements of modern communication, which requires over 100 Gbit/s.
Next, it is necessary to describe the process of identifying the quantum state that has arrived at the receiver. In this quantum measurement process, unavoidable noise due to the most basic observation operation in quantum mechanics appears. If this is modeled as quantum noise, it can be modeled as a quantum measurement communication channel. In other words, unavoidable errors occur in the identification of quantum states, and an optimization problem arises regarding their decision.
This type of model was proposed by Helstrom in 1967 and is called quantum detection and estimation theory [1]. The noise model of the photoelectric conversion process in the receiver of laser optical communication is the simplest physical example of quantum noise. In order to describe this quantum measurement channel, quantum probability theory, which describes the quantum measurement process, is required. Below we briefly present the mathematics of quantum probability as a basis for the following discussion.
3. Mathematical Foundation of Generalized Quantum Measurement and Decision Operator
3.1. Mathematical Formulae of Generalized Quantum Measurement
The mathematical structure of quantum mechanics is given based on the Hilbert space theory constructed by von Neumann. The representation of the quantum measurement process is formulated using the eigen state and eigen value of a self-adjoint operator. This is defined as follows.
Definition 2. Let us assume that the quantum system has the self-adjoint operator as physical observable and its quantum state ρ. The Born rule in the standard quantum measurement of the observable is described as follows:
where is called projection valued measure.
The set of density operator is a convex set, its extreme points being the one-dimensional projection. The corresponding states are called pure. Any measurement with values in the real number space is described by an affine map of the set of the states (density operator) to the set of probability distributions on .
However, P. Benioff and others have been discussing the mathematical generalization of the projection process to describe the diversity of measurement processes. Finally the generalized quantum measurement process is described in the following form [17].
Definition 3. Let us consider a generalized resolution of identity , i.e., the collections of Hermitian operators, satisfying . establishes the one-to-one correspondence between affine maps of the set of density operators to the set of probability distributions on and the resolution of identities. In some cases, is called the positive operator valued measure (POVM).
This formulation makes it possible to formulate the Born effect in quantum measurements without going through concrete physical quantities (observable).
3.2. Quantum Decision Operator
In quantum communication theory, the POVM is used as the decision operator by following operational correspondence.
Let us assume the set of quantum states. The decision result through the quantum measurement is described by a compact set of the decision operator , with , such that
where I is the identity operator.
Remark 2. The decision operator does not just represent the probability of a quantum measurement process but the probability of error or detection by the receiver’s decision. In other words, it must be understood that it involves a decision by the observer [1].
The above remark, from the origins of Helstrom’s formulation, is easier to understand if we interpret the quantum decision operator as the generalization of Wald’s decision function in a classical system [18].
In classical communication theory, the decision is applied to the given probability function of the variable of the received signal. When one applies the standard quantum measurement, the probability function is given, and the decision function is applied to its probability function as in classical detection theory. However, in the quantum case, the decision operator does not need the explicit probability function of the variable of the received signal. The decision operator directly outputs the probability value of the correctness or incorrectness of the decision without going through the probability function of the measurement process. As an effect of the above fact, the possibility that the discrimination capability of the quantum signal in the quantum measurement process will exceed the discrimination capability by the standard likelihood decision based on standard quantum measurement and its probability function arises. This is equivalent to violating communication-theoretic causality [19,20].
However, this can occur only in the discrete signal set. Its effect disappears in the case of continuous variables. That is, in quantum estimation theory, there is no such quantum advantage. These observations will be discussed in the subsequent sections.
3.3. Decision Operator Based on Entangled Measurement (or Collective Measurement)
As mentioned above, a quantum measurement process is considered a communication channel. In information theory, information is generally transmitted as a code word of length n. When extended to the n-th extension, quantum communication channels need to be treated differently from conventional extension channels.
Let us consider a finite alphabet with quantum state . The code word is a combination of the alphabet with the length n. That is, the input code , and the output is , where i is the order of the sequence in the code, , and . The compound state is in a tensor product Hilbert space . When the n-th extended signal state is used, the operation of treating a sequence of quantum states of length n as a single quantum state in the n-th extended Hilbert space is called collective or entangled measurement-based decoding. In this case, these structures provide unique performance in quantum Shannon information theory, which will be described in the next section. In cases of individual and entangled measurement, optimizing the decision operator that represents the decision process is an important issue.
4. Structure of Quantum Detection and Estimation Theory
4.1. Quantum Detection Theory
4.1.1. Basic Formula
The appearance of quantum effects in the measurement process of signals and the result of the decision are simultaneously characterized by the quantum decision operator. Thus, the quantum Bayes rule is formulated without going through the likelihood ratio as follows:
where the a priori probability must be for admissibility in decision theory. The necessary and sufficient conditions for are given by Holevo [2] and Yuen [3].
Theorem 2( ). The necessary and sufficient condition for on the quantum Bayes rule is given by
On the other hand, following classical detection theory, the quantum minimax rule for a signal system in which the functional with respect to is a convex function is defined as
and the necessary and sufficient conditions are as shown below [21].
Theorem 3( ). The necessary and sufficient condition for on the quantum minimax rule is given by
For further formulae, please see the references.
4.1.2. Useful Analytical Issues
In general, it is very difficult to find the solutions of the above two quantum detection rules. However, when the signal system satisfies certain conditions or the decision operator can be set in advance, the optimal theory becomes very simple. Some examples of these are described below.
A set of quantum states that conforms to the following definition is particularly convenient.
Definition 4. Let G be a group with an operation ∘. The set of quantum state signals is called group covariant if there exist unitary operators such that
It characterizes quantum states [17].
Let us explain the case of setting a decision operator in advance. Belavkin [22] introduced the following decision operator called square root measurement (SRM).
Definition 5. The decision operator called a square root measurement (SRM) is given by
where Γ is a Gram operator.
Based on the above formula, the general properties of the quantum Bayes rule and quantum minimax rule based on the above decision operator are given by Ban and Osaki [23,24].
4.1.3. Decision Operator Based on SRM of Entangled Measurement
In the case of the operation of mapping an individual decision to the quantum states corresponding to each slot of a code word, the decision operator and decision probability are given by
where , for . In this case, there is no correlation in the signal decision process.
On the other hand, one can define the decision operator based on the entangled measurement. Let us denote the quantum state of n-th extended quantum state as follows:
where . Then we can adopt the square root measurement as follows:
where . This is called decision operator based on entangled measurement. In this case, in addition to the quantum effects due to the decision action on each alphabet, quantum effects due to entanglement can be expected.
4.1.4. Quantum Advantage in Detection
The performance of optical communications, which is designed using classical communication theory, is determined by the performance of existing receivers such as photon-counting, homodyne, and heterodyne receivers. If the performance of a communication system designed using quantum theory exceeds conventional performance, it is called quantum advantage or quantum gain. The quantum gain characteristics by the quantum decision operator are exemplified below. The concrete advantage of the decision operator based on entangled measurement is discussed in Section 5.
Theorem 4. If the signal set is covariant, the optimum decision operator is given by the decision operator based on the square root measurement. And the error probability of quantum Bayes and quantum minimax rules for M covariant signals can be given as follows:
where .
Here we show the simplest example. Assuming that the signal system is binary PSK with , the quantum solution is as follows [1]:
This is called the Helstrom bound. The decision operator in this case is given as follows:
where
where .
In the case of a homodyne receiver (classical solution), the measurement operator is
The average error probability is given by the likelihood test based on . It corresponds to the post-measurement procedure. The result does not show quantum advantage.
Separate analysis is required to determine what physical process the quantum optimal solution in Equation (16) corresponds to. An example is the Dolinar receiver [1].
4.2. Quantum Estimation Theory
Quantum estimation theory was also pioneered by Helstrom, Holevo, Personick, and Yuen. Then M.G.A. Paris et al. discussed many aspect of its application [25]. In this section, we show the formulation and discuss a certain problem.
4.2.1. Formulation
When a physical signal parameter is continuous, a set of the quantum states can be treated as a quantum state system corresponding to a continuous signal. Optimization theory goes to quantum estimation from quantum detection theory. Therefore, the decision must deal with a continuous quantity . So for the density operator , the decision is defined as an infinitesimal operator.
where for the correct probability. Thus, quantum Bayes estimation can be formulated by replacing the discrete system with an infinitesimal decision operator. In this case, the optimum POVM corresponds to the measurement process of the observable as the parameter. That is, the optimum operator is given by the eigenstate of the operator of the physical observable. Thus, the estimation mechanism will use the maximum likelihood method, just like in classical methods, based on . Hence, it is easy to understand that it has no quantum advantage in the decision process.
Alternatively, one can adopt the minimum mean square error (MMSE) as the evaluation function. Personick formulated an MMSE estimator which is applicable to the linear filtering of random signal sequences [26]. On the other hand, the Cramér–Rao bound is formulated by Helstrom as follows [1]:
Theorem 5. The estimation bound for a single parameter is given by the following formula:
where is called the symmetric logarithm derivative and is given by
The equality holds in the following case:
where is the estimate, is a function of x, and the estimate operator is given by the operator corresponding to the parameter.
This means that the optimum measurement is the standard quantum measurement and that there is no quantum advantage in the measurement process itself. In addition, there is still a certain difficulty, as discussed in the following.
4.2.2. Example for Coherent-State Signal
In optical communications and optical radar, signals generally have complex amplitudes (quadrature amplitude):
Its quantum counterpart is the photon annihilation operator , where
Here and are non-commutative observables. The problem is how to formulate the simultaneous estimation of such non-commutative observables.
(a) Single-parameter estimation
In general, the density operator of an optical field is given as follows:
where N is background noise energy and is the signal. When we consider the estimation of the single parameter of , the solution is given as follows:
As a result, we have
where is the effect of quantum noise. The estimate operator indicates a homodyne receiver. Thus, it has no quantum advantage.
(b) Non-commutative parameter estimation
Let us consider the simultaneous estimation of the non-commutative observables and . This is important for the evaluation of optical phase estimation such as
In this case, Theorem 5 does not provide the correct solution for the design of a communication system. The quantum optimum measurement for non-commutative quadrature amplitude can be given by the Yuen–Lax quantum Cramér–Rao bound [27].
Theorem 6. The estimation bound for complex amplitudes is given by the following formula:
where the right logarithm derivative is defined by
where is a photon annihilation operator and it corresponds to a heterodyne measurement. Then the bound is given as follows:
where is the simultaneous measurement effect of the non-commutable observables.
If one adopts the separate measurement of the quadrature amplitude by two balanced homodyne receivers, from Theorem 1, the coherent state maintains the coherent state at the output of the half mirror. Here each measurement may adopt Theorem 5.
Each bound is given by
However, one should not ignore the fact that the signal energy is half at each branch. Thus, the signal-to-noise ratio for the quantum part is
This means that a set of two balanced homodyne receiver systems is equivalent to a heterodyne receiver for the coherent-state signal. Theorem 6 includes the above effect under fixed energy conditions.
Now consider the case where the signals are in a state other than coherent state. In separate measurement of non-commutative parameters, the quantum state is disturbed by a loss of half mirror. Therefore, it is not accurate to evaluate communication performance based on the conventional quantum Cramér–Rao bound only.
4.2.3. Application of Lie Algebra for Non-Commutative Parameters
We have been studying the theory of quantum state control and simultaneous measurement of non-commutative quantities based on Lie algebra. Here we introduce it and its application to the formulation of estimation theory. First, we define the formulation as follows.
Definition 6. Algorithm:
- Step 1: Derive the estimation operators for simultaneous estimation of non-commutative quantities based on the symmetric logarithmic differential operator formulae in Equations (32) and (33).
- Step 2: Construct a decision operator that expresses simultaneous measurement of non-commuting quantities based on the minimum uncertain state for the two operators obtained.
(a) Construction of decision operator
Definition 7. An algebra is called a Lie algebra when its multiplication satisfies the following two conditions:
where is the commutation relation.
An example of its application in quantum mechanics is the Heisenberg–Weyl group: . The unitary representation of the subgroup of is
where is the element of Lie algebra.
Definition 8. For any element of a Hilbert space, the quantum state constructed by the formula
is called the generalized coherent state.
Here let us consider non-compact groups in Lie algebras. With respect to Hermite forms, a set that constitutes linear isometric groups is a non-compact group. SU(1,1) is a typical example.
The Lie algebra of SU(1,1) has the following elements: . Their commutation relations are
Then we define . The unitary representation of SU(1,1) is given by
where is given by the property of the Casimir operator. The following state is also the generalized coherent state:
where is called a base state. When the above state satisfies the minimum uncertainty relation for two observables, it is called the ideal generalized coherent state.
Here, for the quadrature amplitude, the elements of SU(1,1) are as follows:
Then we have
Then the following state is the ideal generalized coherent state:
where . Equations (45) and (46) give the minimum uncertainty state for and .
Let us define the decision operator as step 1:
This corresponds to the generalized heterodyne receiver [28]. Let us show how to use the above formula in the following.
(b) Estimation bound
Using these formulae, the modified quantum Cramér–Rao bound for the simultaneous estimation of non-commutative parameters of a coherent-state signal without background noise is given by step 2 as follows [28]:
Then the total minimum bound is
This is equivalent to the Yuen–Lax bound.
4.2.4. Importance of Signal-to-Noise Ratio in Estimation
In general, one should keep the following in mind: The performance of the estimation cannot be evaluated only by the estimation bound. Following communication theory, one needs the concept of the signal-to-noise ratio, which means the operational meaning in the technology.
Let us show an example of the evaluation for a continuous parameter. The squeezed state reduces the quantum noise of one of the quadrature amplitudes, and the purpose is to utilize the signal with that low noise. However, we have the following formulation.
Theorem 7( [29]). We assume that as a signal state. Let us define the SNR of this state as follows:
Under the constraint of the signal energy , the maximum is given by
where
The estimation bound is given by
When the state is a coherent state, .
As mentioned above, low noise does not mean good performance. The is the final evaluation for communication technology.
When there is an energy loss in the channel, the SNR is
The estimation bound is
When the loss is large, the above SNR becomes , which is worse than the coherent state.
Since evaluations for problems in technology require operational meaning, we emphasize the points to note in “quantum communication theory and quantum mathematical statistics” (Figure 3).
5. Quantum Shannon Information Transmission Theory
Shannon theory is a design theory for transmitting signals through noisy communication channels with as few errors as possible. Shannon defined mutual information as the evaluation function, and its maximum value is called the channel capacity. Its operational meaning is the maximum rate (a kind of efficiency) at which errors can be minimized.
In this section, we explain the foundation of design theory for systems in which communication signals are transmitted by optical signals of quantum nature. From Theorem 1, it is necessary to assume a coherent state for quantum design to provide advantages over existing communication capabilities in real-world environments.
5.1. Channel Capacity
5.1.1. Finite, Discrete Alphabet System
Shannon mutual information in the quantum measurement channel is written as follows:
where . The theory of maximization of mutual information is twinned with the above quantum decision theory, and it is denoted as follows:
Thus, it has the same theoretical structure as the optimal theory for detection theory. As a result, the basic result was derived by Holevo [2] as shown below.
Theorem 8( ). The necessary condition for maximum mutual information with respect to the decision operators for a simple set of states is given as follows:
where the solution is called accessible information.
On the other hand, there is a problem of Davies in the case of the optimization of mutual information [30]. A partial solution to this is given by [31] under Jozsa’s guidance.
We consider the concrete property of the above. Several properties of mutual information were discussed in the serial papers by M. Osaki [32]. As in signal detection theory, the mutual information obtained by the optimal decision operator is greater than that obtained by a classical measurement process. The most useful results are as follows.
Remark 3. When the signal states are group covariant, the quantum Bayes and minimax decision operators satisfy the above necessary condition for mutual information. In a real environment with large losses or background noise, no quantum state exists that can exceed the mutual information provided by communication based on coherent states.
Now we still have a difficult problem that is proof of sufficiency. According to Osaki’s analysis, the quantum minimax decision operator may provide the maximum mutual information in the practical region such as in the set of coherent states.
On the other hand, let us consider the n-th extension of the alphabet and adopt the decision operator based on entangled measurement. Then the following conjecture is imposed:
Hausladen et al. [4] proved it for the pure-state channel, and Holevo, Schumacher, and Westmoreland proved it for the general states [5,6].
Theorem 9( ). The upper bound of maximum mutual information and the capacity are given by Holevo information as follows:
where is the von Neumann entropy.
The above theorem is the channel capacity formula for a quantum Shannon information transmission system.
Remark 4. The maximum absolute value of (HSW) capacity in a real environment is given by the coherent state.
The numerical properties of HSW capacity for coherent-state signals have been analyzed by our group [33,34].
5.1.2. Infinite Alphabet System (Continuous)
After the analysis for discrete alphabet systems, the analysis was extended to analog signals with constrained power inputs, among which are channels with additive quantum Gaussian noise. This is the issue of the infinite alphabet scheme. Here we denote a simple result on the final capacity formula of quantum Gaussian channels of free space. According to the quantum Shannon theory established by Holevo and others, the capacity formula of optical quantum communication for free space lossy Gaussian channels is discussed in [35,36,37].
Theorem 10( ). The capacity formula of the single-mode quantum lossy Gaussian noise channel is given as follows [36]:
where the above is given by the coherent state; , and are average photon numbers of the signal and additive noise at the receiver; and 1 of means the quantum noise.
The above formula is, in general, greater than the Shannon classical capacity.
where the above formula is given by a heterodyne receiver. Thus, the main parameter of capacity in Shannon theory for continuous alphabets is the .
5.1.3. Quantum Advantage in Capacity
Let us discuss quantum advantage. In the transmission system of Shannon information, the quantum advantage for a discrete alphabet is given by
where means the capacity for a digital optical fiber communication system designed by conventional communication theory. The above relation of quantum advantages results from the error mitigation effect in the quantum decision mechanism and coding designed by quantum detection theory.
On the other hand, the quantum advantage in the case of infinite alphabets (continuous) is given by
When the background noise is very large, the quantum advantage disappears.
5.1.4. Implementation Problem
It is important to discuss technologies that will actually realize quantum gain in terms of capacity. The quantum gain for comes from the effect of the decision operator of a single shot. The quantum gain of comes from both the decision operator based on entangled measurement and coding scheme. Below are some examples of discrete and continuous systems.
(a) Finite, discrete case
The problem, in this case, is an issue of superadditivity of the capacity with respect to n. There are two elements to achieving superadditivity. One is code construction consisting of a sequence of quantum states. The second is a construction of decision operator and its generalization based on entangled measurement; see Equation (12). Many pioneering analyses of the former have already been published, but the latter remains a very difficult problem. Several challenging attempts based on measurement effect have been made [38,39,40,41], and some analytical examples based on coding and measurement are given in [42,43].
(b) Continuous case
Generally, quantum gain disappears when the background noise is large or the signal energy is large. Therefore, we discussed how to achieve this when there is no background noise and the received signal is very weak from an ultra-long distance. In this case, it is known that the capacity can be attained by discretization [44].
5.2. Quantum Reliability Function and Quantum Cut-Off Rate
5.2.1. Reliability Function
The operational meaning of Shannon mutual information is the efficiency of coding and decoding in a noisy channel. The unit is bits/symbol, and it is not the actual amount of information. Reliability functions are introduced to clarify the operational meaning of Shannon mutual information and channel capacity. Here, we show their quantum equivalents. In general, the reliability function is defined as follows:
The upper bound of the average error probability of a code of length n is as follows:
We will avoid mathematical details and discuss this in a simplified form.
Theorem 11( [8,9]). The upper bound on the average error probability of a pure-state code for a quantum channel is given by
where
where M is the number of symbols.
Now the maximization problem becomes
where is an eigenvalue of and it is given by the eigenvalue of the following matrix:
The above formula assumes the decision operator based on entangled measurement.
If quantum measurements are individual measurements, the reliability function is given by
where and the optimum is given by the minimum error probability conditions [10,11,12]. So this is called semi-quantum reliability function. Ban and Kurokawa clarified the difference between the two definitions [13,14].
5.2.2. Quantum Cut-Off Rate
The quantum cut-off rate is defined from formulation of the quantum reliability function of Burnashev · Holevo in the paper of Figure 4. It is given by [13,14]
Definition 9. The quantum cut-off rate is defined as follows.
where .
When the optimization with respect to is performed, we have the following simple form.
Theorem 12( ). The quantum cut-off rate is given by
under the condition for the optimum values of ξ as follows:
When the a priori probability is uniform, it becomes
where .
Thus, the relation between the reliability function and the cut-off rate is given by
Here let us introduce the definition by Helstrom based on individual measurement.
Definition 10. Here a cut-off rate based on quantum individual measurement is defined as follows:
And the upper bound of the above formula is given by
where . As a result, we have
The example will be given in the next section.
In the above, we introduced the special case of the pure state. For the general case, refer to Holevo’s paper [9].
6. Examples of Reliability Function and Cut-Off Rate
6.1. Finite, Discrete Alphabet System
Let us show some examples of the reliability function and cut-off rate for the finite alphabet.
6.1.1. Analytical Method
The signal system is set to 3-ary PSK, which is composed of coherent states that enable ultra-high speeds over long distances.
In this case, the modified Gram matrix is as follows:
The eigenvalues of the above are
where
where is the average photon number per pulse at the receiver. From these, we can derive the quantum reliability function and quantum cut-off.
On the other hand, if we assume individual measurements, we will adopt individual optimal decisions. In that case, the elements of the communication channel will be as follows:
where
Figure 5 shows a numerical example. The communication system we set up here is intended for deep space communication, where the transmitted signal is sufficiently large and the transmission attenuation is extremely large, resulting in an extremely small received signal.
6.1.2. Quantum Advantage of Decision Operator Based on Entangled Measurement
The role of quantum reliability function theory is to show how small the error rate of the transmitted code can be when the transmission rate of code is set. In the design theory of conventional optical communications, classical devices such as heterodynes and energy detectors are used as receivers, and reliability functions are evaluated using classical theory. The question is what benefits the quantum theory introduced here will bring to conventional optical communications. In quantum theory, quantum effects appear in quantum measurements, i.e., at the receiver. From the above theory, individual measurements only reduce the effect of quantum noise on individual signals. On the other hand, entangled measurements can take into account the quantum interference effect among signals due to measurement in addition to the individual quantum effects, thereby further reducing the quantum noise effect and increasing the reliability function and cut-off rate. As a result, when the code error rate is fixed (depending on the application), the code length required to achieve the same error rate can be significantly shortened. See Figure 6.
6.2. Infinite Alphabet System (Continuous)
6.2.1. Reliability Function
The case of continuous systems with an infinite alphabet was introduced in the paper of Figure 7 [46]. This discussion is extremely important because it can clarify how quantum gain appears, i.e., in comparison with discrete systems.
Let us take the input alphabet A, an arbitrary Borel subset in a finite-dimensional Euclidean space. The input is described by an a priori probability on A. The energy constraint is posed as follows:
We adopt the product Hilbert space with the input alphabet ^n^ consisting of code words of length n, and the density operator
We define the set of code of size M that is a sequence . is a family of decision operators in based on entangled measurement, satisfying . The error probability for a code word is , where .
Let us consider the speed of the exponential decay of the error probability when and below the following capacity:
Following the Shannon idea of random coding, let us consider the random ensemble of M code words of length n. We have the following theorem [46].
Theorem 13( ). Let us define as the minimum error probability with respect to coding and the decision operator. Here we introduce the reliability function as follows:
The lower bound of the reliability function is given by
where and mean the random coding bound and expurgated bound, respectively. They are
where
*where f is a function satisfying the condition for the central limit theorem for a random variable *
This corresponds to the energy constraint.
The above equations are primitive formulae of quantum reliability function theory [46].
6.2.2. Cut-Off Rate
In general, it is difficult to obtain the reliability function. Therefore, it is useful to define and calculate the cut-off rate in the same way as in the discrete system. It is given by the following relation:
where s is fixed to 1.
6.2.3. Example of Cut-Off Rate for Gaussian Channel
In real communication, the only useful quantum states of continuous quantities are coherent states. Let us assume that the coherent-state alphabet is disturbed by Gaussian noise.
Even in this case, it is difficult to obtain an exact formula for the reliability function. However, we have the exact solution of the cut-off rate as follows:
where , is the energy of the code word, and , with
In this case, the quantum gain is only the entangled effect of the decision process, i.e., the double quantum gain disappears as in the discrete system.
6.3. Importance of Cut-Off Rate and Quantum Advantage
According to reliability function theory, when the rate exceeds the cut-off rate and enters the capacity region, an extremely long code length is required to achieve a sufficiently small error rate. This means increased communication delays and has come to be seen as the biggest drawback of modern communication technology. Therefore, rather than achieving capacity, a coding technique that minimizes code length in the cut-off rate region is essential. Based on the theory in this section, it is preferable to utilize the effect of entangled measurement to shorten code length rather than increase capacity. This is because the amount of information that can be transmitted (bit/s) can be increased by orders of magnitude with optical communication. Therefore, there is no longer any need to forcefully achieve channel capacity, which is efficiency. Rather, delay is the major issue.
In other words, we can point out the following: The primary requirements of modern communication systems are low latency and ultra-high speed. This is because communication systems are fundamentally bidirectional. The challenge for the former lies in shortening the code length, while the latter depends on bandwidth characteristics. Since channel capacity is merely a matter of efficiency, if sufficient bandwidth is available, there is no need to pursue channel capacity. Channel capacity becomes a concern only in inherently unidirectional communication scenarios, such as deep space. Quantum systems, being based on optical communication, have ample bandwidth. However, shortening the code length remains a challenge. Therefore, as pointed out here, it is desirable to develop techniques for shortening code length based on the cut-off rate. Further details will be discussed in the following paper.
7. Discovery of a Cipher That Breaks the Shannon Impossibility Theorem
In the previous section, applications of quantum effects to the modern optical communications were considered for achieving high-performance communication based on quantum Shannon theory.
In this section and the sections that follow, we introduce applications of quantum Shannon theory to several technologies, such as cipher and radar. In the case of ciphers, to achieve information-theoretic security for symmetric key ciphers, it is necessary to realize a mechanism that can hide not only data but also secret keys by using quantum noise. This can consist of modulation and demodulation technologies based on the above theories (Figure 8). In this section, we will introduce the basic concept.
7.1. New Principle for Ciphers
First of all, we will introduce the application of quantum Shannon theory to the design of ciphers. This is the most exciting application of quantum information science, as it promises to completely outperform conventional functionality in the real world.
There are several no-go theorems in quantum mechanics. Quantum Shannon theory can be considered a theory for designing optimal communication systems that utilize these.
The most important application is to solve the Shannon impossibility theorem, which limits the security of classical cryptography. The Shannon impossibility theorem is given as follows.
Theorem 14( ). The information-theoretic security of a symmetric key cipher is limited by the Shannon entropy of the key.
where X is the message, is the shared secret key, and Y is the ciphertext received by an eavesdropper. The perfect secure cipher is only given by a one-time pad cipher.
The principle proposed to break the above theorem is the KCQ principle (keyed communication in quantum noise). It was disclosed as a white paper in 2000 and made public in 2003 [47].
Principle 1[ ]. The non-orthogonality of a set of signal states can be increased by randomization due to a secret key. One can create a differentiation in reception performance such that the performance of the receiver with a key to randomized quantum states is superior to that of the receiver without the key. That is,
where and C are the error performance of signal detection and the capacity, respectively. The indexes B and E mean Bob and Eve. These inequalities are called advantage creation based on KCQ.
Yuen’s main focus was quantum key distribution (QKD) as an application of his own principle. I introduced Yuen’s concept for lifting the Shannon impossibility theorem of symmetric key ciphers based on the above principle in 2004 (Figure 9) [48]. Since then, our collaborative research has brought various important advances [49,50,51]. In the following, we show a survey of how to apply quantum communication theory to realize a cipher based on KCQ. Its main concept is to use quantum modulation as encryption.
7.2. Optical Quantum Modulation as Encryption Based on the New Principle
In this section, we will explain the model of the encryption–decryption scheme by optical quantum modulation–demodulation and clarify the principle behind the appearance of quantum noise effects to hide the data and the shared secret key. We mainly deal with quantum stream ciphers and quantum block cipher formats.
(a) Quantum stream cipher
The first example of ciphers designed according to this principle is the quantum stream cipher. For detailed diagrams of the structure, see references [49,50,51].
Let us denote the protocol, where the data (plaintext) is a sequence of binary signals:
- (i)The sender prepares a big number of communication bases consisting of two non-orthogonal states (coherent state with high power) such as
where . One of them is selected by using a pseudorandom number generator (PRNG) with a secret key.
- (ii)The sender then transmits binary data using the selected binary communication basis.
where is the random mapping function due to the same PRNG (see [51]).
- (iii)A receiver who has the same pseudorandom number with the key can identify the communication basis, so they always receive binary signals with small error, because the signal amplitude of binary coherent states is large enough.
A receiver who does not have the key must identify the quantum states of that make up the many communication bases, which increases the error. The reason is that the discrimination error of a multi-quantum state is larger than that of a binary quantum state according to quantum detection theory.
The designer of this cipher could use this error to completely erase the information on the secret key and pseudorandom number structure shared by the sender and receiver, optimizing the scheme based on quantum Shannon theory (see reference [51] for the specific protocol and structure).
As a result, the ciphertext for the eavesdropper is completely masked by quantum error, and it becomes possible to break the Shannon limit (Shannon impossible theorem), which determines the limitation of information-theoretic security of symmetric key ciphers. That is,
where is the ciphertext received by the eavesdropper.
Shannon’s focus was only on ciphertext-only attacks, but in modern times, it is necessary to deal with known-plaintext attacks. For this reason, the following evaluation method has been established [52]. The quantitative evaluation of information-theoretic security is evaluated by the unicity distance for known-plaintext attacks.
Definition 11. The unicity distance of known-plaintext attacks for quantum stream ciphers is defined as follows:
It is the minimum value of the ciphertext sequence required for an eavesdropper to be able to guess the key. Then it is given by the following theorem [52].
Theorem 15( ). The lower bound of the generalized unicity distance for KPA is given as follows:
where is the maximum amount of mutual information (accessible information) for the eavesdropper’s measurement from a set of quantum states with (running key sequence) as a variable. can be replaced with Holevo capacity.
However, the original idea did not guarantee sufficient security against known-plaintext attacks. In 2007, we solved that problem. That is, one can attain by generalized randomization [53] (see reference [51] for the detailed concept). In other words, while the one-time pad requires a secret key of the same length as the message, this mechanism can encrypt any message in an information-theoretically secure manner using only a short secret key of a few hundred bits.
As a result, these quantum stream ciphers can provide the encryption system with 100 Gbit/s ∼10 Tbit/s over a communication distance of 1000 km∼10,000 km, guaranteeing information-theoretic security. More details are provided in Section 7.3.
(b) Quantum block cipher
We can consider the block cipher form based on the new principle. Quantum versions of classical block ciphers are called quantum block ciphers. Here we introduce the encryption theory for the quantum block cipher using unitary transformation by Sohma [54,55], which is also a scheme of quantum modulation.
First, let us consider ary coherent state composed by M blocks of coherent stated as follows:
We consider an operator that extends the Heisenberg commutation relation for self-adjoint operators on Hilbert spaces to the Weyl–Segal commutation relation.
Theorem 16. From the Stone–von Neumann theorem, we can construct the following formula. The quantum characteristic function for the class of quantum Gaussian state is given as follows:
where T means transpose and where
and where are the canonical conjugate operators. Then in Equation (111) is called a symplectic matrix, and it is given by
- in Equation (111) is called the unitary operator associated with symplectic transformation , and the following apply:*
- (i) A set of with different elements is prepared.
- (ii) One is selected from the set using a pseudorandom sequence with the secret key .
- (iii) A quantum ciphertext is generated by a unitary transformation associated with the selected as follows.
Here let us denote a vector of complex amplitudes α of coherent state by
then we have the following relation:
As a result, the unitary transformation for the coherent-state sequence is given as follows:
Thus, randomization of complex amplitudes through converts the basic codes of coherent states into quantum ciphertext with arbitrary complex amplitude by . When , the above is called phase randomization.
On the other hand, Bob’s decryption procedure involves applying the inverse of a pseudorandomly selected unitary transformation to the quantum ciphertext by using the same pseudorandom numbers. As a result, a receiver with the key can always receive the basic quantum state code signal before the randomization. For a more detailed explanation, see the study in [56].
7.3. Social Implementation
7.3.1. Development of Transceiver for Quantum Stream Cipher
Finally, we will explain the development status of transceivers for practical use of the above encryption. Figure 10 shows the history of the development of the transceiver. In 2002, a handmade prototype was developed, and communication experiments were successfully conducted at 125 Mbps. Subsequently, with the cooperation of Panasonic and Hitachi, a two-way prototype transceiver was developed. Currently, the development of a commercial transceiver for standard quantum stream ciphers has been completed, and we are now in the phase of improving it to generalized quantum stream ciphers with several randomizations.
On the other hand, P. Kumar and his group at Northwestern University independently developed the world’s first high-performance transceiver of quantum stream ciphers in 2002, 2003, and 2009 [57,58,59]. They also demonstrated it for applications in aircraft communication.
7.3.2. Application to Global Optical Network of 100 Gbit/s of Quantum Stream Cipher
When developing new capabilities in cryptography, it is unacceptable to degrade conventional communication performance. To avoid deterioration of communication characteristics, the application of the coherent state, which is the miraculous quantum state, is essential. The cryptographic techniques introduced in this section do not degrade communication characteristics.
This encryption device can be operated without changing the structure of the current global optical network system with optical amplifier repeaters. In other words, encryption of ultra-high-speed optical communications can be completed simply by replacing the current optical transceivers with quantum stream transceivers.
Key distribution for the upper layers of the network utilizes quantum-resistant or PQC, and after key sharing, it can be instantly replaced with a new key using a quantum stream cipher. Thus, we recommend that global network applications in the real world follow the scheme shown in Figure 11. That is, the best solution for defense against today’s urgent cyber attacks is to implement a combined scheme of the quantum-resistant public key cryptography as recommended by the NIST and quantum stream ciphers to protect data [60,61]. Much research and development are being carried out towards these goals [62,63,64,65,66].
On the other hand, the threat of eavesdropping from undersea cables will become extremely important. This is because in undersea communication networks that are approximately 10,000 km long, signals are transmitted from light to light via optical amplifiers. Furthermore, technology has been established to eavesdrop on optical signals from optical amplifier repeaters.
The quantum stream cipher is expected to be a technology that can address this issue, and preliminary experiments have already been carried out [67]. In such systems, keys are pre-installed, and no key distribution is required.
7.3.3. Business for Quantum Stream Cipher Service
Commercialization of quantum stream cipher transceivers and their application systems began in the United States in 2025. Services that meet customer needs will be available within a few years.
8. Sensor Applications Beyond the Standard Quantum Limit
8.1. Bell State Based on Entangled Coherent State
In general, the Bell states consist of four orthogonal states. However, the four entangled states based on non-orthogonal states are called quasi-Bell states. As a specific example, we present the following states based on coherent states:
where are constants for normalization, i.e., , ; ; and .
Some of these quasi-Bell states are not orthogonal to each other. Here, if , then their Gram matrix becomes very simple, as follows:
where . Let us employ the entanglement of formation as the degree of entanglement. The degrees of entanglement for quasi-Bell states becomes
where , and we have the special property of [68]. Thus and have the perfect entanglement.
These physical properties and the mechanisms of teleportation were elucidated by S.J. van Enk [69].
8.2. Quantum Reading Scheme
The term of quantum reading as a sensor was pioneered by Pirandola [70] (Appendix B). We will employ the PSK scheme such that the memory on the classical disk consists of the flat and concave areas, which correspond to “ ” and “ ”, respectively. This type of memory was invented in the beginning of the 1980s, employing a gas laser with high coherence as the light source. Later, its scheme was replaced with a laser diode with high coherence in which the decision scheme consists of the effect of reflection wave injection in laser diodes [71].
For the quantum version of such kind of sensor, one can describe the phase shift by an unitary operator as follows:
where a and are the annihilation and creation operators for bosonic systems, respectively. Here, the phase factor is set to between 0 and 2 .
The reading method in the current PSK scheme is to illuminate the laser light (coherent state) on a disk and to read the phase difference. That is, the signal states of light to discriminate are
where is the identity operator.
8.3. Error-Free Sensor Applicable to Reaction Control
Let us introduce an application of the quasi-Bell state in reaction control systems [72]. Here we restrict the problem to binary detection. So detection targets are two quantum states. Since the phase difference will be , the problem is to read the phase shift from the steady state or input state.
The coherent states are prepared in the current classical memory, and the target signal model becomes
If one employs a conventional homodyne receiver to discriminate the above quantum states, the limitation is imposed by the so-called quantum shot noise limit. This is the standard quantum limit in modern information technology.
However, in general, it is reasonable to employ a quantum optimum receiver in order to overcome the standard quantum measurement which is achieved by a homodyne receiver. In this case, from Equation (14), the limitation depends on the inner product between the two quantum states. For example, the inner product of two coherent states is
Let us assume that the of quasi-Bell states in Equation (118) is employed as the light source and the B mode illuminates a memory disk. The reflection effect operates in B mode, so the channel model is
Then, the target states become
Thus, the input state is changed to . The inner product between the above two entangled coherent states in the quasi-Bell state is
That is, the inner product becomes zero, and it is independent of the energy of the light source. This is a property of the quasi-Bell state. To check this special property, one can examine the different phase shift as follows:
where .
Finally, let us employ the quantum optimum receiver for the binary pure state of two modes. The ultimate detection performance of systems with the coherent state and homodyne, coherent state and quantum receiver, and quasi-Bell state and quantum receiver are given as
respectively. Thus, one can see that the property of the quasi-Bell state provides an attractive improvement, and this property can be obtained only by the combination of the nonclassical state and the quantum optimum receiver. Although there are many nonclassical states, almost all are not changed from the input state to the orthogonal state to the original input state just by reflection. So we interpret that such effectiveness of the quasi-Bell state comes from the special phenomena of entanglement based on non-orthogonal states, which is a feature of quasi-Bell states.
9. Conclusions
This paper explains the development of quantum communication theory and its link to Shannon information transmission theory. In particular, it shows how quantum gain appears in the performance of functions such as communications designed using these theories. While quantum applications are currently being actively discussed, new mechanisms that sacrifice some of the conventional performance when applying quantum properties cannot be applied to real-world situations. In other words, quantum communication using qubits, etc., is not suitable for social implementation, because such schemes cannot maintain the current communication capabilities. In addition, we explained that improving delay characteristics (latency) is essential to communication systems in the 21st century. Therefore, it is essential to advantageously apply quantum effects to modern optical communications, which have the highest performance for communication functions. The theory introduced here has been shown to be compatible with such system design. In particular, the quantum stream cipher, as shown in Figure 12, when compared with conventional technology, is expected to provide new possibilities.
Finally, Appendix B, Appendix C and Appendix D discuss paths for future development. In conclusion, readers of this paper will be able to understand the structure of the design principles of ultra-high-speed communication, quantum stream ciphers, and quantum sensors.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Helstrom C.W. Quantum Detection and Estimation Theory Academic Press Cambridge, MA, USA 1976
- 2Holevo A.S. Statistical decision theory for quantum systems J. Multivar. Anal.1973333710.1016/0047-259X(73)90028-6 · doi ↗
- 3Yuen H.P. Kennedy R.S. Lax M. Optimum testing of multiple hypotheses in quantum detection theory IEEE Trans. Inf. Theory 19752112513410.1109/TIT.1975.1055351 · doi ↗
- 4Hausladen P. Jozsa R. Schumacher B. Westmoreland M. Wootters W. Classical information capacity of a quantum channel Phys. Rev. A 199654186910.1103/Phys Rev A.54.18699913673 · doi ↗ · pubmed ↗
- 5Holevo A.S. The capacity of the quantum channel with general signal state IEEE Trans. Inf. Theory 1998126910.1109/18.651037 · doi ↗
- 6Schumacher B. Westmoreland M. Sending classical information via noisy quantum channels Physical. Rev. A 19975613110.1103/Phys Rev A.56.131 · doi ↗
- 7Gallager R.G. Information Theory and Reliable Communication Wileys Hoboken, NJ, USA 1968
- 8Burnashev M.V. Holevo A.S. On reliability function of quantum communication channel Probl. Peredachi Inform.198834113
