Approaching the ultimate capacity limit in deep-space optical communication
Konrad Banaszek, Ludwig Kunz, Marcin Jarzyna, Micha{\l}, Jachura

TL;DR
This paper explores how combining emerging photonic technologies can approach the quantum-limited capacity of deep-space optical communication links, enabling near-optimal data rates in photon-starved conditions.
Contribution
It proposes a theoretical framework integrating quantum pulse gating, photon-number-resolved detection, and high-order modulation to nearly reach the quantum capacity limit in optical communication.
Findings
Theoretical demonstration of near-quantum-limited capacity scaling.
Identification of key photonic technologies enabling this limit.
Potential for significantly improved deep-space communication rates.
Abstract
The information capacity of an optical channel under power constraints is ultimately limited by the quantum nature of transmitted signals. We discuss currently available and emerging photonic technologies whose combination can be shown theoretically to enable nearly quantum-limited operation of a noisy optical communication link in the photon-starved regime, with the information rate scaling linearly in the detected signal power. The key ingredients are quantum pulse gating to facilitate mode selectivity, photon-number-resolved direct detection, and a photon-efficient high-order modulation format such as pulse position modulation, frequency shift keying, or binary phase shift keyed Hadamard words decoded optically using structured receivers.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\authorinfo
Further author information: (Send correspondence to K.B.)
K.B.: E-mail: [email protected]
Approaching the ultimate capacity limit in deep-space optical communication
Konrad Banaszek
Centre for Quantum Optical Technologies, University of Warsaw, Banacha 2c, 02-097 Warszawa, Poland
Faculty of Physics, University of Warsaw, Pasteura 5, 02-093 Warszawa, Poland
Ludwig Kunz
Centre for Quantum Optical Technologies, University of Warsaw, Banacha 2c, 02-097 Warszawa, Poland
Faculty of Physics, University of Warsaw, Pasteura 5, 02-093 Warszawa, Poland
Marcin Jarzyna
Centre for Quantum Optical Technologies, University of Warsaw, Banacha 2c, 02-097 Warszawa, Poland
Michał Jachura
Centre for Quantum Optical Technologies, University of Warsaw, Banacha 2c, 02-097 Warszawa, Poland
Faculty of Physics, University of Warsaw, Pasteura 5, 02-093 Warszawa, Poland
Abstract
The information capacity of an optical channel under power constraints is ultimately limited by the quantum nature of transmitted signals. We discuss currently available and emerging photonic technologies whose combination can be shown theoretically to enable nearly quantum-limited operation of a noisy optical communication link in the photon-starved regime, with the information rate scaling linearly in the detected signal power. The key ingredients are quantum pulse gating to facilitate mode selectivity, photon-number-resolved direct detection, and a photon-efficient high-order modulation format such as pulse position modulation, frequency shift keying, or binary phase shift keyed Hadamard words decoded optically using structured receivers.
keywords:
Photon-starved communication, photon information efficiency, additive Gaussian noise, Holevo’s bound, noise rejection, matched filter, photon counting
1 INTRODUCTION
Quantum mechanics defines the ultimate capacity limits of optical communication links.[1] In recent years, a good deal of effort has been put into devising schemes that would attain this limit in the photon-starved regime, when the propagating optical signals experience substantial attenuation bringing the number of detected photons per slot to much less than one on average.[2, 3, 4, 5, 6, 7] Such a regime is routinely encountered in implemented and planned optical links for transferring data collected by scientific instruments onboard deep-space missions.[8, 9, 10, 11]
In this contribution, we address the role of excess optical noise, which adds a random fluctuating component to the complex amplitude of the transmitted signal. Such noise originates e.g. from stray light scattering in a free-space optical channel. We point out that the excess noise defines the ultimate quantum mechanical limit on the photon information efficiency attainable in deep-space optical communication systems. Interestingly, this bound has a very weak, logarithmic dependence on the excess noise power spectral density. We discuss assumptions underlying its derivation and identify emerging photonic technologies that could make it relevant to the operation of actual deep-space optical communication links.
2 CAPACITY LIMITS
Throughout this paper we will assume that the time axis is divided into discrete slots. The duration of a single slot is defined by the inverse of the bandwidth of the optical channel, . A convenient figure of merit is the average optical energy detected in a single slot, expressed in the units of the energy of a single photon at the carrier frequency . Here is Planck’s constant. The resulting quantity is the average detected photon number per slot
[TABLE]
where is the trasmitter power and is a linear factor characterizing the overall channel transmission. For notational simplicity we will take to incorporate also the efficiency of the detection system. If in the course of propagation the signal acquires a stochastic component generated by broadband Gaussian noise with power spectral density , its contribution can be quantified using the average number of background photons detected per slot,
[TABLE]
The information capacity of an optical channel is typically analyzed under the assumption of a fixed average power. If both field quadratures are utilized to encode information and are read out via shot-noise limited heterodyne detection, the capacity per slot takes the form
[TABLE]
and the optimal constellation in the complex amplitude plane is given by a phase-invariant two-dimensional Gaussian distribution centered at zero. When information is encoded only in one field quadrature and retrieved using a homodyne receiver operating at the shot-noise limit, the capacity per slot reads
[TABLE]
In this case the optimal constellation is one-dimensional with the Gaussian distribution for the detected quadrature and the conjugate quadrature set to zero.
Expressions given in Eq. (3) and Eq. (4) follow from the Shannon-Hartley theorem. The fractions appearing in the argument of the logarithm have the straightforward interpretation of the signal-to-noise ratio. In each case the noise term in the denominator is a sum of two contributions. The first one, equal to one owing to the choice of units made here, comes from the detection process assumed to operate at the shot noise limit.[12, 13] The second contribution stems from the excess noise added to the signal propagating through the channel. The shot noise defines natural units for the signal and the background noise strength used in definitions (1) and (2).
The ultimate quantum mechanical capacity[14] is derived under the assumption that at the output of the channel one can implement the most general measurement permitted by quantum theory. We will refer to this quantity as the Holevo capacity. The explicit expression for the Holevo capacity in the case of a noisy channel reads:[1]
[TABLE]
The above expression has a different mathematical form compared to Eqs. (3) and (4). Somewhat bafflingly from the perspective of the classical theory, the signal-to-noise ratio term can no longer be identified in a straightforward manner.
In the following, it will be convenient to switch to photon information efficiency (PIE) defined by as a performance measure. PIE provides the proportionality factor between the attainable information rate and the detected photon flux. Fig. 1 depicts PIE in the photon-starved regime for several values of the background noise in the range . A dramatic difference is seen in the asymptotic limit between the conventional coherent (heterodyne or homodyne) detection on one hand and the Holevo bound on the other hand. Analytical expressions for the asymptotic values of PIE can be easily obtained by expanding respective capacities up to the linear term in , which yields:
[TABLE]
For heterodyne and homodyne detection, PIE is upper bounded respectively by and of information. Furthermore, as long as the background noise plays a negligible role, being dominated by the detection shot noise which remains at a constant level. In contrast, the Holevo bound for PIE depends dramatically on the background noise strength. In the regime it exhibits a simple logarithmic dependence . For example, sets the effective upper bound on PIE slightly at less than 10 bits per photon. Based on Fig. 1 one can infer a simple rule of thumb that the asymptotic value of the Holevo PIE bound is reached when the signal strength becomes lower than the background noise strength, i.e. .
3 Noise rejection
It is illuminating to discuss in more detail the noise model underlying the capacity limits presented in the preceding section. The optical field emerging from a noisy channel, shown schematically in Fig. 2(a), carries information in the complex amplitude of a certain normalized signal mode (waveform) characterized by its envelope . Consider broadband additive Gaussian excess noise, visualized in Fig. 2(a) as a semi-transparent strip overlaid on the signal waveform. The noise can be decomposed in an basis orthonormal of modes shown in Fig. 2(a), chosen such that the first mode is identical with the signal mode. Thus the signal mode carries the modulated amplitude plus the noise contribution, while all the other modes contain only the background noise. A single realization of the field can be written as with the envelope given by a sum
[TABLE]
We will choose the units for the complex amplitudes multiplying mode functions such that their squared absolute values specify the average numbers of photons in individual modes. For broadband noise, the stochastic amplitudes are characterized by the same phase-invariant Gaussian distribution with a zero mean, the second moment equal to
[TABLE]
and lack of correlations, for .
Expressions for channel capacities presented in Eqs. (3)–(5) are derived assuming that and that only the noise present in the signal mode is taken into account, while contributions from modes are rejected. This assumption is naturally satisfied for coherent detection, which measures only the amplitude of a specific waveform matching the temporal shape of the local oscillator.[15] Alternatively, the detected mode can be defined in digital signal postprocessing if the detector electronic output is sampled at a sufficiently high rate. On the other hand, the applicability of such a model to the direct detection scenario requires a more careful inspection.
According to the photodetection theory,[16] the photocount statistics is a function of the optical power time-integrated over the detection window. Clearly, with given by Eq. (7) such an integral will contain contributions not only from the mode , but also from other modes . In principle, the contribution to the optical field from the signal mode in Eq. (7) can be separated using the matched filter[17] such that the convolution
[TABLE]
yields at time . Optically, convolution in the temporal domain can be realized by sending the signal through a spectral filter with the amplitude transfer function given by the Fourier transform of . Detecting subsequently the output field over a short time window centered at would provide response dependent only on the amplitude of the signal mode. However, the basic drawback of this scheme is the loss of the signal power. This can be seen most easily by inspecting the effective power transfer coefficients that specify how much optical power from individual modes contributes to the detection window of duration :
[TABLE]
Fig. 2(b) shows the coefficients assuming Gauss-Hermite modes and unit peak transmission of the spectral filter. It is seen that while the selectivity improves with shorter time windows , the efficiency of selecting the signal component drops significantly below . Importantly, even for long time windows the power transfer is limited to slightly below . The resulting trade-off between the efficiency and the selectivity is depicted in Fig. 2(c).
The capacity limits defined in Eqs. (3)–(5) involve an absolute, input-independent scale for the signal and noise strengths. This scale is defined by the detector shot-noise level in the case of coherent detection, or more fundamentally by the granular nature of electromagnetic radiation taken into account in the Holevo bound. Consequently, to approach the capacity limit it would be necessary to pick up the signal mode with 100% efficiency and 100% selectivity.
Deficiencies of conventional spectral filtering described above are in principle absent in the recently introduced technique of quantum pulse gating [18, 19]. The basic idea is to select from a composite field, such as the one given in Eq. (7), a single temporal mode with efficiency in a way that does not alter its quantum statistical properties. Such a quantum pulse gate (QPG) can be implemented using three-wave mixing in a nonlinear medium with carefully engineered phase matching properties. The composite field is sent into the medium along with an auxiliary pulse so that sum-frequency generation takes place only between the mode of interest and the auxiliary pulse mode. At the output of the medium the frequency-upconverted mode is separated spectrally, using e.g. a dichroic mirror, from the remaining field. In a proof-of-principle demonstration with near-infrared ultrashort sub-picosecond pulses,[20] conversion efficiency exceeding for the Gauss-Hermite signal mode has been achieved with less than power transferred for the first-order mismatched mode . While theoretical limits on the conversion efficiency in a single-stage QPG have been indicated, the coherent nature of the sum-frequency generation process allows one to overcome them by by cascading two or more QPGs with individual conversion efficiencies lower than one.[21, 22] It is worth to mention also other functionality of quantum pulse gating, such as changing the bandwidth of the gated signal[23] and converting the carrier frequency to the spectral range where more efficient photon counting is available.
4 Pulse position modulation
The standard strategy to achieve high PIE in photon-starved optical communication is to employ the pulse position modulation (PPM) format with direct detection. Photon counting as a detection technique goes beyond the measurement of field quadratures, which makes void capacity limits resulting from the application of the Shannon-Hartley theorem to readout based on conventional coherent detection.
In a typical ground receiver setup for dowlink transmission the photon counting detector responds to radiation present in multiple temporal and spatial modes. In the temporal degree of freedom, the effective number of modes is roughly given the product of the slot duration and the bandwidth of the spectral filter used to suppress the background noise outside the signal spectrum.[24] In the spatial degree of freedom, atmospheric turbulence distorts the signal wavefront well beyond the diffraction limit. The distorted wavefront needs to be accommodated with a sufficiently large active area of the detector that consequently becomes sensitive to background radiation present in spatial modes other than the signal one. When background counts are generated by noise contributions from a large number of weakly excited modes, their statistics can be modelled by a Poissonian distribution. The PPM limits for such a multimode noise model, assuming a photon counting detector operated in the Geiger mode, i.e. providing a binary response whether at least one photon has been detected in a given slot or none at all, have been recently analyzed by Zwoliński at al.[25] The basic conclusion is that for a fixed background noise level one can in principle achieve non-zero photon information efficiency in the limit of a diminishing signal strength. The prerequisites are complete decoding to recover information from all sequences of photocounts that may occur within individual PPM frames and sufficient concentration of optical energy in signal pulses, which should contain between and photon at the detection stage for noise strengths ranging from to background counts per slot.[26]
Obviously, the multimode noise model cannot be directly compared with the Holevo capacity bound. From the theoretical viewpoint, background noise present in modes other than the signal mode can be rejected without affecting the signal itself and therefore it does not fundamentally limit the link capacity. Although achieving in practice single-mode direct detection of the signal waveform would be challenging, it may no longer seem totally outlandish on the second thought. Adaptive optics technology enables compensation of wavefront distortion effects and could facilitate delivery of the received signal to the detection stage in a single spatial mode. Further, as discussed in Sec. 3, the technique of quantum pulse gating provides in principle means to pick up the temporal signal mode with nearly 100% efficiency and very good selectivity.
Given the above motivation, it is interesting to examine the efficiency of a PPM link assuming the single-mode model for the background noise. In this scenario, the probabilities of registering photocounts on a photon number resolving detector respectively for an empty slot “0” and a pulse “1” are given by:[27]
[TABLE]
Here is the PPM order and denotes the th Laguerre polynomial. For a given average detected signal photon number and the background noise strength optimization has been carried out over the PPM order taking as the cost function a lower bound on the PPM efficiency based on relative entropy. Two types of direct detection have been considered: photon number resolved (PNR) detection, which gives the actual photocount number , and Geiger-mode detection, which discriminates only between and .
Fig. 3(a) depicts results of optimization for background noise strengths , and . The most interesting feature is that for PNR detection the gap between the optimized PPM photon information efficiency and the Holevo bound closes with diminishing signal strength. Indeed, it can be shown mathematically that the capacity per unit cost for PPM with PNR detection approaches the Holevo limit.[28] Photon number resolving capability is essential here: it is seen that for Geiger-mode detection the attainable PIE stays below the Holevo bound. This result is in a stark contrast with the noiseless () scenario, where a double-logarithmic gap opens up with the diminishing signal strength between optimized PPM PIE and the Holevo bound.[29, 30, 31]
5 Outlook
Achieving information efficiency identified in the preceding section requires implementation of the PPM format with very high orders, as shown in Fig. 3(b). Basically, the pulses should be bright enough to generate several photocounts on the detector to facilitate robust discrimination against background counts. This detection regime resembles that employed recently in low-intensity LiDAR technology,[32] where the light intensity in the few-photon range also needs to be measured within a short time window. Another prerequisite to approach the Holevo limit is soft-decoding of the receiver output with efficiency attaining the Shannon information limit, which is implicitly assumed in the relative-entropy bound used in numerical calculations.
The principal technological challenge on the transmitter side to produce a high-order PPM signal is the high peak-to-average power ratio of the laser source. This obstacle could be removed by switching to one of other high-order modulation formats visualized using time-frequency diagrams presented in Fig. 4 that are capable of delivering equivalent information efficiency.[33] In the case of the PPM format depicted in Fig. 4(a), the modulation bandwidth defines the length of a single slot in the time domain as . Therefore one -ary PPM symbol has duration , covers the area (time) (frequency), and carries optical energy measured at the detection stage. As illustrated in Fig. 4(b), the same time-frequency area can be sliced in the spectral domain into frequency slots, each of spectral width and temporal duration , which yields the frequency shift keying (FSK) modulation format. The optical energy of each symbol is now spread evenly across the entire time frame. In the context of free-space optical communication, scalable FSK modulation has been demonstrated by selecting individual spectral lines from a frequency comb using a sequence of electrooptically controlled Mach-Zehnder interferometric filters.[34] On the other hand, the readout of FSK symbols would require a high-resolution, low-loss spectrometer followed by an array of photon couting detectors.
Yet another option to prepare symbols covering the time-frequency area with a uniform distribution of instantaneous power is to generate sequences of binary phase shift keyed (BPSK) pulses in consecutive time slots, as shown in Fig. 4(c). When signs for BPSK sequences are taken as rows of Hadamard matrices, it is possible to convert all-optically received pulse trains into the PPM format using structured receivers that implement optical interference between multiple time slots.[3] Importantly, scalable designs for structured optical receivers have been presented.[35] A major advantage of the modulation format based on such BPSK Hadamard words is that pulse sequences could be generated by a transmitter based on standard telecom components. Furthermore, the modulation order is software-defined in the encoding layer of the communication system and changing it does not require any hardware adjustments. The main challenge of implementing this modulation format would be the construction of a reliable structured optical receiver. Elements of the required technology are shared with other setups for free-space optical communication, such as links utilizing the differential phase shift keying (DPSK) format[36] and quantum key distribution based on time-bin qubit encoding.[37] Looking at less mature technologies, the BPSK signal might be a good candidate to implement noise rejection using the quantum pulse gating technique, provided that temporal synchronization with the receiver setup can be established.
Acknowledgements.
We acknowledge insightful discussions with C. Antonelli, H. Eisenberg, A. Mecozzi, and M. Shtaif. This work is part of the project “Quantum Optical Communication Systems” carried out within the TEAM programme of the Foundation for Polish Science co-financed by the European Union under the European Regional Development Fund.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Giovannetti, V., García-Patrón, R., Cerf, N. J., and Holevo, A. S., “Ultimate classical communication rates of quantum optical channels,” Nature Photon. 8 , 796–800 (2014).
- 2[2] Erkmen, B. I., Moision, B. E., and Birnbaum, K. M., “A review of the information capacity of single-mode free-space optical communication,” in [ Free-Space Laser Communication Technologies XXII ], Hemmati, H., ed., 7587 , 75870 N, SPIE (2010).
- 3[3] Guha, S., “Structured optical receivers to attain superadditive capacity and the Holevo limit,” Phys. Rev. Lett. 106 (24), 240502 (2011).
- 4[4] Erkmen, B. I., Moision, B. E., Dolinar, S. J., Birnbaum, K. M., and Divsalar, D., “On approaching the ultimate limits of communication using a photon-counting detector,” in [ Free-Space Laser Communication Technologies XXIV ], 8246 , 824605, SPIE (2012).
- 5[5] Takeoka, M. and Guha, S., “Capacity of optical communication in loss and noise with general quantum gaussian receivers,” Phys. Rev. A 89 (4), 042309 (2014).
- 6[6] Rosati, M., Mari, A., and Giovannetti, V., “Multiphase Hadamard receivers for classical communication on lossy bosonic channels,” Phys. Rev. A 94 (6), 062325 (2016).
- 7[7] Chung, H. W., Guha, S., and Zheng, L., “Capacity of optical communications over a lossy bosonic channel with a receiver employing the most general coherent electro-optic feedback control,” Phys. Rev. A 96 (1), 012320 (2017).
- 8[8] Hemmati, H., Biswas, A., and Djordjevic, I. B., “Deep-space optical communications: Future perspectives and applications,” Proc. IEEE 99 (11), 2020–2039 (2011).
