Fundamental limit of resolving two point sources limited by an arbitrary point spread function
Ronan Kerviche, Saikat Guha, Amit Ashok

TL;DR
This paper extends the analysis of the fundamental limits of resolving two point sources beyond Gaussian PSFs to arbitrary PSFs, revealing optimal measurement modes and challenging traditional resolution limits.
Contribution
It generalizes the Fisher Information analysis to arbitrary PSFs and identifies the optimal measurement modes for superresolution imaging.
Findings
Optimal modes are sinc-Bessel for hard apertures.
Resolution limit is not constrained by Rayleigh criterion.
Exact minimum mean squared error calculations support the analysis.
Abstract
Estimating the angular separation between two incoherently radiating monochromatic point sources is a canonical toy problem to quantify spatial resolution in imaging. In recent work, Tsang {\em et al.} showed, using a Fisher Information analysis, that Rayleigh's resolution limit is just an artifact of the conventional wisdom of intensity measurement in the image plane. They showed that the optimal sensitivity of estimating the angle is only a function of the total photons collected during the camera's integration time but entirely independent of the angular separation itself no matter how small it is, and found the information-optimal mode basis, intensity detection in which achieves the aforesaid performance. We extend the above analysis, which was done for a Gaussian point spread function (PSF) to a hard-aperture pupil proving the information optimality of image-plane sinc-Bessel…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Fundamental limit of resolving two point sources limited by an arbitrary point spread function
Ronan Kerviche
University of Arizona
Email: [email protected]
Saikat Guha
Raytheon BBN Technologies
Email: [email protected]
Amit Ashok This work was supported by the DARPA REVEAL program under contract number HR0011-16-C-0026. While preparing this paper we became aware of related work [9], which has some overlap with results presented in this paper. All the detailed proofs are relegated to an Appendix at the end of the paper. University of Arizona
Email: [email protected]
Abstract
Estimating the angular separation between two incoherently radiating monochromatic point sources is a canonical toy problem to quantify spatial resolution in imaging. In recent work, Tsang et al. showed, using a Fisher Information analysis, that Rayleigh’s resolution limit is just an artifact of the conventional wisdom of intensity measurement in the image plane. They showed that the optimal sensitivity of estimating the angle is only a function of the total photons collected during the camera’s integration time but entirely independent of the angular separation itself no matter how small it is, and found the information-optimal mode basis, intensity detection in which achieves the aforesaid performance. We extend the above analysis, which was done for a Gaussian point spread function (PSF) to a hard-aperture pupil proving the information optimality of image-plane sinc-Bessel modes, and generalize the result further to an arbitrary PSF. We obtain new counterintuitive insights on energy vs. information content in spatial modes, and extend the Fisher Information analysis to exact calculations of minimum mean squared error, both for Gaussian and hard aperture pupils.
I Introduction and Background
Consider estimating the angular separation between two incoherently-radiating -wavelength quasi monochromatic point sources in the far field that are symmetrically disposed about the line of sight. The aperture of the camera has diameter , and during the integration time the total mean photon number collected is denoted . A conventional camera uses a lens in the plane of the aperture pupil to focus the image in an image plane, and detects the image-plane intensity pattern using a detector pixel array. The field amplitude in the image plane is an aperture-blurred version of the true object profile, i.e., a scaled version of convolution of the object-plane field (two independent delta functions for the above problem) with the amplitude spread function (ASF) of the camera’s aperture. It is well known that no matter what is, the minimum mean squared error (MMSE) of estimating can be made arbitrarily small by using a long enough exposure, i.e., by taking . Rayleigh showed that, for a conventional camera that measures the intensity in the image plane, even if that intensity measurement is done with infinitely many infinitesimally-tiny shot-noise-limited detector pixels, when decreases below [1], the mean squared error (MSE) of estimating drastically degrades (increases) for the integration time (hence ) held fixed. In a recent breakthrough result, Tsang et al. showed that, assuming a Gaussian ASF, intensity measurement in the infinite Hermite-Gauss (HG) basis in the image-plane coordinates attains a Fisher Information that is independent of no matter how small is , and equals the high- MSE attained by conventional image-plane direct detection [5]. They also showed that the quantum Fisher Information (QFI) for estimating —which makes no assumptions on how the optical field collected by the aperture gets pre-processed and detected—equals , establishing that a linear spatial mode sorting prior to detection, which separates mutually-orthogonal HG modes and detects each with individual detector pixels, is an optimal detector for this problem. This showed that Rayleigh’s criterion is an artifact of the conventional philosophy of intensity measurement in the image plane. There is rich information content in the phase of the image to extract, which optimally using a shot-noise-limited intensity measurement, one must use a non-trivial spatial-mode transformation to the aperture field prior to detection to manipulate the post-detection shot noise so as to maximize the information content about in the noisy detection outcomes.
This result opens up a variety of interesting questions, some important ones being: (a) what is the information-optimal mode basis for the two-point-source problem with a hard-aperture pupil (sinc ASF) and for other general ASFs, (b) what is the right minimal set of modes that carry almost all the relevant information about a passive imaging problem, (c) what is the actual advantage in MSE (Fisher Information provides a lower bound on the MMSE, via the Cramer Rao lower bound (CRLB), which is not always achievable), and (d) how does this theory generalize to more complex imaging problems, and to broadband light.
In this paper, we focus on the two-point-source problem described above. Our contributions are summarized below:
1. With a rectangular hard aperture, i.e. sinc ASF, we show that measuring in the sinc-Bessel (SB) mode basis achieves the QFI and the Fisher Information is independent of , analogous to measuring in the HG mode basis with a Gaussian aperture.
2. We evaluate the exact MMSE of estimating with the optimal mode basis and compare with the CRLB.
3. We illustrate a counterintuitive distribution of energy vs. information in the individual modes of the optimal mode basis, which provide insights on efficient measurement design. In particular we find that if one were to extract and detect a single mode and its orthogonal complement (e.g., the binary SPADE of [5]), the mode is optimal rather than the mode, in the deep sub-Rayleigh limit. We discuss information loss due to leaky mode separation.
4. We provide the optimal binary SPADE measurement for a general ASF which attains the QFI in the low limit, and new insights into constructing information optimal mode bases.
II Optimal modes for hard aperture
Let be the (generally complex-valued) energy-normalized Amplitude Spread Function (ASF) of the aperture, i.e., . The image plane field is an incoherent sum of two symmetrically-shifted copies of the ASF at , each of which is perfectly self coherent. If one projects the image plane field onto the complex-valued spatial mode , , the fraction of the intensity in the image-plane field that appears in the mode is given by:
[TABLE]
where denotes complex conjugate. We will call the measurement function for mode . In [5], the authors showed that if is Gaussian, then projecting the image-plane field simultaneously onto the infinite Hermite Gauss (HG) orthonormal mode basis functions , , attains a vector of measurements whose Fisher Information content on is quantum optimal, and independent of . We will develop a similar strategy but for the practically relevant case of a space-limited (hard) aperture. This produces a cardinal sine () ASF, i.e., .
In order to match the energy distribution of the PSF, we will choose an orthonormal basis for which the ASF is the first basis function. This is the case of the Spherical Bessel Functions of the First Kind, , which are all either even or odd. Note that . It is simple to verify that . We will introduce a spatial scale factor of dimensions of length in the image plane coordinate () to capture the actual ‘length’ of the ASF in the image plane. It will depend upon the diameter of the aperture and the focal length of the imaging system. The fraction of the total energy collected in the mode is given by:
[TABLE]
From [8], equation 1.10.50 :
[TABLE]
shows that the sinc Bessel (SB) modes capture all the energy in the image-plane field. With being the total mean photon number collected over the camera’s integration time, the number of photons in the SB mode is .
Each of separated SB modes is detected using a shot-noise-limited detector. The total number of orthogonal temporal modes in the collected field , where is the integration time and is the bandwidth of the light around its center frequency. The number of photons per temporal mode at optical frequencies. . It is simple to show that with and , and with the photon statistics in the individual modes being distributed with the thermal (geometric, or Bose Einstein) distribution, that the total number of photons in the spatial mode is Poisson distributed with mean . In order to calculate the Fisher Information of in the SB mode, we will rely on the following result:
Lemma 1** (Fisher Information For A Poisson Corrupted Process)**
Let be a function with values in mapping the variable of interest to a measurement where is the Poisson distribution. Then the Fisher Information associated to the process can be written as:
[TABLE]
This expression is similar to that obtained for a Gaussian-corrupted measurement process, , where is the sensitivity and is the noise variance. The Fisher Information in the SB mode evaluates to:
[TABLE]
Lemma 2** (Series of Spherical Bessel Function Of The First Kind)**
We have the following result on a series of Spherical Bessel Function of the First Kind on any finite interval of and containing [math] (see Appendix for proof):
[TABLE]
The measurement outputs on any orthogonal mode set are statistically independent Poisson random variables. So, the total Fisher Information for the vector-parametrized estimator is equal to the sum of the individual Fisher Informations from each mode. Using Lemma 2, we deduce that measuring all the SB modes leads to a -independent Fisher Information, i.e.,
[TABLE]
In [5], it was shown that in (8) is the QFI for estimating with a hard aperture. Hence, we now have a proof that a specific SB mode sorting prior to direct detection achieves the QFI, in the sense that the classical Fisher Information of the SB-mode measurement exactly matches the QFI.
III Energy vs. Information Content in Modes
With the Gaussian ASF , the measurement function and Fisher information in the image-plane HG mode, , are respectively given by [5]:
[TABLE]
The total Fisher information from measuring all the (infinitely many) HG modes equals the QFI bound for any , i.e. [5],
[TABLE]
The Fisher Information attained by infinite-spatial-resolution image-plane direct detection is given by,
[TABLE]
where is the normalized spatial distribution of energy in the image plane, also the probability density function of measuring a photon at spatial position , conditioned on . approaches the QFI in (11) for , but goes to zero as . So, the information advantage of the HG mode measurement over image-plane direct detection is maximum at small (sub Rayleigh regime) [5].
Comparing (9) and (10) with (2) and (6), we see several analogous trends. First, measuring all the modes (HG or SB, respectively) in either case captures all the image-plane energy for any . Hence, any spatial mode orthogonal to the span of the respective mode sets would neither have any energy nor any information content. Further, in both cases the mode captures all of the energy at . Finally, as noted in [6], only the HG mode contributes to the total Fisher information at low for the Gaussian aperture case. With a hard aperture, the SB mode has that exact same property.
In what follows, we will consider information contributions from individual modes and in sets of modes (without resolving modes in the set). We first consider the following lemma:
Lemma 3** (Fisher Information Inequality On Aggregated Measurements)**
Consider measurement functions corresponding to an orthonormal family of modes. Let us say we make an aggregated measurement where we project the image plane field on to a collection of modes , i.e., an effective measurement function . This measurement cannot give us more information than the sum of the information in the individual modes in the set, i.e.,
[TABLE]
We now want to find a single mode whose information content does not go to zero (i.e., goes to a constant ) as . To see the requirement on , let us consider the following:
Proposition 4** (Insensitivity Property)**
Given any properly normalized and continuous mode over and any ASF , the first order derivative of the associated measurement function goes to [math] as .
Hence, in order for the information content in mode , to go to a constant as , the following equivalence relation must be satisfied:
[TABLE]
For the above to hold, it is necessary (but not sufficient in general) for as . Thus, should neither have sensitivity nor should it capture any energy at . Yet, if (14) is satisfied, measuring will produce non zero information for . Note that the insensitivity property applies to in (12) for the direct measurement as well: it equals [math] regardless of the ASF. So an image-plane direct measurement provides no information about when .
A binary SPADE (Bin-SPADE) receiver measures a single mode and its orthogonal component (i.e., the leftover energy) leading to a simple implementation [5]. For both Gaussian and hard apertures, BinSPADE receivers constructed for being the respective mode (the ASF mode) and one with the mode ( HG or SB mode, respectively) attain a non-zero information at , which equals the respective QFI limit. We will refer to these two receivers as 0-BinSPADE and 1-BinSPADE, respectively. The Fisher Information for these two measurements for the Gaussian ASF are given by:
[TABLE]
whereas, for sinc ASF (and SB modes), the respective Fisher Informations are given by:
[TABLE]
Even though both BinSPADE receivers attain the QFI for , the former significantly outperforms the latter for higher (see Fig. 1). However 1-BinSPADE is much more robust to imperfect implementation. Let us say is a leakage parameter such that the BinSPADE receiver measures and its orthogonal complement. Then, absolutely any results in to collapse to [math] at . On the contrary, the performance of 1-BinSPADE at small remains fairly stable and leakage tolerant, with the loss in Fisher information being proportional to the energy loss in the detected mode, thus continuing to satisfy the condition stated above and hence retaining a non-zero information at .
IV Mean Squared Error (MSE) Analysis
Let us recall the following inequality between the Mean Square Error () of an estimator , its bias , and the Fisher Information :
[TABLE]
where is the first derivative of the bias function with respect to . For comparing the MSE attained by mode-sorting receivers and image-plane detection, we will use the Normalized Root Mean Square Error, and the adapted Cramer-Rao bound defined as [3, 4].
For a set of independent and identically distributed (i.i.d.) measurements drawn from the conditional distribution (as will be the case when a set of orthogonal image-plane spatial modes are detected simultaneously), the Maximum Likelihood Estimator (MLE) is given by:
[TABLE]
Assuming the prior knowledge of the total photon number collected during the integration time, the MLE for a measurement that just measures the mode, is given by:
[TABLE]
For the -BinSPADE receiver (i.e., one that measures mode and its orthogonal complement), the total photon number can be inferred from the sum count over the two measurements:
[TABLE]
When is high, becomes a good estimator for and the two previous expressions become similar.
In Figs. 2 and 3, we show the performance of MLE for the 0 and 1-BinSPADE receivers for the Gaussian and hard apertures, respectively, for three choices of the total number of detected photons. The MLE performs very close to the CRB at high SNR and generally follows the same trend across all . As noted by Tsang et al. in [5], the estimator can become super efficient when it is biased, which happens at very low and large . In both graphs, we also plot the numerically-evaluated CRB for image-plane direct measurement using (12). The MSE attained by the BinSPADEs are seen to outperform image-plane detection by a clear margin for much smaller than the Rayleigh separation (). Despite the superior Fisher information of the BinSPADE receivers, the NRMSE highlights the unavoidable precision decay at low . Both the BinSPADEs and direct-measurement receivers can reach arbitrary precision in estimating provided they can collect a large enough photon flux (from increased exposure time). But the BinSPADEs remain more efficient than the direct measurement strategy in the sub Rayleigh regime.
V Efficient Binary SPADE For Arbitrary ASF
Consider a general (complex-valued) ASF, , , in , and its autocorrelation function,
[TABLE]
Using (1) the measurement functions associated with measuring the mode and its orthogonal complement are:
[TABLE]
The Fisher Information for this generalized 0-BinSPADE receiver evaluates to:
[TABLE]
where is the first derivative of . As for the two cases (Gaussian and hard aperture) studied before, this generalized 0-BinSPADE receiver collects all the image-plane energy in the [math] (ASF) mode for . This follows from the peak property of the autocorrelation function and the normalization of the ASF, which gives us . It is interesting to note that is independent of any phase present in the aperture function and is solely based on its intensity profile. Thus, most optical aberrations such as defocusing, spherical aberration or coma, among others, if taken into account into the projection mode, do not degrade the information, unlike the image-plane direct measurement.
Assuming admits a second order expansion near , one can write a Taylor series as follows:
[TABLE]
with , and . One can further show that:
[TABLE]
which is a -independent constant as before. Note that a real ASF benefits from the fact that . Also, from the Wiener-Khinchin theorem, increasing is equivalent to greater spatial variations in the ASF profile, i.e., sharper or numerous edges. It is simple to verify that (27) reduces to the corresponding formulas for the QFI for the Gaussian (, ) and hard rectangular apertures (, ), attained by the respective [math]-BinSPADE receivers as discussed above.
Let us now construct an orthonormal basis for a general ASF by using derivatives of . Let us first note the following identity relating the derivative of to that of :
[TABLE]
Clearly, the functions need not be orthogonal, and hence cannot be used for parallel measurements. We do a Gram-Schmidt orthogonalization by selecting weights such that each measurement mode can be written as a linear combination of the functions,
[TABLE]
such that forms an orthonormal basis (after removing some possibly identically null functions). It is easy to verify that (due to the energy normalization property of ) and hence the first mode () simply equals the ASF. The measurement functions for a simultaneous measurement of the modes can be expressed as:
[TABLE]
We consider a generalized -BinSPADE receiver associated with the mode. The corresponding measurement function is given by:
[TABLE]
and the associated Fisher Information is given by:
[TABLE]
Assuming can be expanded as in (26), we find that the generalized -BinSPADE receiver attains the same Fisher Information as that of the generalized [math]-BinSPADE receiver: , at . As discussed before in the context of Gaussian and hard rectangular apertures, a similar robustness advantage (for 1 over 0-BinSPADE) exists in the general case. Whether or not holds for a general complex-valued with our construction of the measurement modes, is left open. In future work, it will also be interesting to investigate a systematic generalization of the optimal modes for a pre-detection mode-sorting based receiver for more complex imaging problems, and to prove that the quantum Fisher Information limit for any incoherent-light imaging problem can be attained by an appropriate receiver that applies a pre-detection linear mode transformation.
Appendix
Proof:
Let be a function modeling the output of a process on a variable that is corrupted by Poisson noise:
[TABLE]
such that is twice differentiable with respect to . We note and , respectively its first and second derivatives. Then we can write the corresponding Fisher Information as:
[TABLE]
∎
Proof:
If we consider the output of multiple independent measurement functions , all twice differentiable with respect to , and their respective outputs , we have for the Fisher Information:
[TABLE]
∎
Proof:
We consider the two following series of functions over a finite interval of containing zero, for some fixed positive integer :
[TABLE]
We have the following loose upper bound on the Spherical Bessel Functions, from their series definition:
[TABLE]
We can pick such that, and we can test that, for the first series (40) we have the convergence bound:
[TABLE]
As the underlying series is positive, monotonic and converges (by the ratio test), we can choose a large enough to reduce the remainder, and subsequently the bound, to any small we wish. In other words:
[TABLE]
We can thus conclude that the function series is uniformly Cauchy and converges uniformly over .
For the second series (41) we have:
[TABLE]
where the same conclusion applies.
Finally, we consider the new set of series of functions, related respectively to and :
[TABLE]
We note that their point-wise convergence can easily be established at , as and, considering the previous results, we can apply the Differentiation Theorem to obtain their respective uniform convergence as well as the relations:
[TABLE]
∎
Proof:
We will use the following two results from [8] (Equations 1.10.50 and 1.10.52), where denotes the Sine Integral.
[TABLE]
By combining them, we obtain:
[TABLE]
Thanks to the Differentiation results of the previous lemma (51), we can write from the derivative of (53):
[TABLE]
We then find the following as the solution of a first order linear differential equation involving (55) and (57) as well as the property (52) and the constraint that the series is equal to 0 at :
[TABLE]
With this result, we can compute the derivative of (55) to express the following series and its respective shifted version:
[TABLE]
The next series is also found as the solution of another differential equation involving (61) and (62), with the same value constraint at than previously:
[TABLE]
And after deriving (61) we obtain:
[TABLE]
Finally, we can combine the results (57) to (64) to get:
[TABLE]
∎
Proof:
Considering any continuous and differentiable ASF as well as mode , the measurement function for two point sources separated by can be written as:
[TABLE]
We have for the limit of its first derivative:
[TABLE]
We can proceed similarly for the Fisher Information in the case of direct detection. We recall the expression:
[TABLE]
In the case of two point sources, the normalized intensity profile can be written with the previous ASF as : . One can write for its first derivative:
[TABLE]
∎
Proof:
Let be an aggregated measurement over a collection of measurement functions from orthogonal modes, i.e. , all corrupted by Poisson noise. At a location where , we have the following measurement function and Fisher Information:
[TABLE]
We can simplify the notations for the current location into sets and and observe that:
[TABLE]
Here, if or are equal to zero or their product is negative, and with the previous positivity constraint, the sum is clearly positive. Otherwise, one can write:
[TABLE]
As we have, and . Thus, (72) is always positive and we can conclude with the inequality:
[TABLE]
∎
Remark 5** (Generalized projection modes for a BinSPADE receiver)**
One can expand the generalized mode expression (29) in the case of the two first modes for a normalized ASF into :
[TABLE]
where is the first derivative of the ASF, is the autocorrelation of and , are respectively its first and second derivatives. Here, one can note, as the autocorrelation is hermitian, that is purely imaginary. Thus in the case of a purely real ASF it is equal to zero and the first mode can be constructed with only .
Remark 6** (Fisher Information of a leaky BinSPADE receiver)**
Considering a leakage factor , we define the efficiency of a BinSPADE receiver as . We have the following expressions of the Fisher Information for 0 and 1-BinSPADE respectively; first, for a Gaussian aperture:
[TABLE]
For a rectangular aperture:
[TABLE]
Finally, for a generic ASF , properly normalized and in :
[TABLE]
[TABLE]
One can notice from the common normalization of the ASF that the autocorrelation is equal to one at ; and as it is also hermitian, that is purely imaginary. Hence, in (81) the numerator always tend to zero for , but the denominator only does so if the efficiency is exactly equal to one. Thus, for any , the generalized 0-BinSPADE does not collect information for narrow separation angles : near [math].
On the other hand, with the expansion (26) one can develop the limit of (82) as :
[TABLE]
Thus, the generalized 1-BinSPADE is resilient to leakage in the deep sub-Rayleigh limit.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Lord Rayleigh F.R.S., “Investigations in optics, with special reference to the spectroscope”, Philosophical Magazine Series 5, 8 , 49 (1879).
- 2[2] C. E. Shannon, “A mathematical theory of communication,” Bell Syst. Tech. J., 27 , pp. 379–423 and 623–656 (1948).
- 3[3] Rao, Calyampudi Radakrishna (1945). ”Information and the accuracy attainable in the estimation of statistical parameters”. Bulletin of the Calcutta Mathematical Society. 37: 81–89. MR 0015748.
- 4[4] Cramer, Harald (1946). Mathematical Methods of Statistics. Princeton, NJ: Princeton Univ. Press. ISBN 0-691-08004-6. OCLC 185436716
- 5[5] M. Tsang, R. Nair, and X.M. Lu, “Quantum Theory of Superresolution for Two Incoherent Optical Point Sources,” Phys. Rev. X 6 , 031033 (2016)
- 6[6] R. Nair and M. Tsang, “Far-Field Superresolution of Thermal Electromagnetic Sources at the Quantum Limit,” Phys. Rev. Lett. 117 , 190801 (2016).
- 7[7] C. Lupo and S. Pirandola, “Ultimate Precision Bound of Quantum and Subwavelength Imaging,” Phys. Rev. Lett. 117 , 190802 (2016).
- 8[8] M. Abramowitz, I. Stegun, “Handbook of Mathematical Functions,” U.S. National Bureau of Standards, Washington, DC, (1964).
