Experimental Verification of PCH-EM Algorithm for Characterizing DSERN   Image Sensors

Aaron Hendrickson; David P. Haefner; Nicholas R. Shade; Eric R. Fossum

arXiv:2302.14654·physics.ins-det·June 29, 2023

Experimental Verification of PCH-EM Algorithm for Characterizing DSERN Image Sensors

Aaron Hendrickson, David P. Haefner, Nicholas R. Shade, Eric R. Fossum

PDF

Open Access

TL;DR

This paper experimentally verifies the PCH-EM algorithm's effectiveness in characterizing DSERN image sensors, demonstrating its accuracy across various exposure and noise levels and validating the PCD model's predictive capability.

Contribution

It provides a comprehensive experimental validation of the PCH-EM algorithm for DSERN sensors and confirms the PCD model's accuracy in predicting sensor behavior.

Findings

01

PCH-EM accurately characterizes DSERN pixels across a wide range of conditions.

02

The PCD model effectively predicts the ensemble distribution of the sensor.

03

Experimental results align well with model predictions, confirming their applicability.

Abstract

The Photon Counting Histogram Expectation Maximization (PCH-EM) algorithm has recently been reported as a candidate method for the characterization of Deep Sub-Electron Read Noise (DSERN) image sensors. This work describes a comprehensive demonstration of the PCH-EM algorithm applied to a DSERN capable quanta image sensor. The results show that PCH-EM is able to characterize DSERN pixels for a large span of quanta exposure and read noise values. The per-pixel characterization results of the sensor are combined with the proposed Photon Counting Distribution (PCD) model to demonstrate the ability of PCH-EM to predict the ensemble distribution of the device. The agreement between experimental observations and model predictions demonstrates both the applicability of the PCD model in the DSERN regime as well as the ability of the PCH-EM algorithm to accurately estimate the underlying model…

Equations57

X

X

K

R

f_{X} (x ∣ θ) = k = 0 \sum \infty \frac{e ^{- H} H ^{k}}{k !} ϕ (x; μ + k / g, σ^{2}),

f_{X} (x ∣ θ) = k = 0 \sum \infty \frac{e ^{- H} H ^{k}}{k !} ϕ (x; μ + k / g, σ^{2}),

H_{t + 1}

H_{t + 1}

g_{t + 1}

μ_{t + 1}

σ_{t + 1}^{2}

A_{t}

A_{t}

B_{t}

C_{t}

γ_{nk}^{(t)} = \frac{\frac{e ^{- H_{t}} H _{t}^{k}}{k !} ϕ ( x _{n} ; μ _{t} + k / g _{t} , σ _{t}^{2} )}{\sum _{ℓ = 0}^{\infty} \frac{e ^{- H_{t}} H _{t}^{ℓ}}{ℓ !} ϕ ( x _{n} ; μ _{t} + ℓ / g _{t} , σ _{t}^{2} )},

γ_{nk}^{(t)} = \frac{\frac{e ^{- H_{t}} H _{t}^{k}}{k !} ϕ ( x _{n} ; μ _{t} + k / g _{t} , σ _{t}^{2} )}{\sum _{ℓ = 0}^{\infty} \frac{e ^{- H_{t}} H _{t}^{ℓ}}{ℓ !} ϕ ( x _{n} ; μ _{t} + ℓ / g _{t} , σ _{t}^{2} )},

\tilde{k}_{n} = k arg max γ_{nk}^{(t)} .

\tilde{k}_{n} = k arg max γ_{nk}^{(t)} .

E ∣ H, g, μ, σ^{2}

E ∣ H, g, μ, σ^{2}

(H, g, μ, σ^{2})

f_{E} (x) = \iiiint_{Θ} f_{X} (x ∣ θ) f_{θ} (θ) d θ,

f_{E} (x) = \iiiint_{Θ} f_{X} (x ∣ θ) f_{θ} (θ) d θ,

E^{'} ∣ H, σ_{e -}^{2}

E^{'} ∣ H, σ_{e -}^{2}

(H, σ_{e -}^{2})

f_{E^{'}} (x) = \iint_{Θ^{'}} f_{X} (x ∣ H, 1, 0, σ_{e -}^{2}) f_{θ^{'}} (θ^{'}) d θ^{'},

f_{E^{'}} (x) = \iint_{Θ^{'}} f_{X} (x ∣ H, 1, 0, σ_{e -}^{2}) f_{θ^{'}} (θ^{'}) d θ^{'},

K_{e} ∣ H

K_{e} ∣ H

H

p_{K_{e}} (k) = \int_{H} \frac{e ^{- H} H ^{k}}{k !} f_{H} (H) d H = \frac{( - 1 ) ^{k}}{k !} \partial_{t}^{k} M_{H} (- t) ∣_{t = 1},

p_{K_{e}} (k) = \int_{H} \frac{e ^{- H} H ^{k}}{k !} f_{H} (H) d H = \frac{( - 1 ) ^{k}}{k !} \partial_{t}^{k} M_{H} (- t) ∣_{t = 1},

E (E) = E (E (E ∣ θ)) = E (μ) + E (H / g) .

E (E) = E (E (E ∣ θ)) = E (μ) + E (H / g) .

Var (E)

Var (E)

= Var (μ + H / g) + E (σ^{2}) + E (H / g^{2}) .

Var (E) = Var (μ) + Var (H / g) + E (σ^{2}) + E (H / g^{2}) + 2 (E (μ H / g) - E (μ) E (H / g)) .

Var (E) = Var (μ) + Var (H / g) + E (σ^{2}) + E (H / g^{2}) + 2 (E (μ H / g) - E (μ) E (H / g)) .

f_{X} (x ∣ θ) = k = 0 \sum \infty P (K = k) f_{X ∣ K} (x ∣ k) ϕ (x; μ + k / g, σ^{2}),

f_{X} (x ∣ θ) = k = 0 \sum \infty P (K = k) f_{X ∣ K} (x ∣ k) ϕ (x; μ + k / g, σ^{2}),

Var (X ∣ K = k) = σ^{2},

Var (X ∣ K = k) = σ^{2},

f_{E} (x) = k = 0 \sum \infty P (K_{e} = k) f_{E ∣ K_{e}} (x ∣ k) \frac{E _{θ} ( e ^{- H} H ^{k} ϕ ( x ; μ + k / g , σ ^{2} ))}{E _{θ} ( e ^{- H} H ^{k} )},

f_{E} (x) = k = 0 \sum \infty P (K_{e} = k) f_{E ∣ K_{e}} (x ∣ k) \frac{E _{θ} ( e ^{- H} H ^{k} ϕ ( x ; μ + k / g , σ ^{2} ))}{E _{θ} ( e ^{- H} H ^{k} )},

Var (E ∣ K_{e} = k) = E (E^{2} ∣ K_{e} = k) - (E (E ∣ K_{e} = k))^{2},

Var (E ∣ K_{e} = k) = E (E^{2} ∣ K_{e} = k) - (E (E ∣ K_{e} = k))^{2},

E (E^{2} ∣ K_{e} = k) = \frac{E _{θ} ( e ^{- H} H ^{k} ( σ ^{2} + ( μ + k / g ) ^{2} )}{E _{θ} ( e ^{- H} H ^{k} )}

E (E^{2} ∣ K_{e} = k) = \frac{E _{θ} ( e ^{- H} H ^{k} ( σ ^{2} + ( μ + k / g ) ^{2} )}{E _{θ} ( e ^{- H} H ^{k} )}

E (E ∣ K_{e} = k) = \frac{E _{θ} ( e ^{- H} H ^{k} ( μ + k / g ))}{E _{θ} ( e ^{- H} H ^{k} )} .

E (E ∣ K_{e} = k) = \frac{E _{θ} ( e ^{- H} H ^{k} ( μ + k / g ))}{E _{θ} ( e ^{- H} H ^{k} )} .

Var (E^{'} ∣ K_{e} = k) = \frac{E _{θ} ( e ^{- H} H ^{k} σ _{e -}^{2} )}{E _{θ} ( e ^{- H} H ^{k} )} = H ⊥ σ_{e -}^{2} E_{θ} (σ_{e -}^{2}),

Var (E^{'} ∣ K_{e} = k) = \frac{E _{θ} ( e ^{- H} H ^{k} σ _{e -}^{2} )}{E _{θ} ( e ^{- H} H ^{k} )} = H ⊥ σ_{e -}^{2} E_{θ} (σ_{e -}^{2}),

x_{ij}^{'} = F_{β (2, 2)}^{- 1} (\tilde{F} (x_{ij}))

x_{ij}^{'} = F_{β (2, 2)}^{- 1} (\tilde{F} (x_{ij}))

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCCD and CMOS Imaging Sensors · Electron and X-Ray Spectroscopy Techniques · Infrared Target Detection Methodologies

Full text

Experimental Verification of PCH-EM Algorithm for Characterizing DSERN Image Sensors

Aaron Hendrickson, David P. Haefner, Nicholas R. Shade , and Eric R. Fossum

Abstract

The Photon Counting Histogram Expectation Maximization (PCH-EM) algorithm has recently been reported as a candidate method for the characterization of Deep Sub-Electron Read Noise (DSERN) image sensors. This work describes a comprehensive demonstration of the PCH-EM algorithm applied to a DSERN capable quanta image sensor. The results show that PCH-EM is able to characterize DSERN pixels for a large span of quanta exposure and read noise values. The per-pixel characterization results of the sensor are combined with the proposed Photon Counting Distribution (PCD) model to demonstrate the ability of PCH-EM to predict the ensemble distribution of the device. The agreement between experimental observations and model predictions demonstrates both the applicability of the PCD model in the DSERN regime as well as the ability of the PCH-EM algorithm to accurately estimate the underlying model parameters.

Index Terms:

conversion gain, DSERN, EM algorithm, PCH, PCH-EM, photon counting, QIS, quanta exposure, read noise.

I Introduction

As the detection precision of advanced camera technology improves, the ability to properly characterize and evaluate modern image sensors only becomes more important. While the traditional Photon Transfer (PT) method [1, 2, 3] can be applied to Deep Sub-Electron Read Noise (DSERN) image sensors, it has been shown there are other algorithms that can improve the accuracy and precision of the camera characterization [4, 5, 6]. Specifically, both the Photon Counting Histogram (PCH) method [4, 7, 8, 9] and recently introduced Maximum Likelihood Estimation (MLE) based method [5] have been demonstrated to incur less uncertainty in their estimates as compared to the PT method. Recently, Hendrickson and Haefner proposed a fourth method, Photon Counting Histogram Expectation Maximization (PCH-EM), that improves on these techniques by providing an automated algorithm for simultaneous maximum likelihood estimation of quanta exposure, conversion gain, bias (DC offset), and read noise of DSERN pixels from a single sample of data [6].

Due to the cutting edge nature of DSERN capable sensors, the PCH-EM algorithm was initially demonstrated using simulated Monte Carlo experiments. In this paper, through the use of an early photon-counting-capable Quanta Image Sensor (QIS) from Gigajot Technology Inc., a more comprehensive demonstration of the PCH-EM algorithm and verification of the associated Photon Counting Distribution (PCD) model is provided. This is accomplished by first reviewing the assumed mathematical model and theoretical framework behind the PCH-EM method. New theory pertaining to ensemble statistics of DSERN sensors is also introduced. Experimental conditions and data capture methods needed for dark current characterization with PCH-EM are provided. The experimental observations are evaluated through the PCH-EM algorithm, providing a full characterization of the sensor giving per-pixel estimates of dark current, conversion gain, bias, and read noise all from a single sequence of images captured under dark conditions. The per-pixel characterization results are then combined with the PCD model to predict the ensemble distribution for the device, showing that the model is able to predict the distribution of the raw sensor data. This agreement demonstrates both the applicability of the PCD model in the DSERN regime as well as the PCH-EM algorithm’s ability to accurately estimate the underlying model parameters.

II Theory

II-A The PCD

The digital output of a DSERN pixel is modeled as

[TABLE]

where $H$ is the quanta exposure $(e\text{-})$ , $\sigma_{R}$ the input referred analog read noise $(e\text{-})$ , $g$ the conversion gain $(e\text{-}/\mathrm{DN})$ , $\mu$ is the pixel bias $(\mathrm{DN})$ , and $\lceil\cdot\rfloor$ denotes rounding to the nearest integer. As such, the random variable $X$ represents the random process of adding noise $(R)$ to a number of electrons $(K)$ followed by the application of gain, offset, and finally quantization. Note that this is a general sensor model not specific to DSERN devices. What differentiates DSERN pixels is the fact that the signal corrupting noise $R$ is sufficiently small so that the electron number $K$ can be reasonably estimated.

Assuming $g\ll\sigma_{R}$ , quantization (rounding) in (1) can be modeled as an additive noise component so that the distribution of $X$ is reasonably approximated by the Photon Counting Distribution (PCD) [6]

[TABLE]

where $\theta=(H,g,\mu,\sigma^{2})$ are the PCD parameters and $\phi(x;\mu,\sigma^{2})=\frac{1}{\sqrt{2\pi\sigma^{2}}}\exp(-(x-\mu)^{2}/2\sigma^{2})$ is the Gaussian probability density with mean $\mu$ and variance $\sigma^{2}$ . In (2), $\sigma=(\sigma_{R}^{2}/g^{2}+\sigma_{Q}^{2})^{1/2}$ is the combined read and quantization noise in $(\mathrm{DN})$ .

II-B The PCH-EM Algorithm

Given a random sample $\mathbf{x}=\{x_{1},\dots,x_{N}\}$ with $x_{n}\overset{\mathrm{iid}}{\sim}\operatorname{PCD}(H,g,\mu,\sigma^{2})$ and an initial estimate of the parameters $\theta_{0}=(H_{0},g_{0},\mu_{0},\sigma_{0}^{2})$ , the PCH-EM algorithm iteratively updates the parameter estimates via the update equations [6]

[TABLE]

where $\bar{x}=\frac{1}{N}\sum_{n=1}^{N}x_{n}$ and $\overline{x^{2}}=\frac{1}{N}\sum_{n=1}^{N}x_{n}^{2}$ are the first two sample moments and

[TABLE]

where

[TABLE]

are the so-called membership probabilities; representing a probability distribution of the unknown electron number associated with each observation $x_{n}$ . As such, the membership probabilities satisfy $\sum_{k=0}^{\infty}\gamma_{nk}^{(t)}=1$ .

In each iteration, the algorithm takes the current estimate $\theta_{t}$ and then performs an Expectation (E) step to compute the $\gamma_{nk}^{(t)}$ followed by a Maximization (M) step to update the estimate according to (3). In doing so, the algorithm guarantees an increase in the likelihood of the sample at each iteration such that a local maxima of the likelihood function is always achieved111Assuming the starting point $\theta_{0}$ is sufficiently good, PCH-EM achieves the global maximum of the likelihood function so that the final estimates are maximum likelihood estimates for their respective parameters. [10]. The algorithm halts when a specified convergence criteria is met.

In the context of machine learning, the general EM algorithm can be viewed as a density-based clustering algorithm, assigning labels to each datapoint based on what cluster the datapoint is most likely to belong to. In the context of PCH-EM, the Gaussian components comprising the PCD are the clusters, with the electron number determining which cluster an observation belongs. As such, a natural byproduct of the PCH-EM algorithm is the ability to map each observation $x_{n}$ to a nonnegative integer $\tilde{k}_{n}$ representing a best estimate for the electron number associated with each observation. This post-process denoising of the sensor data is accomplished by applying the membership probabilities via

[TABLE]

In essence, the mapping $\tilde{k}_{n}:x_{n}\to\mathbb{N}_{0}$ described in (6) is clustering the data by its mostly likely electron number in an optimal manner as to reduce bit error rates [11].

To see this optimal clustering in action, consider the example $x_{n}\sim\operatorname{PCD}(1.8,1,0,(0.33)^{2})$ , where the values of the parameters $g=1$ and $\mu=0$ are selected so that the data can be interpreted as being in units of $e\text{-}$ . Figure 1 shows the PCD along with the optimal cluster edges obtained from the quantization described in (6). While the PCD peaks are centered at nonnegative integers, it can be seen that the cluster edges are not directly centered between the peaks nor are the clusters of equal size. This nonuniform clustering ensures optimal estimation of the electron number for each observation.

II-C Ensemble Statistics

The PCD in (2) describes the distribution of data produced by a single DSERN pixel. When considering data produced by an array of DSERN pixels, each with potentially different parameters, the parameters themselves can be modeled as random variables. Denoting $E$ as the random variable describing the ensemble of pixels leads to the hierarchical model

[TABLE]

so that the distribution of $E$ is given by the Ensemble PCD (EPCD)

[TABLE]

where $F_{\theta}$ is the joint distribution of the parameters (with corresponding joint density $f_{\theta}$ ) and $\Theta\subset\mathbb{R}^{4}$ is the parameter space denoting all possible values of the parameter vector $\theta$ . Alternatively, the EPCD can be written as $f_{E}(x)=\mathsf{E}_{\theta}(f_{X}(x|\theta))$ , where $\mathsf{E}_{\theta}$ denotes the expected value w.r.t. $\theta$ . The moments of the EPCD can be given in terms of the moments of the parameters as shown in Appendix A.

Unlike the per-pixel PCD, the peaks (local maxima) in the EPCD typically disappear at higher signal levels (c.f. Figure 4 (top)), which is indicative of conversion gain nonuniformity (see Appendix B). For this reason, it is also useful to consider the ensemble distribution after correcting conversion gain nonuniformity and bias through a conventional two point Non-Uniformity Correction (NUC). Applying a gain and offset correction both improves the resolution of the peaks and centers them on the nonnegative integers. The Non-Uniformity Corrected (NUCed) EPCD can be found by setting $g=1$ and $\mu=0$ as constants leading to the model

[TABLE]

with distribution

[TABLE]

where $\sigma_{e\text{-}}=\sigma\times g$ is the total read and quantization noise in units of electrons and $\theta^{\prime}=(H,\sigma_{e\text{-}}^{2})$ . Examples of both the EPCD and NUCed EPCD can be seen in Figure 4 (see Section IV-C).

Lastly, consider the ensemble distribution of the electron number $K$ , which will appear later when evaluating the ability of PCH-EM to predict electron numbers. On a per-pixel basis the electron number is Poisson distributed and since each pixel may have a unique quanta exposure, the ensemble electron number $K_{e}$ is described by

[TABLE]

with probability mass

[TABLE]

where $M_{H}(t)=\mathsf{E}e^{tH}$ denotes the moment generating function of the quanta exposure random variable $H$ .

III Experimental Method

The experimental data was collected using a developmental DSERN capable camera from Gigajot Technology Inc. The specific camera chosen is the GJ00111, which consists of a monochrome one megapixel CMOS QIS with $1.1\,\mu\mathrm{m}$ pitch pixels. It was operated at its full bit-depth of 14-bits using four Correlated Multi-Sample (CMS) cycles to minimize read noise.

For this experiment, the PCH-EM algorithm was used to estimate per-pixel dark current. This is accomplished through operating the camera with a lens cap and using a long integration time of $t_{\mathrm{int}}=4.87\,\mathrm{s}$ . The long integration time ensures each pixel was given ample opportunity to produce thermally generated free-electrons. A total of $17,750$ frames over a $512\times 512\,\mathrm{px}$ region of interest were captured continuously under the dark environment.

IV Results

The PCH-EM algorithm code, available on the Mathworks File Exchange [12], was applied on a per pixel basis to the experimental dataset. To expedite calculations, parallel methods (memory limitations permitting) can be used as the per-pixel estimates can be found independently. An additional speed improvement is also possible by implementing PCH-EM through histograms (number of occurrences for each unique DN observed). Using the histogram is especially beneficial when there are relatively few unique DN values in a sample compared to the number of frames reported. Finally, an additional improvement can also be achieved by vectorizing the code, running the same sequence of calculations on multiple pixels simultaneously with MATLAB’s optimized methods. The final time for the analysis was slightly under an hour running 12 cores on the machine used. Eventually, the release of the optimized histogram implementation of PCH-EM and other improvements to the algorithm in future updates will be provided on the Mathworks file exchange.

The experiments were conducted with no external illumination with the intention of characterizing the dark current. The dark current $(i_{d})$ given in units of $(e\text{-}/\mathrm{px}/\mathrm{s})$ is found from the relation $i_{d}=H/t_{\mathrm{int}}$ . Additionally, the read plus quantization noise $(\sigma_{e\text{-}})$ given in units of $(e\text{-})$ is found by multiplying the gain by the square root of $\sigma^{2}$ , i.e. $\sigma_{e\text{-}}=\sqrt{\sigma^{2}}\times g$ .

IV-A Per-pixel Characterization

The PCH-EM algorithm provides estimates of the PCD model parameters $H$ , $g$ , $\mu$ , and $\sigma^{2}$ . Using these estimated parameters, the predicted probability density of the individual pixels can be computed. A comparison of the predicted density (solid black line) against the observed experimental histogram (gray bars) for four of the sensor’s pixels are shown in Figure 2.

The four pixels shown were selected to demonstrate that the algorithm provides a good fit to the data at high or low quanta exposure as well as high or low read noise. Note that with low quanta exposure, there are very few peaks that may be used for estimating the conversion gain. However, even under such conditions, the probability density function calculated from the parameter estimates still accurately matches the observed data histogram. Also note that since these are dark frame measurements, the quanta exposure represents the expected number of free-electrons generated per-integration time via thermal contributions. Since this is proportional to the integration time, increasing the integration time will increase the observed quanta exposure and can further improve the estimates of the conversion gain if needed.

IV-B PCD Parameter Maps

Applied to the array, the PCD parameters for each pixel were estimated resulting in four two-dimensional arrays (maps) containing per-pixel estimates of $H$ , $g$ , $\mu$ , and $\sigma^{2}$ . Perhaps the most important for DSERN sensors is the distribution of read noise shown below in Figure 3. As can be seen, the vast majority of the pixels have an estimated read noise of less than $0.4\,e\text{-}$ with the median of the histogram occurring at $0.305\,e\text{-}$ .

The spatial context and distributions of other parameters are found in Figures 6-9 (see Appendix C). Structure (or lack thereof) observed in the parameter maps can be tied back to the architecture of the sensor and may potentially be useful in tuning the sensor parameters during development.

IV-C Ensemble Distributions

Applying the parameter estimates through (8), one can observe how the PCH-EM algorithm fits the sensor data on the array scale by predicting the EPCD of the sensor and comparing it to the ensemble histogram of the raw data. In order to estimate the EPCD, the unknown joint density of the PCD parameters $f_{\theta}$ must be determined. While this density is unknown, it may be approximated by binning the four parameter maps in a four-dimensional histogram. After normalization, this provides a discrete approximation for $f_{\theta}$ . The approximate EPCD is then found by evaluating (8), replacing integrals with sums, for an appropriate range of $x$ -values.

Figure 4 (top) shows the ensemble histogram made from $250$ frames of the raw experimental data compared to the estimated EPCD using the parameter maps. For comparison, two EPCD’s were estimated under the assumption of mutually independent parameters ( $f_{\theta}$ approximated by the product of four individual histograms) and dependent parameters ( $f_{\theta}$ approximated by a single four-dimensional histogram), respectively. One can see that the EPCD under the assumption of dependent parameters provides an excellent fit to the raw data; thus providing experimental confirmation of the PCD model and PCH-EM algorithm. The fact that the estimated EPCD for dependent parameters ( $\operatorname{RSME}=1.9\times 10^{-4}$ ) provides a better fit compared to the case of independent parameters ( $\operatorname{RSME}=8.3\times 10^{-4}$ ) makes sense since, for example, the expression for the variance contains $g$ ; therefore, it is expected for $\sigma^{2}$ and $g$ to be dependent. In the ensemble histogram, it can be observed that the peaks become less distinct as signal increases which is usually indicative of conversion gain nonuniformity (see Appendix B).

Figure 4 (bottom) shows the NUCed ensemble histogram found by subtracting per-pixel estimates of $\mu$ from each frame and then multiplying the bias corrected frames by the per-pixel estimates of $g$ . This effectively removes the effects of gain and offset nonuniformity from the raw data. Notice that the peaks are now more clearly resolved and located at nonnegative integers showing that this two-point NUC restores the electron counting capabilities of the sensor. Using the same approach as before, the NUCed EPCD can be found by approximating the joint density $f_{\theta^{\prime}}$ from the quanta exposure and read noise maps under the assumption of dependent and independent parameters, and then approximating the double integral in (10) by sums. As seen in the bottom of Figure 4, both ensemble predictions under the assumption of dependent parameters ( $\operatorname{RSME}=2.3\times 10^{-4}$ ) and independent parameters ( $\operatorname{RSME}=2.7\times 10^{-4}$ ) fit the NUCed data quite well with a slight advantage given to the case of dependent parameters. This indicates that the quanta exposure (dark current) is nearly independent of the read noise (when in units of electrons), which is to be expected (see discussion at the end of Appendix B).

Through Figures 2 and 4, the PCD and PCH-EM algorithm have been shown to be effective in modeling DSERN sensor data and providing estimates of the model parameters, respectively. What remains to be demonstrated is if the electron number prediction formula in (6) can effectively recover the electron numbers for each observation. Using (6), the predicted electron number for each pixel of the $250$ -frame stack of raw experimental data was computed. This process resulted in an array of nonnegative integers, the same size as the image stack, containing the all predictions. A histogram of the predictions is given in Figure 5. While it cannot be known if these predictions agree with the actual electron numbers associated with each observation, the distribution of the predictions can be compared to what would be expected according to the ensemble electron number probability mass in (12). To predict this ensemble distribution, the unknown quanta exposure density $f_{H}$ was approximated by binning the quanta exposure map and then replacing the integral in (12) by a finite sum. Figure 5 compares the ensemble histogram of the electron number predictions against the predicted probability mass according to the model. Recalling the discussion in Section II-B, the data presented in Figure 5 can be viewed as an optimal quantization of the NUCed EPCD in Figure 4 (bottom). The quality of fit between the data and predicted probability mass demonstrates, at the very least, that the predicted electron numbers agree with the actual electron numbers in terms of distribution.

V Discussion and Future Work

In this paper, the PCH-EM algorithm proposed in [6] was successfully demonstrated to accurately estimate quanta exposure, conversion gain, bias, and read noise of DSERN pixels in an automated fashion. Combining the assumed model with the corresponding estimated parameters accurately recreates the raw sensor data histograms, both on a per-pixel level as well as at the ensemble (array) level. The ensemble prediction required accounting for the correlation of the four model parameters. Additionally, it was shown how a two-point non-uniformity correction may be determined and applied to the ensemble, which improves the resolution of individual electron peaks and restores electron counting of the device. Lastly, the ability of PCH-EM to denoise raw sensor measurements and recover the hidden electron signal was demonstrated.

This PCH-EM algorithm is a powerful tool for investigating and tuning the performance of DSERN sensors, as it can be applied automatically over a large span of parameters. Through the use of the estimated parameter maps, PCH-EM not only is useful for sensor characterization but also may find application during the advanced development of the sensors themselves. Also, together with the Monte Carlo methods provided in [12], an experimentalist can investigate the number of frames required to achieve a desired uncertainty.

Future work will include expanding upon the current method to combine multiple illumination level measurements in a multi-sample version of PCH-EM, exploring techniques for accounting for non-linear responses, and releasing optimized code on the Mathworks File Exchange. Additionally, implementing various techniques for estimating the sample Fisher information will be pursued [13, 14, 15, 16]. The ability to estimate the Fisher information would allow the PCH-EM algorithm to not only provide the parameter estimates but also their uncertainties. Ultimately, a generalized characterization method should work across the full dynamic range of the sensor and full parameter space of photon counting sensors.

VI Acknowledgments

The authors would like thank Nico Schlömer for his matlab2tikz function, which was used to create the figures throughout this work [17].

Appendix A Moments of the Ensemble Distributions

Moments of the EPCD are found by noting that $E|\theta\sim\operatorname{PCD}(H,g,\mu,\sigma^{2})$ . Using the law of total expectation the first moment is

[TABLE]

Likewise, by the law of total variance

[TABLE]

Expanding the first variance term further then gives the final result of

[TABLE]

The analogous moments of the NUCed EPCD come from these expressions upon setting $\mu=0$ and $g=1$ as constants. This gives $\mathsf{E}(E^{\prime})=\mathsf{E}(H)$ and $\mathsf{Var}(E^{\prime})=\mathsf{Var}(H)+\mathsf{E}(H)+\mathsf{E}(\sigma_{e\text{-}}^{2})$ .

Appendix B Dependence of Ensemble Peak Resolution on Parameter Nonuniformity

Here, the loss of peak resolution in the EPCD at higher signal levels and the dependence of this phenomenon on parameter nonuniformity is investigated. To do this, it is important to first understand why this behavior is not observed in the single pixel model.

Recall the distribution for a single pixel is given by the PCD

[TABLE]

which is comprised of an infinite mixture of Gaussian components given by the probability density $f_{X|K}$ . The individual components are thus isolated by considering the distribution of the random variable $X|K=k\sim\mathcal{N}(\mu+k/g,\sigma^{2})$ . Computing the variance of this conditioned variable gives

[TABLE]

which is independent of the electron number $k$ . This means that the widths of each component making up the PCD are the same; thus, as signal $(k)$ increases, the resolution of individual peaks remains constant.

Repeating this calculation for the ensemble variable $E$ , while assuming the appropriate regularity conditions to allow interchanging series and integration, the EPCD in (8) can be written in the form

[TABLE]

which is comprised of an infinite mixture of non-Gaussian components given by the probability density $f_{E|K_{e}}$ . The variance of the conditioned variable $E|K_{e}=k\sim f_{E|K_{e}}$ is then given by

[TABLE]

where

[TABLE]

and

[TABLE]

Upon inspection, $\mathsf{Var}(E|K_{e}=k)$ is dependent on $k$ ; thus the widths of the components comprising the EPCD vary with signal level leading to a loss of peak resolution at higher signals.

What is not clear is if the dependence of $\mathsf{Var}(E|K_{e}=k)$ on $k$ is linked to the nonuniformity of only a subset of the parameters. This can be investigated by considering what happens to $\mathsf{Var}(E|K_{e}=k)$ when holding none, one, two, three, or all four parameters constant. This results in sixteen cases. Evaluating all sixteen cases, it can be shown that holding $(H,g)$ , $(H,g,\mu)$ , $(H,g,\sigma^{2})$ , $(g,\mu,\sigma^{2})$ , or $(H,g,\mu,\sigma^{2})$ constant removes the dependence on $k$ . Since the $(H,g)$ case implies the $(H,g,\mu)$ and $(H,g,\sigma^{2})$ cases, and the $(H,g,\mu,\sigma^{2})$ case results in the original PCD, there are only two ways for $\mathsf{Var}(E|K_{e}=k)$ to be independent of $k$ under dependent parameters: when $(H,g)$ is constant or $(g,\mu,\sigma^{2})$ is constant. Thus holding certain subsets of the parameters constant does remove the dependence of $\mathsf{Var}(E|K_{e}=k)$ on $k$ resulting in constant peak resolution. It is also worth noting that $g$ appears in all of these cases showing that if conversion gain nonuniformity exists, then the EPCD component width must depend on $k$ ; causing peak resolution to decrease at higher signal levels. It is interesting that holding only $g$ constant does not remove the dependence on $k$ ; however, note that if $H$ is independent of $(g,\mu,\sigma^{2})$ and then $g$ is held constant $\mathsf{Var}(E|K_{e}=k)=\mathsf{E}(\sigma^{2})+\mathsf{Var}(\mu)$ . This shows that the loss of resolution in the EPCD peaks can be solely contributed to conversion gain nonuniformity of $H$ is independent of $(g,\mu,\sigma^{2})$ . With so many combinations to consider, a study of the statistical dependence of the individual parameters in actual sensor systems may help guide further analysis.

The component width of the NUCed EPCD can also be found as a special case of $\mathsf{Var}(E|K_{e})$ for $\mu=0$ and $g=1$ constant. This special condition leads to

[TABLE]

where the last equality holds when $H$ is independent of $\sigma_{e\text{-}}^{2}$ . This explains why the component width of the NUCed EPCD in Figure 4 appears to be constant.

Appendix C Estimated Parameter Maps

One of the challenges when displaying the estimated parameter maps is the presence of outliers, which given a limited dynamic range of the display means one typically has to clip the map values. To provide a visually aesthetic way to display the maps, a nonlinear transformation of the form

[TABLE]

was applied to the map elements. Here, $F^{-1}_{\beta(2,2)}$ is the $\operatorname{Beta}(2,2)$ quantile function, $\tilde{F}$ is the empirical cumulative distribution function of the map, and $x_{ij}$ is the $ij$ th element of the map. This transformation takes the original histogram of the map and shapes it into that of a $\operatorname{Beta}(2,2)$ distribution; however, because this transformation is monotone, any structures in the original map are carried over to the final transformation, all while suppressing the appearance of outliers so that the structure is clearly observed.

Bibliography17

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] B. P. Beecken and E. R. Fossum, “Determination of the conversion gain and the accuracy of its measurement for detector elements and arrays,” Appl. Opt. , vol. 35, no. 19, pp. 3471–3477, Jul 1996.
2[2] J. R. Janesick, Photon Transfer: D N → λ → 𝐷 𝑁 𝜆 DN\to\lambda . SPIE, 2007.
3[3] A. Hendrickson, D. P. Haefner, and B. L. Preece, “On the optimal measurement of conversion gain in the presence of dark noise,” J. Opt. Soc. Am. A , vol. 39, no. 12, pp. 2169–2185, Dec 2022.
4[4] D. A. Starkey and E. R. Fossum, “Determining conversion gain and read noise using a photon-counting histogram method for deep sub-electron read noise image sensors,” IEEE Journal of the Electron Devices Society , vol. 4, no. 3, pp. 129–135, May 2016.
5[5] K. Nakamoto and H. Hotaka, “Efficient and accurate conversion-gain estimation of a photon-counting image sensor based on the maximum likelihood estimation,” Opt. Express , vol. 30, no. 21, pp. 37 493–37 506, Oct 2022.
6[6] A. Hendrickson and D. P. Haefner, “Photon counting histogram expectation maximization algorithm for characterization of deep sub-electron read noise sensors,” Cornell University ar Xiv , vol. 2302.00090, 2023.
7[7] J. Ma, D. Starkey, A. Rao, K. Odame, and E. R. Fossum, “Characterization of quanta image sensor pump-gate jots with deep sub-electron read noise,” IEEE Journal of the Electron Devices Society , vol. 3, no. 6, pp. 472–480, 2015.
8[8] J. Ma and E. R. Fossum, “Quanta image sensor jot with sub 0.3e- r.m.s. read noise and photon counting capability,” IEEE Electron Device Letters , vol. 36, no. 9, pp. 926–928, 2015.