Quantifying the Loss of Information from Binning List-Mode Data

Eric Clarkson

arXiv:1902.04606·cs.IT·March 18, 2020

TL;DR

This paper investigates how binning list-mode data in imaging modalities like SPECT and PET causes information loss, quantifies this loss using Fisher information, and identifies three key factors influencing it.

Contribution

It introduces a computational method to quantify Fisher information loss due to binning in list-mode data, considering data smoothness, object characteristics, and binning scheme.

Findings

1. Information loss depends on data smoothness, object, and binning scheme.

2. Fisher information decreases with more aggressive binning.

3. The method enables optimization of binning strategies for minimal information loss.

Abstract

List-mode data is increasingly being used in SPECT and PET imaging, among other imaging modalities. However, there are still many imaging designs that effectively bin list-mode data before image reconstruction or other estimation tasks are performed. Intuitively, the binning operation should result in a loss of information. In this work we show that this is true for Fisher information and provide a computational method for quantifying the information loss. In the end we find that the information loss depends on three factors. The first factor is related to the smoothness of the mean data function for the list-mode data. The second factor is the actual object being imaged. Finally, the third factor is the binning scheme in relation to the other two factors.

Equations

$$pr\left(\mathbf{A}|\boldsymbol{\theta}\right)=\frac{\left[\bar{N}\left(\boldsymbol{\theta}\right)\right]^{N}}{N!}\exp\left[-\bar{N}\left(\boldsymbol{\theta}\right)\right]\left[\prod_{n=1}^{N}pr\left(\mathbf{a}_{n}|\boldsymbol{\theta}\right)\right],$$

$$\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)=\left\langle \left[\nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{A}|\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{A}|\boldsymbol{\theta}\right)\right]^{\dagger}\right\rangle_{\mathbf{A}|\boldsymbol{\theta}}.$$

$$\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)=\bar{N}\left(\boldsymbol{\theta}\right)\left\{ \left\langle \left[\nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{\dagger}\right\rangle_{\mathbf{a}|\boldsymbol{\theta}}+\left[\nabla_{\boldsymbol{\theta}}\ln\bar{N}\left(\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln\bar{N}\left(\boldsymbol{\theta}\right)\right]^{\dagger}\right\} .$$

$$g\left(\mathbf{a}\right)=\sum_{n=1}^{N}\delta\left(\mathbf{a}-\mathbf{a}_{n}\right).$$

$$g_{m}=\sum_{n=1}^{N}b_{m}\left(\mathbf{a}_{n}\right)=\int_{\mathbb{A}}g\left(\mathbf{a}\right)b_{m}\left(\mathbf{a}\right)d^{q}a$$

$$pr\left(\mathbf{g}|\boldsymbol{\theta}\right)=\prod_{m=1}^{M}\frac{\left[\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]^{g_{m}}}{g_{m}!}\exp\left[-\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]$$

$$\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)=\left\langle \left[\nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{g}|\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{g}|\boldsymbol{\theta}\right)\right]^{\dagger}\right\rangle$$

$$\bar{N}\left(\boldsymbol{\theta}\right)=\int_{\mathbb{A}}\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)d^{q}a=\sum_{m=1}^{M}\bar{g}_{m}\left(\boldsymbol{\theta}\right).$$

$$\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)=\bar{N}\left(\boldsymbol{\theta}\right)\left\langle \left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{\dagger}\right\rangle_{\mathbf{a}|\boldsymbol{\theta}},$$

$$\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)=\int_{\mathbb{A}}\left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{\dagger}\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)d^{q}a.$$

$$\bar{g}_{m}\left(\boldsymbol{\theta}\right)=\int_{\mathbb{A}}b_{m}\left(\mathbf{a}\right)\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)d^{q}a$$

$$\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)=\bar{N}\left(\boldsymbol{\theta}\right)\left\langle \left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]^{\dagger}\right\rangle_{m|\boldsymbol{\theta}}.$$

$$\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)=\sum_{m=1}^{M}\left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]^{\dagger}\bar{g}_{m}\left(\boldsymbol{\theta}\right)$$

$$AUC\left(\boldsymbol{\theta}_{0},\boldsymbol{\theta}_{1}\right)=\frac{1}{2}+\frac{1}{2}\mathrm{erf}\left[\frac{1}{2}d\left(\boldsymbol{\theta}_{0},\boldsymbol{\theta}_{1}\right)\right],$$

$$\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}=\int_{\mathbb{A}}\left[\triangle\boldsymbol{\theta}^{\dagger}\nabla_{\boldsymbol{\theta}}\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{2}\left[\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{-1}d^{q}a$$

$$\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}=\sum_{m=1}^{M}\left[\int_{\mathbb{A}}b_{m}\left(\mathbf{a}\right)\triangle\boldsymbol{\theta}^{\dagger}\nabla_{\boldsymbol{\theta}}\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)d^{q}a\right]^{2}\left[\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]^{-1}$$

$$\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}=\int_{\mathbb{A}}\left[\gamma\left(\mathbf{a}\right)\right]^{2}\left[\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{-1}d^{q}a$$

$$\left(\gamma,\gamma^{\prime}\right)_{\boldsymbol{\theta}}=\int_{\mathbb{A}}\gamma^{*}\left(\mathbf{a}\right)\gamma^{\prime}\left(\mathbf{a}\right)\left[\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{-1}d^{q}a=\left(\gamma,\mathcal{D}_{\boldsymbol{\theta}}^{-1}\gamma^{\prime}\right)$$

$$\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}=\sum_{m=1}^{M}\left[\int_{\mathbb{A}}b_{m}\left(\mathbf{a}\right)\gamma\left(\mathbf{a}\right)d^{q}a\right]^{2}\left[\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]^{-1}.$$

$$\left(\mathcal{B}\gamma\right)_{m}=\int_{\mathbb{A}}b_{m}\left(\mathbf{a}\right)\gamma\left(\mathbf{a}\right)d^{q}a$$

$$\mathcal{B}^{\dagger}\mathbf{g}=\sum_{m=1}^{M}g_{m}b_{m}\left(\mathbf{a}\right).$$

$$\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}=\sum_{m=1}^{M}\left(\mathcal{B}\gamma\right)_{m}^{2}\left[\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]^{-1}.$$

$$\left(\mathbf{g},\mathbf{g}^{\prime}\right)_{\boldsymbol{\theta}}=\sum_{m=1}^{M}g_{m}^{*}g_{m}^{\prime}\left[\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]^{-1}=\left(\mathbf{g},\mathbf{D}_{\boldsymbol{\theta}}^{-1}\mathbf{g}^{\prime}\right)$$

$$\left(\mathbf{g},\mathcal{B}\gamma^{\prime}\right)_{\boldsymbol{\theta}}=\left(\mathbf{g},\mathbf{D}_{\boldsymbol{\theta}}^{-1}\mathcal{B}\gamma^{\prime}\right)=\left(\mathcal{B}^{\dagger}\mathbf{D}_{\boldsymbol{\theta}}^{-1}\mathbf{g},\gamma^{\prime}\right)=\left(\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\mathbf{D}_{\boldsymbol{\theta}}^{-1}\mathbf{g},\mathcal{D}_{\boldsymbol{\theta}}^{-1}\gamma^{\prime}\right)=\left(\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\mathbf{D}_{\boldsymbol{\theta}}^{-1}\mathbf{g},\gamma^{\prime}\right)_{\boldsymbol{\theta}}$$

$$\mathcal{B}^{+}=\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\mathbf{D}_{\boldsymbol{\theta}}^{-1}\left(\mathcal{B}\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\mathbf{D}_{\boldsymbol{\theta}}^{-1}\right)^{-1}=\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\left(\mathcal{B}\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\right)^{-1}$$

$$\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\mathbf{g}\left(\mathbf{a}\right)=\mathcal{D}_{\boldsymbol{\theta}}\left\{ \sum_{m^{\prime}=1}^{M}g_{m^{\prime}}b_{m^{\prime}}\left(\mathbf{a}\right)\right\} =\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\sum_{m^{\prime}=1}^{M}g_{m^{\prime}}b_{m^{\prime}}\left(\mathbf{a}\right)$$

$$\left(\mathcal{B}\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\mathbf{g}\right)_{m}=g_{m}\int_{\mathbb{A}}\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)b_{m}\left(\mathbf{a}\right)d^{q}a=g_{m}\bar{g}_{m}\left(\boldsymbol{\theta}\right).$$

$$\gamma_{1}\left(\mathbf{a}\right)=\mathcal{B}^{+}\mathcal{B}\gamma\left(\mathbf{a}\right)=\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\sum_{m=1}^{M}\left[\frac{\left(\mathcal{B}\gamma\right)_{m}}{\bar{g}_{m}\left(\boldsymbol{\theta}\right)}\right]b_{m}\left(\mathbf{a}\right).$$

$$\left\|\gamma_{1}\right\|_{\boldsymbol{\theta}}^{2}=\int_{\mathbb{A}}\left[\gamma_{1}\left(\mathbf{a}\right)\right]^{2}\left[\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{-1}d^{q}a.$$

$$\left\|\gamma_{1}\right\|_{\boldsymbol{\theta}}^{2}=\sum_{m=1}^{M}\left[\frac{\left(\mathcal{B}\gamma\right)_{m}}{\bar{g}_{m}\left(\boldsymbol{\theta}\right)}\right]^{2}\int_{\mathbb{A}}\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)b_{m}\left(\mathbf{a}\right)d^{q}a.$$

Full text

Abstract

List-mode data is increasingly being used in SPECT and PET imaging, among other imaging modalities. However, there are still many imaging designs that effectively bin list-mode data before image reconstruction or other estimation tasks are performed. Intuitively, the binning operation should result in a loss of information. In this work we show that this is true for Fisher information and provide a computational method for quantifying the information loss. In the end we find that the information loss depends on three factors. The first factor is related to the smoothness of the mean data function for the list-mode data. The second factor is the actual object being imaged. Finally, the third factor is the binning scheme in relation to the other two factors.

1 Introduction

Many imaging systems detect individual particles as they interact with the imaging hardware. These particles are usually photons, but there are also other choices such as neutrons, beta particles and alpha particles. A list-mode imaging system produces an attribute vector for each particle detected. The attribute vector may include spatial position, energy, time or other attributes that can be assigned to the particle [1-13]. When the particles are photons, list-mode systems are also called photon processing systems to indicate that the attributes are estimated from raw detector outputs via some data processing algorithm [14-17]. In this work we are only concerned with the fact that the imaging system produces an attribute vector for each particle, regardless of how these attributes are arrived at.

We may envision the more common type of imaging system, a binned system, as the result of resolving the space of all attribute vectors into a collection of non-overlapping bins. The system then counts how many attribute vectors fall into each bin and produces an integer vector whose dimension is the number of bins. Intuitively, this would seem to result in a loss of information. If we formulate the task of the imaging system as the estimation of a certain number of parameters related to the object being imaged, then we may consider quantifying this loss of information, if indeed there is a loss of information.

If the parameter vector of interest has a known prior distribution, then the Shannon information between the parameter vector and the data may be used as a measure of information for the task at hand. In this case the data processing inequality implies that the Shannon information is not increased by the binning operation, but it does not quantify the loss of information due to binning. In this work we will use the Fisher Information Matrix (FIM) to quantify the information loss due to binning. The FIM does not require a prior distribution on the parameter vector of interest. We will show that the FIM always decreases when list-mode data is binned and provide an expression to calculate the information loss. We will find that the information loss depends on three factors: the smoothness of the mean data function for the list-mode data, the actual object being imaged, and the binning scheme in relation to the other two factors.

2 List mode Fisher information

We will confine our calculations to photon imaging systems where we know that Poisson statistics are applicable. In a list mode imaging system the data is a list of $q$-dimensional attribute vectors $\mathbf{a}_{n}$, one for each photon detected. The photon attributes contained in each of these vectors may include a two dimensional position on the face of a detector, the depth of interaction in a scintillation detector, the energy of the photon, the direction the photon is travelling when detected for a plenoptic array, and polarization parameters. The collection of all possible attribute vectors is attribute space, $\mathbb{A}$. We may arrange the data list into a matrix $\mathbf{A}=\left[\mathbf{a}_{1},\ldots,\mathbf{a}_{N}\right]$ and, for a fixed exposure time, the conditional probability distribution function (PDF) for the list is given by

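$$pr\left(\mathbf{A}|\boldsymbol{\theta}\right)=\frac{\left[\bar{N}\left(\boldsymbol{\theta}\right)\right]^{N}}{N!}\exp\left[-\bar{N}\left(\boldsymbol{\theta}\right)\right]\left[\prod_{n=1}^{N}pr\left(\mathbf{a}_{n}|\boldsymbol{\theta}\right)\right],$$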

where $\boldsymbol{\theta}$ is a $p$ -dimensional parameter vector describing the object being imaged and $pr\left(\mathbf{a}|\boldsymbol{\theta}\right)$ is the attribute space conditional PDF determined by $\boldsymbol{\theta}$ . The specific form for $pr\left(\mathbf{a}|\boldsymbol{\theta}\right)$ depends on the imaging system. The FIM with respect to $\boldsymbol{\theta}$ for list mode data is defined by

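$$\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)=\left\langle \left[\nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{A}|\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{A}|\boldsymbol{\theta}\right)\right]^{\dagger}\right\rangle_{\mathbf{A}|\boldsymbol{\theta}}.$$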

Using the specific form for $pr\left(\mathbf{A}|\boldsymbol{\theta}\right)$ the list mode FIM can be written as

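$$\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)=\bar{N}\left(\boldsymbol{\theta}\right)\left\{ \left\langle \left[\nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{\dagger}\right\rangle_{\mathbf{a}|\boldsymbol{\theta}}+\left[\nabla_{\boldsymbol{\theta}}\ln\bar{N}\left(\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln\bar{N}\left(\boldsymbol{\theta}\right)\right]^{\dagger}\right\} .$$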

This is a $p\times p$ matrix which figures prominently in the task of estimating $\boldsymbol{\theta}$ from the data list $\mathbf{A}$ via the Cramer-Rao bound [18]. As we will discuss further below, the FIM is also related to the performance of an ideal observer using the data list $\mathbf{A}$ for the task of detecting a change in the parameter vector from $\boldsymbol{\theta}$ to $\boldsymbol{\theta}+\triangle\boldsymbol{\theta}$ [19,20].

3 Binned Fisher information

List mode data can also be described as a Poisson Point Process [21] on attribute space via the generalized function $g\left(\mathbf{a}\right)$ given by

$$g\left(\mathbf{a}\right)=\sum_{n=1}^{N}\delta\left(\mathbf{a}-\mathbf{a}_{n}\right).$$

If we introduce binning functions $b_{m}\left(\mathbf{a}\right)$ for $m=1,\ldots,M$ , then we get the components

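$$g_{m}=\sum_{n=1}^{N}b_{m}\left(\mathbf{a}_{n}\right)=\int_{\mathbb{A}}g\left(\mathbf{a}\right)b_{m}\left(\mathbf{a}\right)d^{q}a$$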

of a binned $M$ -dimensional data vector $\mathbf{g}$ . We will assume that the functions $b_{m}\left(\mathbf{a}\right)$ are binary with non-overlapping supports, so that $b_{m}\left(\mathbf{a}\right)b_{m^{\prime}}\left(\mathbf{a}\right)=\delta_{mm^{\prime}}b_{m}\left(\mathbf{a}\right)$ , and that they cover all of attribute space, i.e. for all $\mathbf{a}$ in $\mathbb{A}$ we have $b_{1}\left(\mathbf{a}\right)+\ldots+b_{M}\left(\mathbf{a}\right)=1.$ The PDF for the binned data vector is multivariate Poisson:

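$$pr\left(\mathbf{g}|\boldsymbol{\theta}\right)=\prod_{m=1}^{M}\frac{\left[\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]^{g_{m}}}{g_{m}!}\exp\left[-\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]$$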

with $\bar{g}_{m}\left(\boldsymbol{\theta}\right)=\left\langle g_{m}\right\rangle_{\mathbf{A}|\boldsymbol{\theta}}.$ The binned FIM is defined by

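$$\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)=\left\langle \left[\nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{g}|\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{g}|\boldsymbol{\theta}\right)\right]^{\dagger}\right\rangle .$$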

This matrix is relevant to the task of estimating $\boldsymbol{\theta}$ from the data vector $\mathbf{g}$ via the corresponding Cramer-Rao bound. As above, this FIM is also related to the performance of an ideal observer using the data vector $\mathbf{g}$ for the task of detecting a change in the parameter vector from $\boldsymbol{\theta}$ to $\boldsymbol{\theta}+\triangle\boldsymbol{\theta}$. Intuitively we expect better performance on the estimation task or the detection task with the list mode data than with the binned data, since there is an obvious loss of information about each photon in the transition from list mode to binned data. In the following we will show that this is true and derive an equation that quantifies this loss of information using the FIMs for the two data types.
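The passage from a list of attributes to a binned count vector can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the one-dimensional attribute space $[0,1)$, the function `mean_rate`, the parameter values, and the equal-width bins are all hypothetical choices that do not come from the text.

```python
import numpy as np

rng = np.random.default_rng(0)

def mean_rate(a, theta):
    """Hypothetical mean data function g_bar(a | theta) on [0, 1)."""
    amp, center = theta
    return amp * (1.0 + 0.5 * np.cos(2 * np.pi * (a - center)))

def sample_list_mode(theta, rng):
    """Draw N ~ Poisson(N_bar) attributes with density g_bar / N_bar."""
    amp, _ = theta
    n_bar = amp                      # integral of mean_rate over [0, 1) equals amp
    n = rng.poisson(n_bar)
    g_max = 1.5 * amp                # upper bound of mean_rate, used for rejection sampling
    events = []
    while len(events) < n:
        a = rng.random()
        if rng.random() * g_max <= mean_rate(a, theta):
            events.append(a)
    return np.array(events)

def bin_events(events, num_bins):
    """g_m = sum_n b_m(a_n) for non-overlapping, equal-width bins covering [0, 1)."""
    edges = np.linspace(0.0, 1.0, num_bins + 1)
    counts, _ = np.histogram(events, bins=edges)
    return counts

events = sample_list_mode((500.0, 0.3), rng)   # list-mode data: one attribute per photon
g = bin_events(events, num_bins=8)             # binned data: one count per bin
```

The list `events` keeps every attribute value, while `g` keeps only how many fell in each bin, which is the information reduction that the following sections quantify.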

4 Relation between the two FIMs

The attribute-space PDF can be written in terms of the conditional mean of the Poisson Point Process $\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)=\left\langle g\left(\mathbf{a}\right)\right\rangle_{\mathbf{A}|\boldsymbol{\theta}}$ via the equation $\bar{N}\left(\boldsymbol{\theta}\right)pr\left(\mathbf{a}|\boldsymbol{\theta}\right)=\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)$ , where

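$$\bar{N}\left(\boldsymbol{\theta}\right)=\int_{\mathbb{A}}\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)d^{q}a=\sum_{m=1}^{M}\bar{g}_{m}\left(\boldsymbol{\theta}\right).$$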

Now we may write the list mode FIM as

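$$\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)=\bar{N}\left(\boldsymbol{\theta}\right)\left\langle \left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{\dagger}\right\rangle_{\mathbf{a}|\boldsymbol{\theta}},$$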

and this is the same as the integral expression

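$$\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)=\int_{\mathbb{A}}\left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{\dagger}\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)d^{q}a.$$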

Thus the list mode FIM is determined entirely by the conditional mean function $\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)$ .
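The step from the form of the list mode FIM containing both $pr\left(\mathbf{a}|\boldsymbol{\theta}\right)$ and $\bar{N}\left(\boldsymbol{\theta}\right)$ to the form containing only $\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)$ can be filled in as follows. This is a sketch that assumes enough regularity to interchange the gradient with the integral over attribute space. Since $\bar{g}=\bar{N}\,pr$, the logarithmic gradient splits into two terms, and the score of a normalized PDF has zero mean:

$$\nabla_{\boldsymbol{\theta}}\ln\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)=\nabla_{\boldsymbol{\theta}}\ln\bar{N}\left(\boldsymbol{\theta}\right)+\nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{a}|\boldsymbol{\theta}\right),\qquad\left\langle \nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{a}|\boldsymbol{\theta}\right)\right\rangle_{\mathbf{a}|\boldsymbol{\theta}}=\nabla_{\boldsymbol{\theta}}\int_{\mathbb{A}}pr\left(\mathbf{a}|\boldsymbol{\theta}\right)d^{q}a=\mathbf{0}.$$

The cross terms in the outer product therefore vanish under the expectation, which leaves

$$\left\langle \left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{\dagger}\right\rangle_{\mathbf{a}|\boldsymbol{\theta}}=\left\langle \left[\nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln pr\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{\dagger}\right\rangle_{\mathbf{a}|\boldsymbol{\theta}}+\left[\nabla_{\boldsymbol{\theta}}\ln\bar{N}\left(\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln\bar{N}\left(\boldsymbol{\theta}\right)\right]^{\dagger}.$$

Multiplying by $\bar{N}\left(\boldsymbol{\theta}\right)$ and writing the expectation as an integral against $pr\left(\mathbf{a}|\boldsymbol{\theta}\right)=\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)/\bar{N}\left(\boldsymbol{\theta}\right)$ then gives the integral expression weighted by $\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)$.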

Meanwhile, we have the relation between conditional means

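$$\bar{g}_{m}\left(\boldsymbol{\theta}\right)=\int_{\mathbb{A}}b_{m}\left(\mathbf{a}\right)\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)d^{q}a$$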

and we can define a finite conditional probability distribution $Pr\left(m|\boldsymbol{\theta}\right)$ on $\left\{1,\ldots,M\right\}$ via $\bar{N}\left(\boldsymbol{\theta}\right)Pr\left(m|\boldsymbol{\theta}\right)=\bar{g}_{m}\left(\boldsymbol{\theta}\right)$ . Now the binned FIM is given by an expectation with respect to this finite probability distribution

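$$\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)=\bar{N}\left(\boldsymbol{\theta}\right)\left\langle \left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]^{\dagger}\right\rangle_{m|\boldsymbol{\theta}}.$$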

Notice the similarity with the corresponding expectation expression for the list mode FIM. The only difference is that a PDF for the attribute vector $\mathbf{a}$ has been replaced by a finite probability distribution for the bin index $m.$ The binned FIM can also be written as

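$$\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)=\sum_{m=1}^{M}\left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]\left[\nabla_{\boldsymbol{\theta}}\ln\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]^{\dagger}\bar{g}_{m}\left(\boldsymbol{\theta}\right).$$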

Thus, from one viewpoint, we get the binned FIM by using the bin functions $b_{m}\left(\mathbf{a}\right)$ to numerically perform the integration in the list mode FIM. It is not obvious at this point that this numerical procedure will always produce a lower value for the FIM.
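A small numerical experiment can make the comparison concrete. The sketch below evaluates the integral form of the list mode FIM on a fine grid and the sum form of the binned FIM for equal-width bins. The mean data function, its two parameters (amplitude and center), and the grid sizes are hypothetical choices for illustration, not taken from the text.

```python
import numpy as np

def mean_and_grad(a, theta):
    """Hypothetical g_bar(a | theta) on [0, 1) and its gradient with respect to theta."""
    amp, center = theta
    phase = 2 * np.pi * (a - center)
    g_bar = amp * (1.0 + 0.5 * np.cos(phase))
    grad = np.stack([1.0 + 0.5 * np.cos(phase),      # d g_bar / d amp
                     amp * np.pi * np.sin(phase)])   # d g_bar / d center
    return g_bar, grad

def fisher_matrices(theta, num_bins, cells_per_bin=2000):
    """Return (F_LM, F_B), with a midpoint grid standing in for the continuous integral."""
    k = num_bins * cells_per_bin
    da = 1.0 / k
    a = (np.arange(k) + 0.5) * da
    g_bar, grad = mean_and_grad(a, theta)
    # List mode: integral of [grad g_bar][grad g_bar]^T / g_bar over attribute space.
    f_lm = (grad / g_bar) @ grad.T * da
    # Binned: integrate g_bar and its gradient over each bin first, then sum over bins.
    g_m = g_bar.reshape(num_bins, cells_per_bin).sum(axis=1) * da
    grad_m = grad.reshape(2, num_bins, cells_per_bin).sum(axis=2) * da
    f_b = (grad_m / g_m) @ grad_m.T
    return f_lm, f_b

f_lm, f_b = fisher_matrices((500.0, 0.3), num_bins=8)
loss = f_lm - f_b
min_eigenvalue = np.linalg.eigvalsh(loss).min()
```

For this particular model the difference `loss` has no negative eigenvalues beyond rounding error. The amplitude entry `loss[0, 0]` is essentially zero while the center entry `loss[1, 1]` is positive, so in this sketch the binning costs information about position but not about overall amplitude.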

The ideal observer detectability $d\left(\boldsymbol{\theta}_{0},\boldsymbol{\theta}_{1}\right)$ for the task of detecting a change in the parameter vector from $\boldsymbol{\theta}_{0}$ to $\boldsymbol{\theta}_{1}$ is defined by

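$$AUC\left(\boldsymbol{\theta}_{0},\boldsymbol{\theta}_{1}\right)=\frac{1}{2}+\frac{1}{2}\mathrm{erf}\left[\frac{1}{2}d\left(\boldsymbol{\theta}_{0},\boldsymbol{\theta}_{1}\right)\right],$$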

where $AUC\left(\boldsymbol{\theta}_{0},\boldsymbol{\theta}_{1}\right)$ is the area under the ROC curve for the ideal observer. It has been shown that, to lowest order, $d^{2}\left(\boldsymbol{\theta},\boldsymbol{\theta}+\triangle\boldsymbol{\theta}\right)=\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}+\ldots,$ where $\mathbf{F}\left(\boldsymbol{\theta}\right)$ is the FIM for the conditional PDF of the data. Thus the scalar

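$$\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}=\int_{\mathbb{A}}\left[\triangle\boldsymbol{\theta}^{\dagger}\nabla_{\boldsymbol{\theta}}\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{2}\left[\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{-1}d^{q}a$$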

gives the square of the approximate ideal-observer detectability for this task when we use list mode data. Similarly, the scalar

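$$\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}=\sum_{m=1}^{M}\left[\int_{\mathbb{A}}b_{m}\left(\mathbf{a}\right)\triangle\boldsymbol{\theta}^{\dagger}\nabla_{\boldsymbol{\theta}}\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)d^{q}a\right]^{2}\left[\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]^{-1}$$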

gives the square of the approximate ideal-observer detectability for a small change in the parameter vector from $\boldsymbol{\theta}$ to $\boldsymbol{\theta}+\triangle\boldsymbol{\theta}$ if we are using binned data. We will show that $\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}\geq\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}$ for all $\boldsymbol{\theta}$ and $\triangle\boldsymbol{\theta}$ . By definition, this then implies that $\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)\geq\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)$ as matrices for all $\boldsymbol{\theta}.$
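The relation between a FIM, the approximate detectability, and the AUC can be sketched as below. The function names and the example matrix are hypothetical. The detectability uses only the lowest-order term $\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}\triangle\boldsymbol{\theta}$, so it is meaningful only for small $\triangle\boldsymbol{\theta}$.

```python
import math
import numpy as np

def approx_detectability(fim, dtheta):
    """Lowest-order detectability: sqrt(dtheta^T F dtheta)."""
    dtheta = np.asarray(dtheta, dtype=float)
    return math.sqrt(float(dtheta @ fim @ dtheta))

def auc_from_detectability(d):
    """AUC = 1/2 + 1/2 erf(d / 2), the definition of d for the ideal observer."""
    return 0.5 + 0.5 * math.erf(0.5 * d)

fim = np.diag([4.0, 1.0])                  # hypothetical 2x2 FIM, for illustration only
d = approx_detectability(fim, [0.5, 0.0])
auc = auc_from_detectability(d)
```

A detectability of zero corresponds to an AUC of 1/2 (chance performance), and any reduction of the quadratic form caused by binning lowers the AUC through the monotonic error function.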

To simplify the calculations we will define $\gamma\left(\mathbf{a}\right)=\triangle\boldsymbol{\theta}^{\dagger}\nabla_{\boldsymbol{\theta}}\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)$ and suppress the fact that this function also depends on $\boldsymbol{\theta}$ and $\triangle\boldsymbol{\theta}$ , since these vectors are fixed for the purposes of this computation. Then we have

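$$\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}=\int_{\mathbb{A}}\left[\gamma\left(\mathbf{a}\right)\right]^{2}\left[\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{-1}d^{q}a.$$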

This expression suggests that, for fixed $\boldsymbol{\theta}$ , we define a weighted Hilbert space inner product for functions on attribute space via

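$$\left(\gamma,\gamma^{\prime}\right)_{\boldsymbol{\theta}}=\int_{\mathbb{A}}\gamma^{*}\left(\mathbf{a}\right)\gamma^{\prime}\left(\mathbf{a}\right)\left[\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{-1}d^{q}a=\left(\gamma,\mathcal{D}_{\boldsymbol{\theta}}^{-1}\gamma^{\prime}\right),$$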

where $\mathcal{D}_{\boldsymbol{\theta}}^{-1}\gamma^{\prime}\left(\mathbf{a}\right)=\gamma^{\prime}\left(\mathbf{a}\right)\left[\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{-1}$ . The list-mode approximate detectability is then given by the corresponding weighted Hilbert-space norm as $\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}=\left\|\gamma\right\|_{\boldsymbol{\theta}}^{2}$ .

For the binned data we have the summation

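$$\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}=\sum_{m=1}^{M}\left[\int_{\mathbb{A}}b_{m}\left(\mathbf{a}\right)\gamma\left(\mathbf{a}\right)d^{q}a\right]^{2}\left[\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]^{-1}.$$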

We define the binning operator $\mathcal{B}$ by

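$$\left(\mathcal{B}\gamma\right)_{m}=\int_{\mathbb{A}}b_{m}\left(\mathbf{a}\right)\gamma\left(\mathbf{a}\right)d^{q}a$$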

and the ordinary Hilbert space adjoint of this operator by

$$\mathcal{B}^{\dagger}\mathbf{g}=\sum_{m=1}^{M}g_{m}b_{m}\left(\mathbf{a}\right).$$

Then we have a simpler looking expression

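$$\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}=\sum_{m=1}^{M}\left(\mathcal{B}\gamma\right)_{m}^{2}\left[\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]^{-1}.$$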

This expression suggests introducing a weighted inner product in the $M$ -dimensional data space by

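$$\left(\mathbf{g},\mathbf{g}^{\prime}\right)_{\boldsymbol{\theta}}=\sum_{m=1}^{M}g_{m}^{*}g_{m}^{\prime}\left[\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]^{-1}=\left(\mathbf{g},\mathbf{D}_{\boldsymbol{\theta}}^{-1}\mathbf{g}^{\prime}\right),$$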

where $\mathbf{D}_{\boldsymbol{\theta}}^{-1}$ is a diagonal $M\times M$ matrix with the numbers $\left[\bar{g}_{m}\left(\boldsymbol{\theta}\right)\right]^{-1}$ along the diagonal. With this notation the binned approximate detectability is given by the weighted norm $\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}=\left\|\mathcal{B}\gamma\right\|_{\boldsymbol{\theta}}^{2}$. Thus both $\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}$ and $\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}$ are now expressed as weighted Hilbert space norms of the function $\gamma$ and the vector $\mathcal{B}\gamma$, respectively.

We can now think of the binning operator as a map between two weighted Hilbert spaces: $\mathcal{B}:L_{\boldsymbol{\theta}}^{2}\left(\mathbb{A}\right)\longrightarrow\mathbb{R}_{\boldsymbol{\theta}}^{M}$ . As a first step we want to find the pseudoinverse of this operator. We begin by finding the adjoint of this operator. Note that this is not the “ordinary adjoint” described above. The relevant calculation for this adjoint is given by

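$$\left(\mathbf{g},\mathcal{B}\gamma^{\prime}\right)_{\boldsymbol{\theta}}=\left(\mathbf{g},\mathbf{D}_{\boldsymbol{\theta}}^{-1}\mathcal{B}\gamma^{\prime}\right)=\left(\mathcal{B}^{\dagger}\mathbf{D}_{\boldsymbol{\theta}}^{-1}\mathbf{g},\gamma^{\prime}\right)=\left(\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\mathbf{D}_{\boldsymbol{\theta}}^{-1}\mathbf{g},\mathcal{D}_{\boldsymbol{\theta}}^{-1}\gamma^{\prime}\right)=\left(\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\mathbf{D}_{\boldsymbol{\theta}}^{-1}\mathbf{g},\gamma^{\prime}\right)_{\boldsymbol{\theta}}.$$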

Thus $\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\mathbf{D}_{\boldsymbol{\theta}}^{-1}$ is the adjoint operator we are looking for. The pseudoinverse of $\mathcal{B}$ , as an operator between the weighted Hilbert spaces, is then given by

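$$\mathcal{B}^{+}=\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\mathbf{D}_{\boldsymbol{\theta}}^{-1}\left(\mathcal{B}\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\mathbf{D}_{\boldsymbol{\theta}}^{-1}\right)^{-1}=\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\left(\mathcal{B}\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\right)^{-1}.$$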

If we look at this expression in detail we first note that

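$$\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\mathbf{g}\left(\mathbf{a}\right)=\mathcal{D}_{\boldsymbol{\theta}}\left\{ \sum_{m^{\prime}=1}^{M}g_{m^{\prime}}b_{m^{\prime}}\left(\mathbf{a}\right)\right\} =\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\sum_{m^{\prime}=1}^{M}g_{m^{\prime}}b_{m^{\prime}}\left(\mathbf{a}\right).$$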

Now implementing the binning operator, and using the properties of the binning functions, gives us, in component form,

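$$\left(\mathcal{B}\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}\mathbf{g}\right)_{m}=g_{m}\int_{\mathbb{A}}\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)b_{m}\left(\mathbf{a}\right)d^{q}a=g_{m}\bar{g}_{m}\left(\boldsymbol{\theta}\right).$$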

Therefore we find that $\mathcal{B}\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B^{\dagger}}=\mathbf{D}_{\boldsymbol{\theta}}$ . Now we have a simplified version of the needed pseudoinverse: $\mathcal{B}^{+}=\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B^{\dagger}}\mathbf{D}_{\boldsymbol{\theta}}^{-1}$ .
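The identity $\mathcal{B}\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B}^{\dagger}=\mathbf{D}_{\boldsymbol{\theta}}$ can be checked on a discretized attribute space, where functions become vectors and operators become matrices. The sketch below assumes a one-dimensional attribute space $[0,1)$, a hypothetical mean data function, and equal-width bins. The grid is only a stand-in for the continuous space.

```python
import numpy as np

num_bins, cells_per_bin = 4, 50
k = num_bins * cells_per_bin
da = 1.0 / k
a = (np.arange(k) + 0.5) * da
g_bar = 100.0 * (1.0 + 0.5 * np.cos(2 * np.pi * a))    # hypothetical g_bar(a | theta)

# indicator[m, j] = b_m(a_j): binary, non-overlapping, covering the whole grid
indicator = np.kron(np.eye(num_bins), np.ones((1, cells_per_bin)))

B = indicator * da             # (B gamma)_m = sum_j b_m(a_j) gamma(a_j) da
B_adj = indicator.T            # ordinary adjoint: (B^dagger g)(a_j) = sum_m g_m b_m(a_j)
D_func = np.diag(g_bar)        # multiplication by g_bar(a) on functions
D_mat = np.diag(B @ g_bar)     # diagonal matrix of the bin means g_bar_m

lhs = B @ D_func @ B_adj                         # should equal D_mat
B_pinv = D_func @ B_adj @ np.linalg.inv(D_mat)   # simplified pseudoinverse
```

In this discretization `lhs` reproduces `D_mat`, and `B @ B_pinv` is the $M\times M$ identity, as expected for a right inverse of the binning operator.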

We may decompose the function $\gamma$ into two components $\gamma=\gamma_{1}+\gamma_{0}$ , where $\gamma_{0}$ is a null function with respect to the binning operator, i.e. $\mathcal{B}\gamma_{0}=\mathbf{0}$ , and we have the orthogonality condition $\left(\gamma_{1},\gamma_{0}\right)_{\boldsymbol{\theta}}=0.$ The component $\gamma_{1}$ is given by $\gamma_{1}=\mathcal{B}^{+}\mathcal{B}\gamma$ . Therefore we have $\gamma_{1}=\mathcal{D}_{\boldsymbol{\theta}}\mathcal{B^{\dagger}}\mathbf{D}_{\boldsymbol{\theta}}^{-1}\mathcal{B}\gamma$ . Writing this equation out in detail we have

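$$\gamma_{1}\left(\mathbf{a}\right)=\mathcal{B}^{+}\mathcal{B}\gamma\left(\mathbf{a}\right)=\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\sum_{m=1}^{M}\left[\frac{\left(\mathcal{B}\gamma\right)_{m}}{\bar{g}_{m}\left(\boldsymbol{\theta}\right)}\right]b_{m}\left(\mathbf{a}\right).$$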

The null component of $\gamma$ is then defined by $\gamma_{0}\left(\mathbf{a}\right)=\gamma\left(\mathbf{a}\right)-\gamma_{1}\left(\mathbf{a}\right)$ , and due to the orthogonality condition we have $\left\|\gamma\right\|_{\boldsymbol{\theta}}^{2}=\left\|\gamma_{1}\right\|_{\boldsymbol{\theta}}^{2}+\left\|\gamma_{0}\right\|_{\boldsymbol{\theta}}^{2}$ .

Now we examine the square magnitude, in the weighted Hilbert space, of the $\gamma_{1}$ component of $\gamma$ :

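$$\left\|\gamma_{1}\right\|_{\boldsymbol{\theta}}^{2}=\int_{\mathbb{A}}\left[\gamma_{1}\left(\mathbf{a}\right)\right]^{2}\left[\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]^{-1}d^{q}a.$$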

Substituting our expression for $\gamma_{1}\left(\mathbf{a}\right)$ and then using the properties of the binning functions, we find that

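$$\left\|\gamma_{1}\right\|_{\boldsymbol{\theta}}^{2}=\sum_{m=1}^{M}\left[\frac{\left(\mathcal{B}\gamma\right)_{m}}{\bar{g}_{m}\left(\boldsymbol{\theta}\right)}\right]^{2}\int_{\mathbb{A}}\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)b_{m}\left(\mathbf{a}\right)d^{q}a.$$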

After performing the integration we find that $\left\|\gamma_{1}\right\|_{\boldsymbol{\theta}}^{2}=\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}$ .
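The decomposition $\gamma=\gamma_{1}+\gamma_{0}$ and the identity $\left\|\gamma_{1}\right\|_{\boldsymbol{\theta}}^{2}=\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}$ can be verified numerically. In the sketch below the functions `g_bar` and `gamma` are hypothetical, and a fine midpoint grid on $[0,1)$ stands in for attribute space.

```python
import numpy as np

num_bins, cells_per_bin = 6, 400
k = num_bins * cells_per_bin
da = 1.0 / k
a = (np.arange(k) + 0.5) * da

g_bar = 200.0 * (1.0 + 0.5 * np.cos(2 * np.pi * a))   # hypothetical g_bar(a | theta)
gamma = 300.0 * np.sin(2 * np.pi * a) + 20.0           # hypothetical gamma(a)

def bin_integrate(f):
    """Binning operator B: integrate f over each equal-width bin."""
    return f.reshape(num_bins, cells_per_bin).sum(axis=1) * da

def weighted_inner(u, v):
    """Weighted inner product (u, v)_theta = integral of u v / g_bar."""
    return np.sum(u * v / g_bar) * da

g_m = bin_integrate(g_bar)
b_gamma = bin_integrate(gamma)

# gamma_1(a) = g_bar(a) * sum_m [(B gamma)_m / g_bar_m] b_m(a)
gamma_1 = g_bar * np.repeat(b_gamma / g_m, cells_per_bin)
gamma_0 = gamma - gamma_1

binned_quadratic = np.sum(b_gamma**2 / g_m)        # dtheta^T F_B dtheta
list_mode_quadratic = weighted_inner(gamma, gamma)  # dtheta^T F_LM dtheta
```

Here `bin_integrate(gamma_0)` vanishes to rounding error, `weighted_inner(gamma_1, gamma_0)` is zero, and `weighted_inner(gamma_1, gamma_1)` matches `binned_quadratic`. The remainder `weighted_inner(gamma_0, gamma_0)` is what the list-mode quadratic form has in excess of the binned one.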

Now we can find the difference between the list-mode and binned approximate detectabilities

[TABLE]

Using the definition of $\gamma\left(\mathbf{a}\right)$ we have the final result

[TABLE]

Since $\triangle\boldsymbol{\theta}$ is arbitrary, this equation gives us a matrix inequality between FIMs: $\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)\geq\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)$ with equality only if $\left[\triangle\boldsymbol{\theta}^{\dagger}\nabla_{\boldsymbol{\theta}}\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)\right]_{0}=0.$ The equality condition can also be written as

[TABLE]

where $\gamma\left(\mathbf{a}\right)=\triangle\boldsymbol{\theta}^{\dagger}\nabla_{\boldsymbol{\theta}}\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)$ and

[TABLE]

The probability is zero that this condition will be satisfied in any real imaging situation, which means that binning always results in a loss of Fisher information.

Note that the condition for no loss of Fisher information due to binning can be written as

[TABLE]

This then gives us

[TABLE]

Thus each bin contributes an amount to the loss of detectability according to three factors. The first factor is the deviation of the quantity in curly brackets from zero within that bin. The second factor is the value of the mean data function $\bar{g}\left(\mathbf{a}|\boldsymbol{\theta}\right)$ within the bin. The third factor is the size of the bin itself. Therefore the efficiency of any particular choice of bins in preserving Fisher information depends on the actual parameter value $\boldsymbol{\theta}$ as well as the bin sizes. Having derived this relationship it is actually straightforward to prove that it is valid without any discussion of weighted Hilbert spaces. However, the path we followed to get here demonstrates that the loss of Fisher information due to binning is caused by the null space of the binning operator $\mathcal{B}:L_{\boldsymbol{\theta}}^{2}\left(\mathbb{A}\right)\longrightarrow\mathbb{R}_{\boldsymbol{\theta}}^{M}$, when viewed as an operator between weighted Hilbert spaces.
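Because the bins do not overlap, the squared weighted norm of the null component $\gamma_{0}$ splits into one contribution per bin. The sketch below computes these contributions for a nested sequence of equal-width binnings. The functions `g_bar` and `gamma`, the grid, and the bin counts are hypothetical choices for illustration.

```python
import numpy as np

k = 3200                      # fine grid standing in for continuous attribute space [0, 1)
da = 1.0 / k
a = (np.arange(k) + 0.5) * da
g_bar = 200.0 * (1.0 + 0.5 * np.cos(2 * np.pi * a))   # hypothetical mean data function
gamma = 300.0 * np.sin(2 * np.pi * a)                  # hypothetical directional derivative

def per_bin_loss(num_bins):
    """Each bin's contribution to ||gamma_0||^2 in the weighted norm."""
    cells = k // num_bins
    g_m = g_bar.reshape(num_bins, cells).sum(axis=1) * da
    b_gamma = gamma.reshape(num_bins, cells).sum(axis=1) * da
    # Null component: gamma minus its projection gamma_1 = g_bar * (B gamma)_m / g_bar_m
    gamma_0 = gamma - g_bar * np.repeat(b_gamma / g_m, cells)
    return (gamma_0**2 / g_bar).reshape(num_bins, cells).sum(axis=1) * da

# Nested binnings: each bin count splits every bin of the previous scheme in two.
losses = {m: per_bin_loss(m).sum() for m in (1, 2, 4, 8, 16, 32)}
```

For these nested schemes the total loss in `losses` does not increase as the number of bins grows, and the array returned by `per_bin_loss` shows which bins lose the most.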

5 FIMs for object reconstruction

In this section the parameter vector $\boldsymbol{\theta}$ is replaced with a function $f\left(\mathbf{r}\right)$ of spatial coordinates. This complication is mitigated by a linear relation between the object function and mean data function via a linear operator:

[TABLE]

where $S$ is a support region for object functions in a $q$-dimensional space. The gradient operator $\nabla_{\boldsymbol{\theta}}$ is replaced by a functional derivative or Frechet derivative. The FIMs are now Fisher information operators $\mathcal{F}_{LM}$ and $\mathcal{F}_{B}$. The simplicity of the connection between $f\left(\mathbf{r}\right)$ and $\bar{g}\left(\mathbf{a}|f\right)$ makes the functional derivative easy to compute.

The end result for the detectability calculation with list-mode data is then given by

[TABLE]

The weighted inner product for functions on attribute space is now defined by

[TABLE]

With the resulting weighted Hilbert space norm we then have $\left(\triangle f,\mathcal{F}_{LM}\left(f\right)\triangle f\right)=\left\|\mathcal{L}\triangle f\right\|_{f}^{2}.$

The imaging operator for the binned imaging system is given by the concatenation of the list-mode system operator with the binning operator: $\mathcal{H}=\mathcal{B}\mathcal{L}$ . The detectability calculation for the binned system then gives us

[TABLE]

As before we introduce a weighted inner product in data space via

[TABLE]

and we then have $\left(\triangle f,\mathcal{F}_{B}\left(f\right)\triangle f\right)=\left\|\mathcal{H}\triangle f\right\|_{f}^{2}$ .

The relevant operators are now the list mode system operator $\mathcal{L}:L^{2}\left(\mathbb{S}\right)\longrightarrow L_{f}^{2}\left(\mathbb{A}\right)$, the binning operator $\mathcal{B}:L_{f}^{2}\left(\mathbb{A}\right)\longrightarrow\mathbb{R}_{f}^{M}$, and their concatenation into the binned system operator $\mathcal{H}:L^{2}\left(\mathbb{S}\right)\longrightarrow\mathbb{R}_{f}^{M}$. We have the decomposition in $L_{f}^{2}\left(\mathbb{A}\right)$ of the function $\mathcal{L}\triangle f$ as $\mathcal{L}\triangle f=\left(\mathcal{L}\triangle f\right)_{1}+\left(\mathcal{L}\triangle f\right)_{0}$, where $\mathcal{B}\left(\mathcal{L}\triangle f\right)_{0}=\mathbf{0}$ and $\left(\left(\mathcal{L}\triangle f\right)_{1},\left(\mathcal{L}\triangle f\right)_{0}\right)_{f}=0$.

As before we find the adjoint of the binning operator, as an operator between weighted Hilbert spaces, via

[TABLE]

We then have the pseudoinverse of the binning operator $\mathcal{B}^{+}=\mathcal{D}_{f}\mathcal{B^{\dagger}}\mathbf{D}_{f}^{-1}\left(\mathcal{B}\mathcal{D}_{f}\mathcal{B^{\dagger}}\mathbf{D}_{f}^{-1}\right)^{-1}$ , which simplifies to $\mathcal{B}^{+}=\mathcal{D}_{f}\mathcal{B^{\dagger}}\left(\mathcal{B}\mathcal{D}_{f}\mathcal{B^{\dagger}}\right)^{-1}$ . Computing the operator in parentheses in this last expression leads to

[TABLE]

Examining this equation componentwise then gives us

[TABLE]

Therefore we have $\mathcal{B}\mathcal{D}_{f}\mathcal{B^{\dagger}}=\mathbf{D}_{f}$ and the needed pseudoinverse is given by $\mathcal{B}^{+}=\mathcal{D}_{f}\mathcal{B^{\dagger}}\mathbf{D}_{f}^{-1}$.

Now we have for the first term in the orthogonal decomposition $\left(\mathcal{L}\triangle f\right)_{1}=\mathcal{B}^{+}\mathcal{B}\mathcal{L}\triangle f$ . If we write this equation out explicitly it becomes

[TABLE]

Then the null component of $\mathcal{L}\triangle f$ with respect to the binning operator in the weighted Hilbert space is $\left(\mathcal{L}\triangle f\right)_{0}\left(\mathbf{a}\right)=\mathcal{L}\triangle f\left(\mathbf{a}\right)-\left(\mathcal{L}\triangle f\right)_{1}\left(\mathbf{a}\right)$. Using the orthogonality of the decomposition we have $\left\|\mathcal{L}\triangle f\right\|_{f}^{2}=\left\|\left(\mathcal{L}\triangle f\right)_{1}\right\|_{f}^{2}+\left\|\left(\mathcal{L}\triangle f\right)_{0}\right\|_{f}^{2}$. The first term in the sum on the right is

[TABLE]

Using the properties of the bin functions we then have

[TABLE]

Thus we have $\left\|\left(\mathcal{L}\triangle f\right)_{1}\right\|_{f}^{2}=\left(\triangle f,\mathcal{F}_{B}\left(f\right)\triangle f\right)$ .

Now we see that the null component $\left(\mathcal{L}\triangle f\right)_{0}$ determines the loss of Fisher information: $\left(\triangle f,\mathcal{F}_{LM}\left(f\right)\triangle f\right)-\left(\triangle f,\mathcal{F}_{B}\left(f\right)\triangle f\right)=\left\|\left(\mathcal{L}\triangle f\right)_{0}\right\|_{f}^{2}$ . Alternatively we can write

[TABLE]

The two approximate detectabilities are equal only if

[TABLE]

This condition implies that for almost all perturbation functions $\triangle f\left(\mathbf{r}\right)$ the list-mode approximate detectability will be greater than the binned approximate detectability.

Note that the condition for no loss of information due to binning can also be written as

[TABLE]

This then gives us

[TABLE]

Thus, as in the case described above for a finite dimensional parameter, each bin contributes to the loss of the detectability of a change in the object function according to three factors. The first factor is again the deviation of the quantity in curly brackets from zero within that bin. The second factor is the value of the function $\mathcal{L}f\left(\mathbf{a}\right)$ within the bin. The third factor is the size of the bin itself. The efficiency of any particular choice of bins in preserving Fisher information depends on the actual object function $f$ as well as bin size.

Finally, note that, as in the previous section, this last equality can be proved directly. Again, the path followed in this derivation shows that the loss of Fisher information about the object function due to binning comes from the null space of $\mathcal{B}:L_{f}^{2}\left(\mathbb{A}\right)\longrightarrow\mathbb{R}_{f}^{M}$ as an operator between weighted Hilbert spaces.

6 Example

For this example, consider the attribute space to be a symmetric interval on the real line: $\mathbb{A}=\left[-L/2,L/2\right]$. The object functions will be square integrable functions of a real variable and the list-mode system operator is convolution with a point spread function (PSF): $\mathcal{L}f\left(x\right)=p\ast f\left(x\right)$. We assume that the point spread function is band limited to the band $\left[-B/2,B/2\right]$.

Now let $M$ and $\triangle x$ be such that $L=M\triangle x$ and define the regularly spaced points in $\mathbb{A}$ via

[TABLE]

and the bin functions as

[TABLE]

We now have the binning operator described by

[TABLE]

The condition for no loss in the approximate detectability by binning is given by

[TABLE]

This condition is impossible to satisfy since the function on the left is band-limited and the function on the right, in general, is not. Thus, even with Nyquist sampling, when $B\triangle x=1$, there is a loss in the detectability of a small change in the object function when we bin the list-mode data. The actual loss of Fisher information for a small change in the object function is given by

[TABLE]

In general, loss of Fisher information is mitigated if $B$ is decreased, since this will mean that $p\ast\triangle f\left(x\right)$ and $p\ast f\left(x\right)$ are smoother functions, and hence there will be a decrease in the quantities in the curly brackets.
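The following is a minimal numerical sketch of this example, not the paper's own computation. It assumes the standard Poisson forms of the two approximate detectabilities, namely $\int\left[p\ast\triangle f\right]^{2}/\left[p\ast f\right]dx$ for list-mode data and $\sum_{m}\left(\int_{\mathrm{bin}\,m}p\ast\triangle f\right)^{2}/\int_{\mathrm{bin}\,m}p\ast f$ for non-overlapping bins of width $\triangle x$. The object `f`, the perturbation `df`, the squared-sinc PSF, and all numerical values are hypothetical choices made for illustration. Truncating the PSF to a finite grid also makes it only approximately band limited.

```python
import numpy as np

# Illustrative values (assumptions, not taken from the paper)
L, B, N = 8.0, 4.0, 1024            # interval length, bandwidth, fine-grid size
dx = L / N
x = (np.arange(N) + 0.5) * dx - L / 2

# Non-negative PSF; the spectrum of sinc^2 is a triangle supported on [-B/2, B/2]
psf = (B / 2) * np.sinc(B * x / 2) ** 2

f = 1.0 + np.exp(-x**2)             # hypothetical positive object function
df = 0.1 * x * np.exp(-x**2)        # hypothetical small perturbation


def blur(h):
    """Riemann-sum approximation of the convolution p * h on the fine grid."""
    return np.convolve(h, psf, mode="same") * dx


def detectabilities(f, df, n_bins):
    """Return (list-mode, binned) approximate detectabilities."""
    Lf, Ldf = blur(f), blur(df)
    d_lm = np.sum(Ldf**2 / Lf) * dx                 # integral over the fine grid
    g = Lf.reshape(n_bins, -1).sum(axis=1) * dx     # mean counts per bin
    dg = Ldf.reshape(n_bins, -1).sum(axis=1) * dx   # change in mean counts per bin
    d_b = np.sum(dg**2 / g)
    return d_lm, d_b


# M = 32 corresponds to B * (L / M) = 1, the Nyquist case in the text
results = {M: detectabilities(f, df, M) for M in (8, 16, 32)}
for M, (d_lm, d_b) in results.items():
    print(f"M={M:3d}  list-mode={d_lm:.6e}  binned={d_b:.6e}  loss={d_lm - d_b:.3e}")
```

Because the three bin layouts are nested, the binned detectability cannot decrease as $M$ grows, and by the Cauchy-Schwarz inequality it never exceeds the list-mode value. Rerunning the sketch with a smaller `B` is one way to explore the smoothness effect described above.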

There is at least one circumstance in this example where there is no loss of Fisher information from binning the list-mode data. If $\triangle f\left(x\right)=\alpha f\left(x\right)$ for some constant $\alpha$, then $\left(\triangle f,\mathcal{F}_{LM}\left(f\right)\triangle f\right)-\left(\triangle f,\mathcal{F}_{B}\left(f\right)\triangle f\right)=0$. This is true even if $M=1$ and $\triangle x=L$. In other words, to detect a simple change in amplitude of the object function we might as well use one bin covering all of $\mathbb{A}$. There may also be other special situations where binning does not create a loss of Fisher information, but for generic functions $f\left(x\right)$ and $\triangle f\left(x\right)$ there will always be a loss.
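The amplitude-change case can be checked numerically with a short sketch. It again assumes the standard Poisson forms of the approximate detectabilities, with a hypothetical object `f`, a hypothetical squared-sinc PSF, and an arbitrary constant `alpha`. None of these values come from the paper.

```python
import numpy as np

# Illustrative values (assumptions, not taken from the paper)
L, B, N = 8.0, 4.0, 1024
dx = L / N
x = (np.arange(N) + 0.5) * dx - L / 2
psf = (B / 2) * np.sinc(B * x / 2) ** 2   # non-negative PSF, spectrum in [-B/2, B/2]

f = 1.0 + np.exp(-x**2)                   # hypothetical positive object function
alpha = 0.05
df = alpha * f                            # pure amplitude change

Lf = np.convolve(f, psf, mode="same") * dx
Ldf = np.convolve(df, psf, mode="same") * dx

# List-mode approximate detectability (integral on the fine grid)
d_lm = np.sum(Ldf**2 / Lf) * dx

# Binned approximate detectability with a single bin covering the interval
g = Lf.sum() * dx
dg = Ldf.sum() * dx
d_one_bin = dg**2 / g

print(d_lm, d_one_bin, alpha**2 * g)
```

Under these assumptions both quantities reduce to $\alpha^{2}\int p\ast f\left(x\right)dx$, so the three printed values should agree up to rounding.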

7 Conclusion

We have shown that there is almost always a loss of Fisher information for any estimation task when list-mode data is binned. This loss of information is due to the null space of the binning operator when it is viewed as an operator between certain parameter dependent weighted Hilbert spaces. The magnitude of the loss can be quantified by finding the null component, with respect to the binning operator, of a directional derivative of the conditional PDF as an element of one of the weighted Hilbert spaces. We found that the information loss depends on the smoothness of the mean data function for the list-mode data, the actual object being imaged, and the binning scheme in relation to the other two factors. We have shown that these conclusions apply even when the estimation problem is an object reconstruction problem, where the finite dimensional parameter vector is replaced with a function in an infinite dimensional Hilbert space.

As a final note, the difference $\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}-\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}$ can be written as

[TABLE]

Therefore we have an expression for the difference $\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)-\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)$ of FIMs:

[TABLE]

Now if we have a nominal value for $\boldsymbol{\theta}$, but there is some uncertainty in this value, then this is equivalent to making $\triangle\boldsymbol{\theta}$ a random vector with zero mean. If the covariance matrix for this vector is $\mathbf{K}_{\boldsymbol{\theta}}$, then the average value of $\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}-\triangle\boldsymbol{\theta}^{\dagger}\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)\triangle\boldsymbol{\theta}$ is $\mathrm{tr}\left\{\mathbf{K}_{\boldsymbol{\theta}}\left[\mathbf{F}_{LM}\left(\boldsymbol{\theta}\right)-\mathbf{F}_{B}\left(\boldsymbol{\theta}\right)\right]\right\}$. This may be a useful quantification of the average loss of Fisher information due to binning in this situation. When $\mathbf{K}_{\boldsymbol{\theta}}=\sigma^{2}\mathbf{I}$ we end up with

[TABLE]

This is a relatively compact expression that can be easily evaluated in many cases.
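As a rough illustration of how such a trace might be evaluated, the sketch below uses a hypothetical two-parameter model on a one-dimensional attribute space: a Gaussian bump of amplitude `theta[0]` centered at `theta[1]`, on a unit background. It assumes the standard Poisson forms of the two FIMs, $\int\nabla\bar{b}\,\nabla\bar{b}^{\dagger}/\bar{b}\,dx$ for list-mode data and $\sum_{m}\nabla\bar{g}_{m}\nabla\bar{g}_{m}^{\dagger}/\bar{g}_{m}$ for binned data. Here $\bar{b}$ and $\bar{g}_{m}$ are illustrative names for the mean data function and its integral over bin $m$. The model, grid, bin count, and `sigma` are all assumptions made for this example.

```python
import numpy as np

# Illustrative values (assumptions, not taken from the paper)
L, N, M = 12.0, 1200, 12                 # interval length, fine-grid size, number of bins
dx = L / N
x = (np.arange(N) + 0.5) * dx - L / 2
theta = np.array([5.0, 0.3])             # nominal amplitude and position


def mean_rate(theta):
    """Hypothetical mean data function: unit background plus a Gaussian bump."""
    amp, pos = theta
    return 1.0 + amp * np.exp(-0.5 * (x - pos) ** 2)


def rate_gradient(theta):
    """Analytic gradient of mean_rate with respect to (amp, pos); shape (2, N)."""
    amp, pos = theta
    bump = np.exp(-0.5 * (x - pos) ** 2)
    return np.stack([bump, amp * (x - pos) * bump])


b = mean_rate(theta)
grad = rate_gradient(theta)

# List-mode FIM: integral of grad grad^T / b over the attribute space
F_lm = (grad / b) @ grad.T * dx

# Binned FIM: same form, with b and its gradient integrated over each bin
g = b.reshape(M, -1).sum(axis=1) * dx               # shape (M,)
grad_g = grad.reshape(2, M, -1).sum(axis=2) * dx    # shape (2, M)
F_b = (grad_g / g) @ grad_g.T

D = F_lm - F_b                            # FIM difference, positive semidefinite
sigma = 0.1
K = sigma**2 * np.eye(2)                  # isotropic uncertainty in theta
avg_loss = np.trace(K @ D)

print("F_LM - F_B =\n", D)
print("average loss tr{K (F_LM - F_B)} =", avg_loss)
```

With an isotropic covariance the average loss is $\sigma^{2}$ times the trace of the FIM difference, which is the quantity the sketch prints last.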

Bibliography


[1] L. Caucci and H. H. Barrett, “Objective assessment of image quality. V. Photon-counting detectors and list-mode data,” JOSA A 29, 1003–1016 (2012).
[2] H. H. Barrett, T. White, and L. C. Parra, “List-mode likelihood,” JOSA A 14, 2914–2923 (1997).
[3] L. Parra and H. H. Barrett, “List-mode likelihood: EM algorithm and image quality estimation demonstrated on 2-D PET,” IEEE Trans. Med. Imag. 17, 228–235 (1998).
[4] P. C. Johns, J. Dubeau, D. G. Gobbi, M. Li, and M. S. Dixit, “Photon-counting detectors for digital radiography and X-ray computed tomography,” in “Opto-Canada: SPIE Regional Meeting on Optoelectronics, Photonics, and Imaging,” (Proc. SPIE TD 01) 367–369 (2002).
[5] P. M. Shikhaliev, T. Xu, and S. Molloi, “Photon counting computed tomography: Concept and initial results,” Med. Phys. 32, 427–436 (2005).
[6] A. J. Reader, S. Ally, F. Bakatselos, R. Manavaki, R. J. Walledge, A. P. Jeavons, P. J. Julyan, S. Zhao, D. L. Hastings, and J. Zweit, “One-pass list-mode EM algorithm for high-resolution 3-D PET image reconstruction into large arrays,” IEEE Trans. Nucl. Sci. 49, 693–699 (2002).
[7] P. Khurd, I.-T. Hsiao, A. Rangarajan, and G. Gindi, “A globally convergent regularized ordered-subset EM algorithm for list-mode reconstruction,” IEEE Trans. Nucl. Sci. 51, 719–725 (2004).
[8] A. J. Reader, K. Erlandsson, R. J. Ott, and M. A. Flower, “Attenuation and scatter correction of list-mode data driven iterative and analytic image reconstruction algorithms for rotating 3D PET systems,” IEEE Trans. Nucl. Sci. 46, 2218–2226 (1999).