Filling of incomplete sinograms from sparse PET detector configurations using a residual U‐Net

Klara Leffler; Luigi Tommaso Luppino; Samuel Kuttner; Karin Söderkvist; Jan Axelsson

PMC · DOI:10.1002/mp.70293·February 27, 2026

Filling of incomplete sinograms from sparse PET detector configurations using a residual U‐Net

Klara Leffler, Luigi Tommaso Luppino, Samuel Kuttner, Karin Söderkvist, Jan Axelsson

PDF

Open Access

TL;DR

This paper introduces a deep learning method to improve PET scans using fewer detectors, reducing costs while maintaining image quality.

Contribution

A modified Residual U-Net is proposed to restore missing PET sinogram data from sparse detector configurations.

Findings

01

The model recovers missing PET sinogram counts with a mean absolute error below two events per pixel.

02

The deep learning approach outperforms 2D interpolation in both sinogram and reconstructed image quality.

03

The method enables cost-effective PET scanner designs with extended field-of-view.

Abstract

Long axial field‐of‐view PET scanners are becoming increasingly available worldwide for clinical and research nuclear medicine examinations, providing an increased field‐of‐view and sensitivity compared to traditional PET scanners. However, a significant cost is associated with manufacturing the densely packed photodetectors required for the extended‐coverage systems. Despite improved performance allowing ultralow dose or ultrafast scans, the financial barrier remains, limiting clinical utilisation. To mitigate the cost limitations, alternative sparse system configurations with strategically placed inter‐detector gaps have been proposed, allowing an extended field‐of‐view PET design with detector costs similar to a standard PET system, albeit at the expense of image quality. To address the challenges posed by sparse detector configurations, particularly the heavy undersampling of PET…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Chemicals1

PennPET

Diseases1

lesion

Figures9

Click any figure to enlarge with its caption.

Example comparison of a standard compact PET configuration (a) and a sparse PET configuration using a chessboard detector pattern (b). For the sake of resolution, the shown mockup scanners comprise of 15 rings with 128 detector elements each. PET, positron emission tomography.

Visualisation of the sinogram distortion due to the sparse PET chessboard configuration. (a) Pixel‐wise correlation between the original and distorted sinograms; figure shows random sample of 105 pixels from the different scans (different colours for different scans). The distorted pixels follow two separate distributions (marked by arrows)—they are either zero (i) or of lower intensity than the original (ii). The ideal fit is illustrated by a solid line, and the overall total fit to the original pixels is illustrated by a dashed line. (b) shows the sinogram distortion pattern with zeroed out pixels in direct planes and cross planes with ring difference >1. (c) shows the sinogram distortion pattern with low‐intensity pixels in summed cross planes with ring difference 1. PET, positron emission tomography.

Architecture of the implemented sinogram restoration network. Each green arrow represents a residual block containing convolution, batch normalisation and rectified linear unit (ReLU) activation, as well as a residual connection. The dotted arrows represent skip connections, and the circled plus signs represent addition of the features from the down‐sampling layers with the up‐sampling layers. The network down‐samples via convolution with stride 2, reducing the spatial dimensions by half, and up‐samples using transpose convolution, doubling the spatial dimensions. Two final convolutions layers are used to generate the output. The sinograms are normalised to range [0:1] before input and denormalised by the same factor after output. Finally, the pixels not affected by the removed detectors are copied from the input sinograms and reinstated in the output.

Visual comparison of 2D sinograms based on the ground truth (Original), the sparse chessboard configuration (Distorted), and predictions from interpolation filling (Interpolated) and from the sinogram restoration network (Restored). The figure includes two different sinogram slices from one scan in the test set. (a) shows a direct plane sinogram slice. (b) shows a summed cross‐plane sinogram slice with ring difference 1.

Pixel‐wise correlation between the original sinograms and the predicted sinograms from (a) interpolation and (b) the restoration network. The figure shows a random sample of 105 pixels from the different scans (different colours for different scans). The ideal fit is illustrated by the solid lines, and the overall total fit to the original pixels is illustrated by the dashed lines. The interpolated pixels exhibit traces of the two separate distributions of the distorted pixels with remaining pixels of lower intensity than the original (marked by arrow).

(a) Structural similarity index measure (SSIM) and (b) mean absolute error (MAE) of sinograms from the sparse chessboard configuration (Distorted), and predictions from interpolation filling (Interpolation) and from the sinogram restoration network (Restored). Note that high values are better for the SSIM, while low values of the MAE are the best. Whiskers, box limits, and inner line indicate 5th and 95th percentiles, 25th and 75th percentiles, and median, respectively. The figure includes results from Mann–Whitney U test for difference between the samples, where * indicates a p‐value below 0.001.

Visual comparison of reconstructed image slices based on original sinograms (Original), sinograms from the sparse chessboard configuration (Distorted), and predicted sinograms from interpolation (Interpolated) and from the restoration network (Restored). (a) and (b) show two different image slices from two different scans in the test set, with reconstructed images on the top vertical panel, corresponding difference images on the middle panel (colour bar in Bq/ml), and corresponding histograms of pixel differences in the bottom panel (x‐axis in Bq/ml).

Pixel‐wise correlation between the original images and images reconstructed based on the predicted sinograms from the restoration network from (a) interpolation and (b) the restoration network. The figure shows a random sample of 105 pixels from the different scans (different colours for different scans). The ideal fit is illustrated by the solid lines, and the overall total fit to the original pixels is illustrated by the dashed lines. The interpolated image pixels indicate two separate distributions of the image pixels with either lower (i) or higher (ii) intensity than the original (marked by arrows).

(a) Structural similarity index measure (SSIM) and (b) mean absolute error (MAE) of reconstructed images based on the sinograms from the sparse chessboard configuration (Distorted), and predictions from interpolation filling (Interpolation) and from the sinogram restoration network (Restored). Note that high values are better for the SSIM, while low values of the MAE are the best. Whiskers, box limits, and inner line indicate 5th and 95th percentiles, 25th and 75th percentiles, and median, respectively. Figure includes results from Mann–Whitney U test for difference between the samples, where * indicates a p‐value below 0.001.

Funding1

—Swedish Research Council10.13039/501100004359

Keywords

deep learningsparse PETsinogram restoration

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMedical Imaging Techniques and Applications · Radiation Detection and Scintillator Technologies · Medical Image Segmentation Techniques

Full text

INTRODUCTION

1

Positron emission tomography (PET) imaging has become a cornerstone in medical diagnostics. Technology‐driven developments such as iterative image reconstruction and increased computational power have improved performance and paved the way for constructing long axial field‐of‐view (FOV) “total” body PET systems. Current commercial models like the United Imaging uExplorer^TM^ total‐body PET scanner and the Siemens Biograph Vision Quadra^TM^ nearly total‐body PET scanner, offering axial FOVs of up to 194 and 106 cm, respectively, are examples of this latest technological leap.1, 2 The updated PennPET Explorer further exemplifies this trend towards extended FOV.3, 4, 5 These advancements promise benefits like lower radioactive doses, shorter scan times, and reduced image noise. However, they come at a significant cost, primarily due to the increased number of detectors required.6, 7, 8, 9 Therefore, the traditional cylindrical configuration of PET scanners, characterised by densely packed photodetectors, presents substantial financial limitations in the transition to extended FOV PET imaging.

A possible, cost‐effective solution is to introduce gaps between detector elements, either in the axial or transaxial direction or a combination thereof. Such a system geometry potentially extends detector coverage without requiring additional crystal material, thereby providing an extended FOV scanner with reduced material requirements compared to conventional models.8, 10, 11, 12, 13 However, the sparse detector configurations invariably affect image quality, leading to overall sensitivity loss, increased background variability, and reduced spatial resolution and contrast recovery.8, 14, 15, 16, 17 Simulation‐based comparisons have confirmed that such configurations can reduce system cost but substantially lower sensitivity, especially in designs with aggressive sparsity levels.18, 19 Nonetheless, cost‐effectiveness analyses suggest that extended FOV PET systems may still be financially justifiable in selected clinical applications, particularly when paired with algorithmic compensation strategies.20 Various compensatory methods have been explored to address challenges related to detector gaps, including compressed sensing and inpainting,21, 22, 23 sinogram interpolation,24 optimisation‐based reconstruction algorithms,25 normalisation to account for missing lines of response (LORs),10, 11, 19 and continuous bed motion.9, 17, 26 However, these solutions have often been applied to small‐scale data losses and struggle to handle the massive count losses that arise from removing large portions of detectors elements from the PET configurations. The normalisation approach, although offering a large‐scale solution, has previously only been applied to either axial or transaxial gaps of small sizes and has yet to be successfully implemented on the complex missing data patterns resulting from sparsity patterns that combine axial and transaxial gaps. Furthermore, while continuous bed motion helps average out undersampling artifacts, it does not address the root problem of missing detector elements and remains available only on select scanners, limiting its applicability in sparse detector configurations.

In recent years, deep learning has emerged as a potent tool in medical imaging, particularly for data completion tasks like estimating high‐count PET data from low‐count measurements.27, 28, 29 A recent review highlights the expanding role of deep learning across the PET reconstruction pipeline.30 The detector gap problem has been studied as a learning problem in smaller scales such as a few uniformly spaced axial gaps,31, 32, 33, 34 or a C‐shaped ring design.35 However, these studies did not address the extensive and complex patterns of missing data that arise in more aggressive sparse PET configurations, such as those explored here. Moreover, most deep learning applications in PET reconstruction have focused on the image domain; direct sinogram restoration from sparsely sampled data remains largely unexplored.

Therefore, this proof‐of‐concept study proposes a novel deep learning‐based approach to fill incomplete sinograms resulting from inter‐detector gaps when removing 50% of detectors in a chessboard pattern. Our method, leveraging a deep sinogram restoration network, presents a practical and efficient solution to address the significant count losses and image quality degradation inherent in sparse PET system configurations. This approach enhances the feasibility of cost‐effective extended FOV PET scanners and represents a novel technical advancement in the application of artificial intelligence in nuclear medicine.

METHODS

2

Data description and processing

2.1

The PET dataset originates from a clinical trial, MORRIS (NCT02379039), approved by the Regional Ethical Review Board, Umeå (dnr 2015/117‐31). To evaluate the proposed method, eight different pelvis scans acquired without time‐of‐flight on a GE Signa PET/magnetic resonance (MR) imaging system were used. The original dataset, therefore, contained information from all detector positions. To simulate a sparse PET configuration, a subsampling scheme cancelling half of the detectors in a chessboard pattern was adopted (Figure 1). The chessboard pattern was created by cancelling single crystals units, that is, 1×1 crystals in each chessboard square, resulting in gaps of size 4×5.3 mm. Detector removal was performed by discarding coincidences from the original list mode data file and binning the new list mode file into 1981 2D sinograms per scan (the scanner's standard sinogram format). The resulting count loss corresponds to deleting approximately 75% of the LORs. In the sparse configuration, we evaluated cancelling out crystal units corresponding to both black and complimentary white positions as shown in Figure 1. The modified dataset contained 8×2×1981=31696 incomplete 2D sinogram slices, counting eight individual scans, chessboard and complimentary chessboard patterns, and all 1981 direct and oblique planes.

Example comparison of a standard compact PET configuration (a) and a sparse PET configuration using a chessboard detector pattern (b). For the sake of resolution, the shown mockup scanners comprise of 15 rings with 128 detector elements each. PET, positron emission tomography.

One can determine which lines of response are associated with removed detectors and, therefore, create a binary sinogram mask indicating affected sinogram pixels. Highlighted pixels in the mask must be restored, whereas the others can be left as they are. The effects of the detector cancellation caused sinogram pixels to either be zeroed out or have lower intensity specific 2D sinograms (Figure 2a). The main pattern for missing counts corresponded to a grid of diagonal zero‐valued lines (Figure 2b). The second pattern containing low‐intensity values and zero‐valued lines appeared in added cross planes for ring differences equal to one mm (Figure 2c).

Visualisation of the sinogram distortion due to the sparse PET chessboard configuration. (a) Pixel‐wise correlation between the original and distorted sinograms; figure shows random sample of 105 pixels from the different scans (different colours for different scans). The distorted pixels follow two separate distributions (marked by arrows)—they are either zero (i) or of lower intensity than the original (ii). The ideal fit is illustrated by a solid line, and the overall total fit to the original pixels is illustrated by a dashed line. (b) shows the sinogram distortion pattern with zeroed out pixels in direct planes and cross planes with ring difference >1. (c) shows the sinogram distortion pattern with low‐intensity pixels in summed cross planes with ring difference 1. PET, positron emission tomography.

Sinogram restoration network

2.2

The proposed deep network was inspired by a segmentation task36 and consists of a residual U‐Net37 combining the strengths of residual learning38 and the well‐known U‐Net architecture.39 The network consists of an asymmetric encoder–decoder structure (Figure 3). The encoder layers feature an increasing number of residual blocks and one strided convolution, which halves the spatial dimensions. The layers of the decoder are composed of strided transposed convolutions which double the spatial dimensions, followed by one residual block. Skip connections connect the down‐sampling and up‐sampling segments, allowing the network to combine positional and contextual information to predict the missing sinogram information and aid gradient back‐propagation during training. Each sinogram was normalised to the range [0:1] before input and then denormalised by the same factor after output to control the unboundedness of the sinogram values. This was shown to be beneficial during training, achieving a more steady and fast convergence.40 Finally, the pixels not affected by the removed detectors were copied from the input sinograms and reinstated in the output.

Architecture of the implemented sinogram restoration network. Each green arrow represents a residual block containing convolution, batch normalisation and rectified linear unit (ReLU) activation, as well as a residual connection. The dotted arrows represent skip connections, and the circled plus signs represent addition of the features from the down‐sampling layers with the up‐sampling layers. The network down‐samples via convolution with stride 2, reducing the spatial dimensions by half, and up‐samples using transpose convolution, doubling the spatial dimensions. Two final convolutions layers are used to generate the output. The sinograms are normalised to range [0:1] before input and denormalised by the same factor after output. Finally, the pixels not affected by the removed detectors are copied from the input sinograms and reinstated in the output.

Network training

2.3

The proposed sinogram restoration network was trained on six patients (6×2×1981=23772 sinograms) at a time, holding out one patient for validation (1×2×1981=3692 sinograms) and one for testing (1×2×1981=3692 sinograms). Eight‐fold cross‐validation allowed assessment of model performance on all patients. A combined pixel‐wise structural similarity index measure (SSIM) and ℓ1 based loss function L was used for the preservation of pixel accuracy as well as visual coherence. Specifically, we defined our loss as

[eqn]

where

[eqn]

[eqn]

Here, y^ and y are the predicted and original sinograms, respectively. The means μ, variances σ2 and covariance σy^y are computed locally over corresponding pixels. The constants C1 and C2 depend on the dynamic range L of the sinograms such that C1=(0.01L)2 and C2=(0.03L)2. The MAE is computed over a masked region m consisting of n activated pixels.

The loss function was minimised using the Adam optimiser41 for 200 epochs with a learning rate of 10−3, which experienced a stair‐case exponential decay rate of 0.96. During each training epoch, 372 batches of 64 sinograms were used until the whole training set was exhausted. The training was evaluated using the validation set at the end of each epoch. If the validation loss did not improve for 20 consecutive epochs, an early stopping criterion was triggered, effectively stopping the training.

The model was developed using the Keras modules of TensorFlow and trained on a high‐performance computer equipped with 48 CPU cores and 2 NVIDIA A100 GPUs. The data was converted into the TFRecords format, and the loading pipeline was optimised, so each epoch took about 36s. The minimum, maximum and average training time (number of epochs) were 73, 122, and 98min (109, 194, 149 epochs), respectively. Once the model was trained, the runtime to infer the restored sinograms was approximately of 20 s per patient. The same operation took approximately two and a half minutes on an Apple MacBook Pro M1 (2021), with a much more modest hardware configuration which is comparable to the ones found within typical clinical settings.

Quantification

2.4

The sinogram restoration network performance was evaluated in both sinogram and image domains. For comparison, interpolated 2D sinograms were also created using the Clough–Tocher method.42, 43 Both the network restored (all 1981×2 slices in the test set) and the interpolated data were compared to the ground truth original scans (without missing detectors). Between‐sinogram and between‐image comparisons were based on the SSIM and MAE in Equations (1), (2). Note that high values are better for the SSIM, while low values of the MAE are the best.

To complement these global metrics, we measured mean activity (μROI) and standard deviation (σROI) in a 5–10 cm3 bladder region of interest (ROI) and a randomly placed background ROI of similar size. We define background variability (BV) for a ROI as:

[eqn]

In this study, we report a relative background variability (rBV), defined as the ratio of BV in the restored images to BV in the corresponding original images. We further evaluated the contrast recovery (CR) defined as the ratio between the contrast in the restored and the original images, defined as:

[eqn]

All data processing steps, including list mode alterations, binning into sinograms, and image reconstructions, were performed with GE Healthcare's PET research toolbox Duetto (v02.18). Following the clinical protocol, we performed image reconstruction to 192×192×89 voxels, 60 cm radial and 25 cm axial FOV via the 3D ordered subset expectation maximisation algorithm with 28 subsets, 2 iterations, and a 5 mm Gaussian postfilter. The reconstruction used the default normalisation file and corrections for attenuation, scatter and randoms.

For a robust and quantitative comparison, two statistical tests were chosen based on the nature of the data and the aspects of the prediction methods being evaluated. Fisher's Z‐transformation was used to compare pixel‐wise correlations between original and predicted sinograms. The Mann–Whitney U test was employed to evaluate significant difference between evaluated measures in both sinograms and reconstructed images.

RESULTS

3

This section will refer to the ground truth original scans as “original” and the sparse PET configuration‐modified data as “distorted”. Predictions resulting from the sinogram restoration network are referred to as the “restored” sinograms, as opposed to the “interpolated” sinograms that served as a baseline method to fill in the missing data.

Sinogram restoration

3.1

The proposed sinogram restoration model successfully recovers missing counts in the distorted sinograms (Figure 4). The sinogram restoration network fills in missing counts with high fidelity and demonstrates a strong positive linear correlation with the original counts (Figure 5b). The restored sinograms cannot reproduce finer details that would have occurred in missing detectors, which inevitably leads to a smoothing effect (Figure 4b). This is reflected in moderately high SSIM values, as seen in Figure 6a. The mean absolute pixel deviation is consistently within two counts per pixel (Figure 6b).

Visual comparison of 2D sinograms based on the ground truth (Original), the sparse chessboard configuration (Distorted), and predictions from interpolation filling (Interpolated) and from the sinogram restoration network (Restored). The figure includes two different sinogram slices from one scan in the test set. (a) shows a direct plane sinogram slice. (b) shows a summed cross‐plane sinogram slice with ring difference 1.

Pixel‐wise correlation between the original sinograms and the predicted sinograms from (a) interpolation and (b) the restoration network. The figure shows a random sample of 105 pixels from the different scans (different colours for different scans). The ideal fit is illustrated by the solid lines, and the overall total fit to the original pixels is illustrated by the dashed lines. The interpolated pixels exhibit traces of the two separate distributions of the distorted pixels with remaining pixels of lower intensity than the original (marked by arrow).

(a) Structural similarity index measure (SSIM) and (b) mean absolute error (MAE) of sinograms from the sparse chessboard configuration (Distorted), and predictions from interpolation filling (Interpolation) and from the sinogram restoration network (Restored). Note that high values are better for the SSIM, while low values of the MAE are the best. Whiskers, box limits, and inner line indicate 5th and 95th percentiles, 25th and 75th percentiles, and median, respectively. The figure includes results from Mann–Whitney U test for difference between the samples, where *** indicates a p‐value below 0.001.

The interpolated sinograms also present low MAE (Figure 6b) and a strong positive correlation with the original counts (Figure 5a). However, the correlation is significantly lower compared to the restored predictions (p<0.001). The interpolation method fails to recover the lower counts in the diagonal cross planes of the distorted data, leading to distinctive underrepresentation in the added cross‐plane sinograms for ring differences equal to one (Figure 4b). The shortcoming of the interpolation method is evident, with inferior values of the SSIM and MAE (p<0.001), as seen in Figure 6.

Figure 4 shows representative 2D direct sinograms and summed cross‐planes with ring difference 1. The interested reader is referred to supplementary Figure S.1 for corresponding examples from the same scans for larger ring differences (5 and 11), confirming similar cancellation patterns and restoration behaviour across oblique planes.

Reconstructed images

3.2

How the effects of sinogram restoration translate in terms of quality of reconstructed images is shown in Figure 7. The smooth nature of the restored sinograms leads to reconstructed images with smoother background texture and some lack of finer details. This is manifested as somewhat reduced high‐contrast regions and missing low‐contrast regions (Figure 7). Quantitatively, this was reflected in the background variability, where low‐contrast regions appeared slightly noisier than the original images (rBV≈1.08), while high‐contrast regions without specific bindings such as the bladder were slightly smoother (rBV≈0.92). A similar pattern can be seen in the pixel‐wise deviation (Figure 8b). The main deviation is attributed to the high‐contrast pixels and has an underestimating effect. Overall, Figure 8b shows a strong correspondence between restored and original pixel intensities, consistent with an average contrast recovery of 100% (SD 2%) across scans, indicating that the restoration preserved contrast between high uptake and background.

Visual comparison of reconstructed image slices based on original sinograms (Original), sinograms from the sparse chessboard configuration (Distorted), and predicted sinograms from interpolation (Interpolated) and from the restoration network (Restored). (a) and (b) show two different image slices from two different scans in the test set, with reconstructed images on the top vertical panel, corresponding difference images on the middle panel (colour bar in Bq/ml), and corresponding histograms of pixel differences in the bottom panel (x‐axis in Bq/ml).

Pixel‐wise correlation between the original images and images reconstructed based on the predicted sinograms from the restoration network from (a) interpolation and (b) the restoration network. The figure shows a random sample of 105 pixels from the different scans (different colours for different scans). The ideal fit is illustrated by the solid lines, and the overall total fit to the original pixels is illustrated by the dashed lines. The interpolated image pixels indicate two separate distributions of the image pixels with either lower (i) or higher (ii) intensity than the original (marked by arrows).

The underestimation in interpolated sinogram counts seen in Figure 4b has a pronounced impact on reconstruction, resulting in image slices with lower intensity (Figures 7a and 8a). Moreover, the interpolated sinograms contribute to occasional spurious features in the reconstructed images, appearing as overestimated intensities, visible in Figures 7b and 8a. This tendency is reflected in the histograms in Figure 7, where the interpolated images exhibit a broader and more uneven distribution of pixel differences relative to the original images. In contrast, the restored images show a narrower error distribution, indicating closer overall agreement with the original, despite the inherent smoothing. The visual comparison is also appreciated in the quantitative measures in Figure 9, with superior values of SSIM and MAE for the restored images (p<0.001).

(a) Structural similarity index measure (SSIM) and (b) mean absolute error (MAE) of reconstructed images based on the sinograms from the sparse chessboard configuration (Distorted), and predictions from interpolation filling (Interpolation) and from the sinogram restoration network (Restored). Note that high values are better for the SSIM, while low values of the MAE are the best. Whiskers, box limits, and inner line indicate 5th and 95th percentiles, 25th and 75th percentiles, and median, respectively. Figure includes results from Mann–Whitney U test for difference between the samples, where *** indicates a p‐value below 0.001.

Image slices from the same scans of Figure 7 but reconstructed in coronal view can be found in Figure S.2, supporting the same qualitative trends while revealing some band‐shaped artefacts, most evident in the interpolated images.

DISCUSSION

4

This study presents a novel method for deep learning‐based filling of incomplete sinograms resulting from sparse PET configuration systems with inter‐detector gaps following a chessboard pattern. Motivated by other studies on sparse detector configurations,8, 9, 10, 11, 14, 15, 16, 17, 24, 25, 26 we explored several patterns for detector cancellation. In addition to the expected sensitivity loss from missing counts relative to non‐sparse designs,16, 17 specific image quality degradations have been reported with increasing gap sizes. Background variability and losses in spatial resolution and contrast recovery tend to increase with the axial gap size.8, 9 Other known drawbacks are noise artefacts due to axial gaps and streaking artefacts due to transaxial gaps.14, 15, 31 The chessboard pattern used here was chosen to generate maximum detector sparsity (50% of the detectors) with minimal gap size. We investigated the data loss effects of several block sizes for the chess squares. A preliminary evaluation of the sinogram restoration network on configurations of sizes 1×1, 2×2, and 3×4 showed robustness in terms of SSIM and MAE (data not shown). However, only the best‐performing (in terms of SSIM and MAE) chessboard pattern size 1×1 was fine‐tuned and included in the presented results.

The restoration network successfully recovered distorted sinograms. However, the smooth nature of the restored sinograms causes excess smoothing also in the reconstructed images, limiting clinical usability at this point. Although small high‐contrast regions are detected in the images, the model demonstrates a lack of precision in correctly quantifying these areas. The smoothing effect seems to have a direct impact on the contrast recovery, which was evident in the reconstructed image in Figure 7 where some high‐contrast areas are reduced in size and some are distorted in their shape. It is also evident in the pixel‐wise contrast comparison in Figure 8b, where we see an indication of high‐contrast pixels as the main drivers to the deviation from a perfect fit to the original pixels.

In contrast, the interpolation method performed worse, underestimating specific sinogram slices and overestimating some pixels, causing hallucinatory artefacts in some reconstructed image slices. This is further evidenced by the histograms in Figure 7, which show that the interpolated method results in wider and more irregular error distributions, reinforcing its tendency towards inconsistent pixel‐level accuracy. The underestimated sinogram slices coincide with certain cross planes, adding up oblique angles from two directions and a position shift in the distortion pattern, resulting in lower counts across all pixels. Interpolation cannot infer systematic count loss but only fills in gaps, often leading to artefacts.

The 2D sinograms corresponding to summed cross planes with ring difference 1 displayed both zeroed‐out lines and overall lower counts values in the remaining pixels and, therefore, differ from the ground truth in all sinogram pixels (Figure 2b). Hence, the sinogram restoration network had to learn to predict entire sinogram slices for these cross planes. This approach is not possible for the interpolation, where reference points are needed to generate predictions. Therefore, a different mask was supplied to the interpolation method for these mentioned cross planes, leaving the low‐count pixels as reference points, resulting in an overall underestimation of the zeroed‐out pixels. Even when boosted with a global scaling factor, the interpolation approach consistently underestimated counts and could not match the performance of our restoration network. The effect is likely responsible for the band‐shaped artifacts observed in the reconstructed coronal images (Figure S.2). This behaviour may be related to the 2D formulation adopted in this work, where the network was trained on individual sinogram slices without explicit modelling of inter‐slice correlations. Further investigation into architectures that better capture cross‐plane dependencies may therefore be warranted.

Building on these observations, future work should explore extending the current 2D network into a 3D formulation to explicitly model volumetric context, as well as an investigation of incorporating sharpness to the U‐Net structure to mitigate the smoothing effect inherent in neural networks. It would also be interesting to investigate this specific model's capacity regarding possible data losses for clinically approved image quality and if there is an optimal sampling pattern. One might consider an adaptive sparsity pattern, for example, tighter sampling at the scanner centre and more sparse sampling at the scanner ends. Another necessary step is to explore the integration of sinogram re‐normalisation to account for the missing lines of response, particularly in the cross‐planes. In this study, we did not explicitly apply re‐normalisation. However, our results show that the network is capable of restoring both the missing signals in the direct planes and the attenuated signals in the cross‐planes without a separate re‐normalisation. This suggests that the model can, to some extent, compensate for the effects of missing and attenuated signals without explicit need for re‐normalisation of the sinograms. Nonetheless, applying explicit re‐normalisation as a preprocessing step may reduce the network's burden and improve restoration accuracy—a possibility that warrants further investigation.

Furthermore, the dataset was limited in the sense that it only included one anatomical region and only left one patient for validation. This was a main motivation for training the model on individual sinogram slices, which comes at a possible trade‐off of introduced bias in the results. The current model was furthermore trained on one specific detector pattern at a time. To increase model generalisability, the dataset should be extended to multiple anatomical regions, and the training process could be extended to include several pattern sizes. Our model could also be combined with parallax correction. In the case of missing LORs due to the missing detector elements, such a correction scheme would also give the added benefit of shifting some of the measured counts to empty detector positions, thereby assisting the estimation of missing counts. Another alternative is to extend our model to include time‐of‐flight (TOF) information. We expect that including the TOF timing resolution would improve lesion detectability compared to the non‐TOF setting due to enhanced signal‐to‐noise, mainly due to the increase in the number of 2D sinograms available for training. We do not expect the reported blurring to go away since the same detector elements are still missing. An additional future direction would be to evaluate whether combining our method with, for instance, bed motion could enhance sinogram restoration, since bed motion effectively provides multiple slightly shifted sampling configurations of the same patient position. In summary, the present work should be regarded as a proof‐of‐concept from a deep learning standpoint, with practical implementation requiring further development.

The proposed sparse chessboard configuration is designed to explore the potential for lower‐cost or more cost‐effective clinical PET scanners, aiming for the same axial FOV as conventional scanners but with only 50% of the original detector materials. We further speculate that the proposed sparse geometry could alternatively be used to allow the development of a two‐fold extension of the axial FOV of conventional compact ring configurations with the same number of detectors. An extended axial FOV would, for instance, allow simultaneous imaging of distant organs such as the brain and heart, as in the total body PET systems available on the market. This enhanced coverage enables static whole‐body imaging, a challenge with standard systems due to axial limitations. A sparse long axial FOV PET scanner with 50% fewer detectors, retaining only 25% of the initial LORs, would inevitably have substantially lower sensitivity than full‐coverage total‐body scanners. However, its sensitivity would remain comparable to that of earlier‐generation PET systems, which have more limited axial coverage. From this perspective, such a system could still enable whole‐body imaging at a reasonable cost and acceptable sensitivity. Combining the two approaches of extended axial FOV and reduced detector cost compared to compact designs results in a medium‐cost extended FOV, which offers many of the advantages of a full total body PET but positioning itself between the current conventional PET and extended FOV systems in terms of investments. We note, however, that while the 1×1 sparsity pattern performed best in our evaluation, such a configuration may not be the most practical or cost‐effective from a manufacturing perspective.

Finally, although our work focuses on sparse ring‐based PET configurations, other approaches share the same goal of reducing detector costs through novel system geometries. Flat‐panel PET configurations44, 45 reflect the same broader objective of enabling affordable PET imaging through hardware sparsity. Preliminary studies have applied deep learning‐based denoising to flat‐panel PET systems with sparse detectors, showing promising image quality with a 30% detector reduction.46

CONCLUSION

5

This study serves as a proof of concept that sparse PET configurations with inter‐detector gaps are within reach with the aid of deep learning‐based data restoration. Our model demonstrated promising results in sinogram restoration. However, certain limitations, particularly in preserving finer details, indicate the need for further refinements. Nevertheless, this research represents a step forward in increasing the availability of clinical PET systems worldwide by realising the practical potential of deep learning approaches for designing low‐cost extended FOV PET systems.

CONFLICT OF INTEREST STATEMENT

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Bibliography46

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Spencer BA , Berg E , Schmall JP , et al. Performance evaluation of the u EXPLORER total‐body PET/CT Scanner based on NEMA NU 2‐2018 with additional tests to characterize PET scanners with a long axial field of view. J Nucl Med. 2021;62:861‐870, doi:10.2967/jnumed.120.250597 33008932 PMC 8729871 · doi ↗ · pubmed ↗
2Prenosil GA , Sari H , Fürstner M , et al. Performance characteristics of the biograph vision quadra PET/CT System with a long axial field of view using the NEMA NU 2‐2018 standard. J Nucl Med. 2022;63:476‐484, doi:10.2967/jnumed.121.261972 34301780 · doi ↗ · pubmed ↗
3Pantel AR , Viswanath V , Daube‐Witherspoon ME , et al. Penn PET explorer: Human imaging on a whole‐body imager. J Nucl Med. 2020;61:144‐151, doi:10.2967/jnumed.119.231845 31562224 PMC 6954463 · doi ↗ · pubmed ↗
4Karp JS , Viswanath V , Geagan MJ , et al. Penn PET Explorer: Design and preliminary performance of a whole‐body imager. J Nucl Med. 2020;61:136‐141. doi:10.2967/jnumed.119.229997 31227573 PMC 6954465 · doi ↗ · pubmed ↗
5Dai B , Daube‐Witherspoon ME , Mc Donald S , et al. Performance evaluation of the Penn PET explorer with expanded axial coverage. Phys Med Biol. 2023;68:095007, doi:10.1088/1361-6560/acc 722 PMC 1045077436958051 · doi ↗ · pubmed ↗
6Vandenberghe S , Moskal P , Karp JS . State of the art in total body PET. EJNMMI Physics. 2020;7:1‐33, doi:10.1186/s 40658-020-00290-2 32451783 PMC 7248164 · doi ↗ · pubmed ↗
7Abgral R , Bourhis D , Salaun PY . Clinical perspectives for the use of total body PET/CT. Eur J Nucl Med Mol Imaging. 2021;48:1712‐1718, doi:10.1007/s 00259-021-05293-4 33742236 · doi ↗ · pubmed ↗
8Daube‐Witherspoon ME , Viswanath V , Werner ME , Karp JS . Performance characteristics of long axial field‐of‐view PET scanners with axial gaps. IEEE Trans Radiat Plasma Med Sci. 2021;5:322‐330, doi:10.1109/trpms.2020.3027257 34179595 PMC 8224406 · doi ↗ · pubmed ↗