Synthesis and Inpainting-Based MR-CT Registration for Image-Guided Thermal Ablation of Liver Tumors
Dongming Wei, Sahar Ahmad, Jiayu Huo, Wen Peng, Yunhao Ge, Zhong Xue,, Pew-Thian Yap, Wentao Li, Dinggang Shen, Qian Wang

TL;DR
This paper introduces a rapid, multi-step image registration method combining synthesis, inpainting, and deep learning to accurately align MR and CT images for guiding liver tumor ablation procedures.
Contribution
It presents a novel fast registration framework that integrates Cycle-GAN synthesis, inpainting, and unsupervised registration networks for MR-CT image alignment.
Findings
Achieves high registration accuracy.
Operates with very fast computational speed.
Effective in guiding thermal ablation procedures.
Abstract
Thermal ablation is a minimally invasive procedure for treat-ing small or unresectable tumors. Although CT is widely used for guiding ablation procedures, the contrast of tumors against surrounding normal tissues in CT images is often poor, aggravating the difficulty in accurate thermal ablation. In this paper, we propose a fast MR-CT image registration method to overlay a pre-procedural MR (pMR) image onto an intra-procedural CT (iCT) image for guiding the thermal ablation of liver tumors. By first using a Cycle-GAN model with mutual information constraint to generate synthesized CT (sCT) image from the cor-responding pMR, pre-procedural MR-CT image registration is carried out through traditional mono-modality CT-CT image registration. At the intra-procedural stage, a partial-convolution-based network is first used to inpaint the probe and its artifacts in the iCT image. Then, an…
| In-plane FOV | Resolution | Scanner | |
| pMR image | Siemens 3.0T Skyra | ||
| pCT image | Philips Brilliance 64 CT | ||
| iCT image | Philips Brilliance 64 CT | ||
| Rigid | Deformable | ||||
| FLIRT | ANTs | ANTs | Proposed | ||
| pMR-iCT | |||||
| Liver | Dice (%) | 48.4440.40 | 85.522.92 | 86.593.30 | 86.963.00 |
| TRE (mm) | - | 52.569.85 | 5.182.43 | 4.932.72 | |
| pMR-pCT | |||||
| Liver | Dice (%) | 48.0740.40 | 87.032.60 | 89.552.04 | 90.591.73 |
| TRE (mm) | - | 6.632.73 | 5.592.01 | 4.672.00 | |
| Tumor | Dice (%) | 12.8825.76 | 51.1017.13 | 55.345.70 | 62.454.66 |
| TRE (mm) | - | 6.712.27 | 6.081.40 | 3.890.99 | |
| pCT-iCT | |||||
| Liver | Dice (%) | - | - | 87.905.25 | 88.635.53 |
| TRE (mm) | - | - | 5.063.29 | 4.373.30 | |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging Techniques and Applications · Radiomics and Machine Learning in Medical Imaging · Advanced X-ray and CT Imaging
MethodsPerceptual control theoretic architecture
\setremarkmarkup
(#2)
11institutetext: Institute for Medical Imaging Technology, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200030, China
11email: [email protected] 22institutetext: Department of Radiology and Biomedical Research Imaging Center (BRIC), University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
22email: [email protected] 33institutetext: North China Electric Power University, Beijing, China 44institutetext: Shanghai United Imaging Intelligence Co., Ltd, Shanghai, China 55institutetext: Shanghai Cancer Center, Fudan University, Shanghai, China
Synthesis and Inpainting-Based MR-CT Registration for Image-Guided Thermal Ablation of Liver Tumors
Dongming Wei 1122
Sahar Ahmad 22
Jiayu Huo 11
Wen Peng 33
Yunhao Ge 44
Zhong Xue 44
Pew-Thian Yap 22
Wentao Li 55
Dinggang Shen(✉) 22
Qian Wang(✉) 11
Abstract
Thermal ablation is a minimally invasive procedure for treating small or unresectable tumors. Although CT is widely used for guiding ablation procedures, the contrast of tumors against surrounding normal tissues in CT images is often poor, aggravating the difficulty in accurate thermal ablation. In this paper, we propose a fast MR-CT image registration method to overlay a pre-procedural MR (pMR) image onto an intra-procedural CT (iCT) image for guiding the thermal ablation of liver tumors. By first using a Cycle-GAN model with mutual information constraint to generate synthesized CT (sCT) image from the corresponding pMR, pre-procedural MR-CT image registration is carried out through traditional mono-modality CT-CT image registration. At the intra-procedural stage, a partial-convolution-based network is first used to inpaint the probe and its artifacts in the iCT image. Then, an unsupervised registration network is used to efficiently align the pre-procedural CT (pCT) with the inpainted iCT (inpCT) image. The final transformation from pMR to iCT is obtained by combining the two estimated transformations, i.e., (1) from the pMR image space to the pCT image space (through sCT) and (2) from the pCT image space to the iCT image space (through inpCT). Experimental results confirm that the proposed method achieves high registration accuracy with a very fast computational speed.
Keywords:
Thermal Ablation Liver Tumor Image Registration Neural Network.
1 Introduction
Thermal ablation [1] elevates the temperature (55*∘–65∘* Celsius) of a focal zone in the tumor and induces irreversible cell injury and eventually tumor apoptosis and coagulative necrosis. Therefore, accurate targeting of the tumor area is critical for ablating tumor tissues only and leaving the surrounding healthy tissues intact.
CT imaging is typically used to guide the interventional procedure in thermal ablation, where pre-procedural CT (pCT) is used for planning, and intra-procedural CT (iCT) is captured during the treatment to facilitate safe placement of the ablation probe and accurate targeting of the tumor [2]. However, CT is relatively poor in tissue contrast (e.g., arteries) and is susceptible to artifacts introduced by the probe during the procedure. Therefore, high-resolution pCT and pre-procedural MR (pMR) images are typically aligned during planning and then registered onto the iCT image for more precise guidance in positioning the probe to the desired region of interest (ROI) [3, 4]. In liver tumor ablation, accuracy and speed of such an alignment are both important as it can compensate for deformations caused by patient positioning and respiratory motion without delay.
Most volumetric registration algorithms [5, 6, 7] are only feasible in the pre-procedural stage as they involve iterative yet time-consuming optimization. Moreover, they do not deal with the probe-induced artifacts. In order to overcome these challenges, we propose a fast image registration framework to align pMR images onto iCT images for guiding thermal ablation of the liver tumor. Meanwhile, our method also eliminates the probe artifacts so that they do not interfere during registration. The proposed registration framework consists of two stages:
Pre-Procedure: Rigid and deformable registrations between pMR and pCT images. We use mutual-information (MI)-based Cycle-GAN to generate sCT from pMR images to convert the cross-modality registration into a mono-modality problem. 2. 2.
Intra-Procedure: Fast deformable registration of the inpainted iCT (inpCT) image with the pCT image, using an unsupervised registration network (UR-Net).
Finally, the pMR image is aligned to the iCT image by composing the two transformations estimated in the above two stages.
2 Methods
In order to accurately and efficiently register pMR images onto iCT images for guiding thermal ablation of liver tumors, we propose a two-stage registration framework. First, we convert the cross-modality MR-CT image registration into
mono-modality registration (CT-CT) by synthesizing sCT images from pMR images through MI-based Cycle-GAN. This mono-modality rigid and deformable registration is performed by using ANTs [8]. Then, the trained UR-Net performs deformable registration between pCT and inpCT, which is inpainted by a partial-convolution (PConv)-based network from the iCT image. Thus, the pMR image can be warped onto the iCT space by composing the output transformations of the two registration stages. The pipeline of our method is shown in Fig. 1.
2.1 Pre-Procedural MR-CT Registration
MR-CT registration is challenging due to large appearance differences between the two modalities. Previous works [9] have shown that the cross-modality registration can be converted into a mono-modality registration to achieve better performance. To this end, we synthesize the sCT image from an input pMR image, which then facilitates the subsequent registration between pMR and pCT images.
The MR-to-CT synthesis is completed by using Cycle-GAN with mutual information constraint. Cycle-GAN [10], as one of the state-of-the-art image synthesis algorithms, adopts the adversarial loss given by two discriminators ( and ), such that the distribution of the output images of the two generators ( and ) is indistinguishable from that of the input images. It also uses the cycle-consistency (shown in Fig. 2(a)) to enforce the forward (MR-to-CT) and backward (CT-to-MR) syntheses to be bijective. However, Cycle-GAN fails to enforce structural similarity between the pMR and sCT images, which may lead to uncontrollable drifting of tissue/organ boundaries in the synthesized images (see red dashed contours in Fig. 2(b)).
Therefore, in addition to the cycle-consistency loss , we propose to introduce the MI loss to the generators ( and ) to directly enforce the structural similarity between the input and synthesized images (as shown in Fig. 2(a)). The MI loss is defined as:
[TABLE]
where and denote the histograms of and , respectively, and refers to the joint histogram of and . After synthesizing the cross-modality registration (pMR-pCT) is converted into a mono-modality registration problem (sCT-pCT). We then perform a conventional mono-modality registration using ANTs [8] to estimate the field that accounts for both rigid and deformable transformations.
2.2 Intra-procedural Registration
Due to the high computational efficiency requirement during intra-procedural stage, we propose to train a UR-Net (as shown in Fig. 3) in an unsupervised manner to perform expeditious deformable registration between pCT and inpCT images. The advantage is its fast speed by using parallel convolution on GPU during network inference. Dalca et al. have applied a similar unsupervised registration network to perform brain MR image registration [11], which shows comparable performance with the state-of-the-art optimization-based methods. In our work, the UR-Net is trained by pCT-inpCT image pairs with the loss function defined as:
[TABLE]
where and represent the fixed and moving images, respectively. The loss function comprises of (1) similarity (S) in terms of cross-correlation () between the warped moving image () and the fixed image (), and (2) the regularization (Reg) penalty defined in terms of L2-norm of the gradients of the deformation fields (). The detailed network architecture is shown in Supplementary file.
The probe and its artifacts, which hinder the interpretation of the underlying anatomy, may cause inaccurate registration [6]. In order to remove the probe and its streak artifacts in the iCT image, we train a PConv-based [12] 3D U-Net to obtain inpCT image (). Specifically, given a rough mask (e.g., a bounding box or a polygon) covering the probe and its artifacts, the inpainting network can reconstruct the underlying tissues and update the mask layer-by-layer until the mask shrinks away. The inpainting is based on 3D convolution (Conv)-based U-Net architecture, while each convolution layer is replaced with partial convolution to ensure that the inpainted contents will not be affected by the probe and its streak artifacts in the mask. The probe and its streak artifacts exist in iCT images but not in pCT images. The mask can be manually drawn in the iCT images by clinicians during the procedure, or prepared in advance as part of the procedure planning.
3 Experiments and Results
3.0.1 Dataset and Pre-processing –
Thirty-nine subjects undergoing liver tumor ablation were included in our experiment. Each subject was scanned with his/her own pMR, pCT and iCT images (see Table 1 for a summary of the parameters). Livers were delineated from pMR, pCT and iCT images, respectively, while tumors were delineated from pMR and pCT images but not iCT due to probe artifacts and limited contrast. We used 11 labeled subjects for testing, and the remaining 28 subjects were used for training the three networks. Before normalizing intensities into the range [0,1], the intensities of pCT and iCT images were thresholded in the range [-800,800]. pMR images were also rigidly aligned onto pCT images. Then, all the image sizes were resampled to with isotropic voxel distances. If not stated otherwise, the same training/testing datasets were used.
3.0.2 Implementation –
Three networks were trained: (1) pre-procedural stage: MI-based Cycle-GAN network for CT synthesis, and PConv-based U-Net for probe inpainting; (2) intra-procedural stage: UR-Net for deformable registration. All these networks were implemented in Keras and trained on a single NVIDIA Titan X GPU.
3.0.3 Evaluation Metrics –
We computed the Dice ratio over ROIs and the target registration error (TRE) over several landmarks of livers and tumors, to evaluate the registration accuracy. These two metrics are widely used for evaluation of registration performance, with higher Dice ratio (or lower TRE) characterizing better registration quality.
3.1 Registration Results for pMR and iCT
We conducted pMR-iCT image registration over the test dataset by using FSL FLIRT (rigid), ANTs (rigid and deformable) and the proposed method, respectively. As shown in Fig. 4, our proposed method achieves better accuracy against ANTs in tumor region and body alignment. Quantitative results using Dice ratio and TRE are presented in Table 2. It can be observed that our method yields better performance than ANTs. The last line of Table 2 reports the computational time in intra-procedural stage of ANTs and our proposed method, and our proposed method performed the registration in several seconds. Notice that our proposed method in the intra-procedural stage includes the iCT inpainting and the pCT-iCT image registration steps, which were computed on GPU.
3.2 Pre-procedural Stage
3.2.1 Pre-procedural Stage Registration –
The Dice and TRE metrics were evaluated over pMR-pCT pairs and their corresponding sCT-pCT pairs of the testing dataset. As shown in Table 2, the registration performance of our algorithm is improved over sCT-pCT pair, especially on the target tumor region (more than 7% Dice and 1.1mm TRE improvement), which proves that our proposed synthesis algorithm can facilitate the cross-modality registration.
3.2.2 MR-to-CT Synthesis –
We extracted 2240 slices from the transverse planes of 28 unpaired pMR and pCT images and used them as the training dataset. pMR and pCT images of each subject were linearly registered. For testing, we applied the MR-to-CT generator to synthesize CT images from pMR images slice-by-slice and then concatenated the synthesized slices into 3D volumes. As shown in Fig. 2(b), our method predicted better CT-like images from pMR images.
3.3 Intra-procedural Stage
3.3.1 Intra-procedural Stage Registration –
UR-Net was trained by rigidly registered pCT-inpCT image pairs in an unsupervised manner, where inpCT images were generated by the trained PConv-base inpainting network. We compared the UR-Net registration performance with ANTs over pCT-inpCT image pairs. The Dice and TRE metrics of liver and the intra-procedural computation time were evaluated (see Table 2). It can be seen that the UR-Net yielded better performance and was more efficient. The UR-Net method predicted a deformation field in around 3 secs for a image pair.
3.3.2 Inpainting –
pCT images were used for training the inpainting network. Each image was augmented 500 times using 2 or 3 random 3D shapes (i.e., 3D balls and bars) with random locations and sizes to imitate manual mask (cf. Fig. 5(b)). In the testing stage, we compared the inpainting results quantitatively and qualitatively on the iCT images. The results in Fig. 5(a) shows that PConv-based network obtained around 7 dB PSNR improvement in the masked region than Conv-based network. Particularly, each iCT image was combined with a cubic mask () in healthy tissue region and input into PConv and Conv-based networks, respectively. The PSNR is computed over the reconstructed region with the ground-truth. For clinical cases, the results are shown in Fig. 5(c). The mask was drawn in the pre-procedural images by physicians to cover the tumor and the planned puncture pathway in the tumor-centered transverse plane. Then, an appropriate height was chosen in the intra-procedural stage to ensure that the closed 3D contour can cover the probe and its artifacts. The inference time was around 2 secs for a subject. Note that we will further quantitatively evaluate the improvement of tumor registration, and investigate the possibility of reconstructing the tumor directly by the PConv-based inpainting network.
4 Conclusion
A learning-based registration framework is proposed to align pre-procedural MR and intra-procedural CT images for image-guided thermal ablation of liver tumor. Experimental results showed that our method can efficiently and effectively overlay pMR onto iCT during ablation with high registration accuracy, compared to the state-of-the-art ANTs algorithm. We also showed that MI-based Cycle-GAN synthesis and unsupervised registration improves the overall performance.
Acknowledgement
This work was partially supported by the National Key Research and Development Program of China (2018YFC0116400) and STCSM (19QC1400600).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Ahmed, M., Brace, C.L., Lee, F.T., Goldberg, S.N.: Principles of and Advances in Percutaneous Ablation. Radiology. 258 (2), 351–369 (2011)
- 2[2] Ahmed, M., et al.: Image-guided tumor ablation: standardization of terminology and reporting criteria—a 10-year update. Radiology. 273 (1), 241–260 (2014)
- 3[3] Chu, K.F., Dupuy, D.E.: Thermal ablation of tumours: biological mechanisms and advances in therapy. Nat. Rev. Cancer. 14 (3), 199–208 (2014)
- 4[4] Maybody, M.: An overview of image-guided percutaneous ablation of renal tumors. Semin. Intervent. Radiol. 27 (03), 261–267 (2010)
- 5[5] Liao, R., et al.: A Review of Recent Advances in Registration Techniques Applied to Minimally Invasive Therapy. IEEE Trans. Multimed. 15 (5), 983–1000 (2013)
- 6[6] Fei B., Lee Z., Boll D.T., Duerk J.L., Lewin J.S., Wilson D.L.: Image Registration and Fusion for Interventional MRI Guided Thermal Ablation of the Prostate Cancer. In: Ellis R.E., Peters T.M. (eds) MICCAI 2003. LNCS, vol. 2879, pp. 364–372, Springer, Berlin, Heidelberg (2003). https://doi.org/10.1007/978-3-540-39903-2_45 · doi ↗
- 7[7] Elhawary, H., et al.: Multimodality Non-rigid Image Registration for Planning, Targeting and Monitoring During CT-Guided Percutaneous Liver Tumor Cryoablation. Acad. Radiol. 17 (11), 1334–1344 (2010)
- 8[8] Avants, B.B., Tustison, N., Song, G.: Advanced normalization tools (ANTS). Insight j. 2 , 1–35 (2011)
