Assessment of deep learning reconstruction effects on detection and differentiation of liver metastasis from hepatic hemangioma in diffusion-weighted imaging
Kumi Ozaki, Hanae Hasegawa, Shota Ishida, Jihun Kwon, Yasutomo Katsumata, Masami Yoneyama, Yukichi Tanahashi, Satoshi Goshima

TL;DR
This study compares deep learning-enhanced and traditional imaging methods for detecting liver tumors and hemangiomas, finding improved image quality but similar diagnostic accuracy.
Contribution
The study evaluates deep learning reconstruction's impact on liver metastasis and hemangioma differentiation in diffusion-weighted imaging.
Findings
DL-DWI showed better metastasis conspicuity and higher SNR/CNR compared to CS-DWI.
ADC values of liver and lesions were lower in DL-DWI than in CS-DWI.
Diagnostic performance metrics like ROC area, sensitivity, and specificity were comparable between DL-DWI and CS-DWI.
Abstract
To evaluate and compare the performance of diffusion-weighted imaging (DWI) using compressed sensing (CS) and DWI using CS with model-based deep learning reconstruction (DL-DWI) in detecting and differentiating liver metastases from hepatic hemangiomas. We retrospectively analyzed data from 53 patients with metastases or hemangiomas (34 men and 19 women, mean age, 65.9 years) who underwent abdominal DWI. Two radiologists evaluated liver contour and distortion, artifact, noise, overall image quality, and lesion conspicuity using a five-point scale. Signal-to-noise ratio (SNR) and apparent diffusion coefficient (ADC) of the liver, as well as contras-to-noise ratio (CNR) and ADC of metastases (n = 59) and hemangiomas (n = 33) were assessed and statistically compared. A receiver operating characteristic (ROC) analysis was performed to assess the diagnostic performance of the two sequences…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMRI in cancer diagnosis · Hepatocellular Carcinoma Treatment and Prognosis · Advanced MRI Techniques and Applications
Introduction
Diffusion-weighted imaging (DWI) visualizes tissue characteristics by measuring the random motion of water molecules, thereby providing valuable information about the tissue microenvironment, such as tissue cellularity and cell membrane integrity [1]. DWI and its corresponding apparent diffusion coefficient (ADC) map have become essential sequences that significantly assist in lesion detection and diagnosis of various diseases, including inflammatory conditions and neoplasms of the upper abdomen [2, 3]. An important clinical application of upper abdominal DWI is the detection and characterization of liver metastases, which is crucial in staging and treatment planning, particularly for patients with colorectal cancer and other primary malignancies [4, 5]. The differential diagnosis of liver metastases from hemangiomas is clinically important, as T2-weighted imaging and dynamic contrast-enhanced studies often pose challenges in distinguishing these lesions [6, 7], making DWI a critical imaging tool for lesion differentiation.
The most commonly used pulse sequence for DWI is single-shot echo-planar imaging (EPI), which samples the entire k-space with a single excitation pulse, enabling rapid image acquisition [8]. This technique offers advantages including fast acquisition times and robustness against bulk motion, but it has inherent limitations such as low signal-to-noise ratio (SNR), reduced spatial resolution, and geometric distortion [9, 10]. Parallel imaging (PI) technology can reduce these distortions [11]; however, increasing the acceleration factor with PI decreases SNR, particularly in regions with high geometric (g-) factors such as the central portion of the image [12]. In addition, the high b-values recommended for the diagnosis of focal liver lesions often result in an insufficient SNR in DWI. Therefore, effective noise reduction methods are highly desirable to address these limitations.
In recent years, compressed sensing (CS) image acquisition and reconstruction techniques have been increasingly used in abdominal MRI [13, 14]. CS is an acceleration technique that requires random undersampling of k-space data, high sparsity of image signals, and nonlinear reconstruction. When combined with PI, CS reconstruction employs iterative L1-regularized denoising filters, such as wavelet-based approaches, to achieve an optimal balance between noise reduction and data consistency [15, 16]. Furthermore, previous studies have shown that applying the CS framework to EPI can reduce noise-like artifacts and improve image quality of EPI-based DWI without further optimization of the EPI sampling scheme [17, 18].
Recent developments in deep learning (DL) with convolutional neural networks (CNNs) have significantly advanced medical imaging applications [19, 20], particularly DL-based image reconstruction techniques, which have demonstrated promising results in denoising for MRI data acquisition and reconstruction [21, 22]. Notably, model-based DL reconstruction techniques represent a cutting-edge development in denoising magnetic resonance images, providing significant advancements in the field [23, 24].
This emergent model-based DL technique holds significant potential for effective noise removal in DWI and is expected to provide superior denoising performance in challenging imaging scenarios across various anatomical regions and pathological conditions [24–26]. However, its clinical utility for detecting and differentiating hepatic lesions has not yet been validated.
Therefore, we aimed to evaluate and compare the performance of diffusion-weighted imaging (DWI) using compressed sensing (CS) and DWI using compressed sensing with model-based deep learning reconstruction (DL-DWI) in detecting and differentiating liver metastases from hepatic hemangiomasDL-DWI.
Materials and methods
The Research Ethics Committee of our institution approved this study (Approval No. 24–159) and waived the requirement for written informed consent due to the retrospective nature of this study.
Patients
We identified 343 consecutive patients who underwent abdominal 3-T magnetic resonance imaging (MRI) examinations between August and December 2024. Patients with liver metastasis or hepatic hemangioma, and prior or follow-up computed tomography (CT) or MR images spanning at least 6 months or more were enrolled in this study. We excluded those whose examination was interrupted (n = 2) and those with metal artifacts due to gastrectomy (n = 1). The demographics of the study population are presented in Fig. 1.Fig. 1. Flowchart of patient selection
MR examinations
MR examinations were performed using a 3 T MRI unit (MR7700; Philips Healthcare, Best, the Netherlands) with a 32-channel ds Torso coil. DWI was obtained using fat-suppressed two-dimensional single-shot EPI with CS reconstruction at b values of 0 and 800 s/mm^2^. Table 1 shows the details of the imaging parameters and the acquisition time. ADC maps were calculated automatically in the MR console using b-values of 0 and 800 s/mm^2^ with a mono-exponential model.Table 1. Acquisition parameters of diffusion-weighted images and other sequencesDiffusion weighted imagingChemical shift imagingT2-weighted imagingSingle-shot turbo spin echoDynamic contrast studyHepatobiliary phaseSequenceEPIFFEMotion freeTurbo spin echoe-THRIVEe-THRIVEFat suppressionSTIR-SPIRSPAIRSPAIRSPAIRAcquisition planeTransverseTransverseTransverseTransverseTransverseTransverseRespiratory compensationRespiratory triggerBreath-holdRespiratory triggerBreath-holdBreath-holdBreath-holdRepetition/echo time (ms)Range (1000–1400)/ 64200/2.3 or 1.15Range 1200–1700/77Shortest (5697)/823/1.53/1.5Echo train-28---Flip angle (degrees)906090901015Section thickness/gap (mm)5.0/1.05.0/1.05.0/1.05.0/1.04.0/04.0/0Number of slices34343436100100Field of view (mm^2^)360 × 304360 × 280360 × 284360 × 298360 × 300360 × 300Acquisition voxel (mm^3^)2.81 × 2.83 × 5.01.61 × 1.50 × 5.001.34 × 1.34 × 5.01.30 × 1.79 × 5.01.2 × 1.5 × 3.41.3 × 1.6 × 3.4Recon matrix256448512512512512Recon voxel (mm^3^)1.41 × 1.41 × 5.00.8 × 0.8 × 5.00.7 × 0.7 × 5.00.7 × 0.7 × 5.00.7 × 0.7 × 2.00.7 × 0.7 × 1.7BW in EPI freq dir (Hz)2742.71535.9545.5503.2868.5722.7Parallel imaging (PI)CS-SENSE/AI CS-SENSEAI CS-SENSEAI CS-SENSEAI CS-SENSEAI CS-SENSECS-SENSEPI factor2.52.42.54.54.53.7Half scan0.8920.85No0.9NoNoMatrix128 × 107224 × 193268 × 211276 × 167320 × 192320 × 224NSAb0 (2), b 800 (3)11.4111Acquisition time (sec)14416.316413.513.1 × 416*Dynamic contrast study consists of pre-contrast, arterial, portal venous, and transitional phases. In dynamic contrast study, a dose of 0.1 mL/kg of gadoxetic acid was administered at a rate of 1 mL/s followed by a 20-mL saline flush using a power injector and was used a bolus tracking techniqueEPI: echo-planar imaging, FFE: fast field echo, SPIR; spectral presaturation with inversion recovery, e-THRIVE; T1-high resolution isotropic volume excitation, STIR: short tau inversion recovery. SPIR = spectral presaturation with inversion recovery, SPAIR = spectral attenuated inversion recovery, NEX: number of excitations, CS-SENSE = compressed SENSE, AI CS-SENSE = compressed SENSE with deep learning reconstruction, NSA: number of sample [signals] averaged
Model-based DL reconstruction
We implemented a model-based DL reconstruction technique using Adaptive-CS-Net, deployed through a vendor prototype (Next Generation Scan Acceleration patch). The comprehensive training and optimization methods for Adaptive-CS-Net have been previously described [23]. This model-based DL reconstruction technique achieves effective denoising by performing iterative image processing on undersampled k-space data using the Adaptive-CS-Net architecture. In this model’s pipeline, Adaptive-CS-Net serves as a functional replacement for the wavelet transform component in conventional CS-based image reconstruction algorithms. The network architecture comprises a U-Net structure integrated with a soft thresholding function, enabling effective image denoising through sparsification.
A distinctive feature of this reconstruction model is its integration of domain-specific prior knowledge for verifying data consistency. At each iteration, the system performs consistency checks by comparing images before and after processing through Adaptive-CS-Net to ensure optimal denoising performance. This iterative framework facilitates the removal of noise while preserving and enhancing the original signal, ultimately achieving substantial improvements in image quality. All image processing was performed within the MR system’s console.
In this study, we used an enhanced version of Adaptive-CS-Net beyond that originally described in Pezzotti et al. [23], and incorporated additional improvements detailed in Foreman et al. [24]. The enhanced network underwent extensive pretraining on a large, diverse dataset comprising 1.5T and 3T images across multiple anatomical regions and contrast mechanisms. The network was optimized for execution on standard reconstruction hardware to ensure a robust performance and broad clinical applicability. Previous clinical evaluations have demonstrated the effectiveness of this approach for head and neck and prostate DWI applications [25, 26].
We intentionally excluded PI-based acceleration, commonly employed in similar studies to isolate the image quality improvement effects of model-based DL reconstruction. This methodological approach enabled single DWI acquisitions rather than comparative dual scans, thereby eliminating potential confounding factors such as variations in acquisition conditions and incidental patient movement between consecutive scans. Consequently, the reconstruction was performed on the console using DWI using single-shot EPI with CS (CS-DWI), yielding the following four image types: CS-DWI, CS-DWI with model-based DL reconstruction-enhanced CS (DL-DWI), and ADC maps based on CS-DWI and DL-DWI.
Identification of hepatic lesions and imaging diagnosis
Two board-certified radiologists (with 25 and 29 years of post-training experience in interpreting abdominal MRI images) detected and diagnosed all liver metastases and hepatic hemangiomas in consensus with reference to all MRI sequences obtained simultaneously and previous and following contrast-enhanced CT and MR images spanning at least 6 months or more.
Hemangiomas were diagnosed based on characteristic imaging findings, including nodular or peripheral discontinuous enhancement during the arterial phase and progressive centripetal fill-in enhancement during the portal venous or delayed phases on contrast-enhanced CT or MRI. On T2-weighted and fat-suppressed T2-weighted images, the lesions demonstrated hyperintensity relative to the liver parenchyma, but with signal intensity lower than that of water [27]. The diagnosis was further supported by the absence of malignancy in the patient’s medical history and documented lesion stability (no size change) over a minimum follow-up period of 6 months.
In contrast, liver metastases were diagnosed based on clinical history of pathologically confirmed malignancy combined with characteristic imaging findings. The diagnostic imaging criteria included peripheral and peritumoral enhancement during the arterial or portal venous phases on contrast-enhanced CT or MRI, and/or hypointensity on hepatobiliary phase images of gadoxetic acid-enhanced MRI [28]. Lesion appearance or size changes during the follow-up period provided additional diagnostic support. No hypervascular metastases were observed.
Two radiologists reported the diagnosis, size, and location of the lesions. All nodules detected and diagnosed by these two radiologists underwent qualitative evaluation by two additional radiologists as described below. Quantitative evaluation was performed for nodules measuring ≥ 5 mm. All image analyses were conducted with the final diagnosis blinded to the evaluating radiologists.
Quantitative image analysis
Two board-certified radiologists (readers 1 and 2, with 5 and 16 years of imaging experience, respectively) performed quantitative image analysis independently. Readers were blinded to sequence types, and could adjust the window settings. They were also blinded to the patient’s clinical information and the image acquisition method, and all images were anonymized. They reviewed two types of images presented independently in a random order. Although the qualitative diagnosis of the nodules was unknown (hemangioma or liver metastasis), the nodules to be measured were pre-specified. Hepatic cysts were not included in this study as they can be readily differentiated when assessed alongside other routine sequences, particularly T2-weighted imaging. All regions of interest (ROI) were drawn using the SAI viewer (SYNAPSE®, FUJIFILM Medical, Tokyo, Japan). Circular ROIs were placed in artifact-free, homogeneous areas (excluding vessels) on the right robe of the liver and on focal liver lesions to measure the signal intensity (SI), standard deviation (SD), and mean ADC values on DWI (b value = 800 s/mm^2^). Once an ROI was placed on one image, it was copied and pasted onto the other DWI image. Measurements were performed in triplicates to ensure data consistency and reproducibility, and the mean of the three ROIs were used to represent each region.
Given the geometric variation in noise across the field of view of the image resulting from the use of PI, the SD of SIs in the background air could not be used as a measurement of noise. Therefore, we calculated the estimated SNR of each object using the method proposed by Heverhagen, as follows [29]:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$SNR = \frac{{SI_{liber} }}{{SD_{noise} }}$$\end{document}where SI_liver_ and SD_noise_ represent the mean SI and SD of the liver parenchyma and erector spinae, respectively.
The CNR for the focal liver lesions was calculated as follows [30]:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{CNR}} = \left| {{\mathrm{SI}}_{{{\mathrm{liver}}}} - {\mathrm{SI}}_{{{\mathrm{lesion}}}} } \right|/\surd \left[ {\left( {{\mathrm{SD}}_{{{\mathrm{liver}}}}^{2} + {\mathrm{SD}}_{{{\mathrm{liver}}}}^{2} } \right)/2} \right]$$\end{document}where SI_liver_ and SD_liver_ represent the SI and SD of the liver parenchyma, and SI_lesion_ and SD_lesion_ represent the SI and SD of the focal liver lesion.
Qualitative image analysis
The same board-certified radiologists independently evaluated the CS-DWI and DL-DWI at a b value of 800 s/mm^2^. The evaluations included [31]: contour and distortion of the liver, artifacts (including motion, ringing, partial volume, and susceptibility artifacts), and overall image quality (as reflected in the general image impression). Each evaluation was scored using a 5-point Likert scale, and the conspicuity of the hepatic lesions was also assessed. The criteria are listed in Table 2.Table 2. Scoring criteria for qualitative image evaluationScoreContour of the liverDistortion of the liverImage noiseArtifactsOverall image qualityLesion conspicuity5Smooth regular contourNormal liver morphologyNo noiseNo artifactsExcellentComplete lesion visualization, optimal for detection4Mild surface nodularityMinimal surface irregularityMinimal noiseMinimal artifactsGoodGood lesion visualization, adequate for detection3Moderate undulating borderModerate contour deformationMild noise causing little impact on diagnosisMild artifacts causing little impact on diagnosisAcceptableAcceptable lesion visualization, detection possible2Irregular coarse marginMarked architectural distortionModerate noise causing substantial impactModerate artifacts causing substantial impact on diagnosisModerately poorLimited lesion visualization, detection difficult1Severely distorted outlineSevere structural derangementNondiagnostic due to severe noiseSevere artifactsDefinitely poorPoor lesion visualization, detection not feasible
Statistical analysis
Continuous variables are expressed as mean ± SD. Data normality was confirmed using the Shapiro–Wilk test. Subsequently, SNR of the liver parenchyma, CNR of the lesions, and mean ADC values of the liver parenchyma and lesions were compared between the CS-DWI and DL-DWI using the Wilcoxon signed-rank test. Intra-observer agreement of each reader and interobserver agreement between the two readers for SNR and CNR measurements were evaluated using intraclass correlation coefficients (ICCs). A 2-way mixed model was employed to calculate ICCs with 95% confidence intervals. ICC (3,1) was used to assess intra-observer agreement within each reader’s three repeated measurements, and ICC (2,1) was used to evaluate inter-observer agreement between the two readers. Agreement based on the ICCs was classified using the following definitions: < 0.50, poor; 0.50–0.74, moderate; 0.75–0.89, good; and 0.90–1.00, excellent [32]. Qualitative image scores were compared between the CS-DWI and DL-DWI using the Wilcoxon signed-rank test. Cohen’s weighted Kappa analysis assessed interreader agreement (0.01–0.20, poor; 0.21–0.40, fair; 0.41–0.60, moderate; 0.61–0.80, substantial; and 0.81–1.00, almost perfect agreements) [33].
The optimal ADC cutoff value for differentiating liver metastases from hemangiomas was determined using receiver operating characteristic (ROC) analysis, which yielded the maximal sensitivity and specificity. The diagnostic performance of CS-DWI and DL-DWI was evaluated using the optimal ADC cutoff values. In ROC analyses, the mean ADC values reported by the two radiologists were considered representative of the ADC values. The area under the ROC curve (AUC) was compared between CS-DWI and DL-DWI using DeLong’s test. The sensitivity and specificity for differentiating between the two entities in CS-DWI and DL-DWI were examined using McNemar’s test. For all statistical analyses, two-sided p-values of < 0.05 denoted statistical significance. All statistical analyses were performed using SPSS software (version 27.0; IBM Corp., Armonk, NY, USA).
Results
Clinical characteristics of patients
This study included 53 consecutive patients (22 with liver metastases and 31 with hepatic hemangiomas): 23 males and 38 females with a mean age of 65.9 ± 10.1 years. None of the patients had both lesions. Those with metastases were significantly older than those with hemangiomas (p < 0.05). Metastases were significantly larger than hemangiomas (p < 0.05). Additional details on patients and lesions characteristics are presented in Table 3. Representative cases are shown in Figs. 2 and 3.Table 3. Characteristics of the subjectsCharacteristicsTotalLiver metastasisHepatic hemangiomaP valueCase532231–Age, years65.9 ± 10.170.6 ± 7.463.1 ± 10.70.003Sex; male/female34/1912/1022/90.284Body weight61.0 ± 13.058.3 ± 12.862.9 ± 13.00.889Hight162.2 ± 8.3160.1 ± 9.9163.5 ± 6.90.153Body surface area1.64 ± 0.191.60 ± 0.041.68 ± 0.190.802NodulesNumber of lesions1377859–Number of lesions (1/≧ 2)44/915/729/2–≧ 5 mm/ < 5 mm92/4559/1922/26–Size (mm)12.1 ± 8.514.2 ± 8.78.9 ± 8.5 < 0.001Location (lateral, medial, anterior, and posterior segments, and caudate lobe)17/13/26/31/510/9/20/18/27/4/6/13/30.307Lesions within 1 cm of the liver surface5128230.199Primary tumor–Colorectal (10), pancreatic (5), gastric cancer (3), and others (4)––Continuous variables are expressed as mean ± standard deviation. Categorical variables are expressed as the number of casesFig. 2Representative cases of liver metastasis in a 78-year-old man with sigmoid colon cancer. a DWI (b-value = 800 s/mm^2^) image with compressed sensing (CS-DWI). b DWI image with CS and model-based image reconstruction (DL-DWI). c ADC map (derived from b0 and b800) of CS-DWI. (d) ADC map (derived from b0 and b800) of DL-DWI. A 16-mm liver metastasis in segment 4 demonstrated high signal intensity on DWI; however, it appeared distorted with decreased signal attenuation due to artifacts on CS-DWI (a). DL-DWI (b) provides superior visualization of the entire nodule. The ADC map of CS-DWI (c) shows more prominent distortion and artifacts compared to that of DL-DWI (d). DWI = diffusion-weighted imaging, CS-DWI = DWI using single-shot echo planar imaging with compressed sensing, DL-DWI = DWI using echo planar imaging with model-based deep learning (DL) reconstruction-enhanced CS, ADC = apparent diffusion coefficientFig. 3Representative cases of hepatic hemangioma in a 47-year-old woman. a DWI (b-value = 800 s/mm^2^) image with compressed sensing (CS-DWI). b DWI image with CS and model-based image reconstruction (DL-DWI). c ADC map (derived from b0 and b800) of CS-DWI. d ADC map (derived from b0 and b800) of DL-DWI. A 32-mm hepatic metastasis in segment 6 demonstrated high signal intensity on DWI; however, it appeared distorted with decreased signal attenuation due to artifacts on CS-DWI (a). DL-DWI (b) provides superior visualization of the entire nodule. DWI = diffusion-weighted imaging, CS-DWI = DWI using single-shot echo planar imaging with compressed sensing, DL-DWI = DWI using echo planar imaging with model-based deep learning (DL) reconstruction-enhanced CS, ADC = apparent diffusion coefficient
Quantitative image evaluation
The SNR of liver parenchyma in DL-DWI was significantly higher, while the ADC values were significantly lower than those in CS-DWI (p < 0.05). The CNR and ADC values of liver metastases and hepatic hemangiomas in DL-DWI were significantly higher than those in CS-DWI (p < 0.05). Good to excellent intra-observer agreement was found for both readers’ measurements (ICC [1, 3] = 0.907–0.950 and 0.920–0.981, respectively). Good agreement was found between the two readers’ measurements (ICC [1, 2] = 0.751–0.852) (Table 4). No significant difference was observed between CS-DWI and DL-DWI in AUC (0.959 and 0.956 in CS-DWI and DL-DWI, respectively; p = 0.853), sensitivity (81.7% and 81.8% in CS-DWI and DL-DWI, respectively; p > 0.999), and specificity (97.0% and 96.7% in CS-DWI and DL-DWI, respectively; p > 0.999; Fig. 4). To differentiate liver metastases from hepatic hemangiomas, the optimal cutoff ADC values were 1.411 × 10^–6^ mm^2^/s and 1.693 × 10^–6^ mm^2^/s for CS-DWI and DL-DWI respectively.Table 4. Quantitative image analyses resultsCS-DWIDL-DWIP valueICC (2, 1) [95% CI]Liver parenchyma (n = 53)**SNR0.842 [0.792–0.889]Reader 19.71 ± 4.6711.00 ± 5.63 < 0.001ICC (3, 1) [95% CI]0.936 [0.922–0.955]0.931 [0.919–0.950]–Reader 213.88 ± 6.5317.12 ± 9.80 < 0.001ICC (3, 1) [95% CI]0.969 [0.959–0.978]0.967 [0.958–0.974]–ADC value (× 10^–6^* mm*^2^/s)0.849 [0.791–0.888]Reader 11.370 ± 0.2451.322 ± 0.317 < 0.001ICC (3, 1) [95% CI]0.907 [0.901–0.916]0.911 [0.903–0.917]–Reader 21.258 ± 0.2091.169 ± 0.188 < 0.001ICC (3, 1) [95% CI]0.927 [0.915–0.936]0.920 [0.914–0.928]–Liver metastases (n = 59)**CNR0.811 [0.765–0.849]Reader 125.6 ± 16.628.4 ± 18.4 < 0.001ICC (3, 1) [95% CI]0.950 [0.938–0.961]0.946 [0.932–0.955]–Reader 242.4 ± 26.748.9 ± 30.1 < 0.001ICC (3, 1) [95% CI]0.971 [0.958–0.983]0.977 [0.960–0.988]–ADC value (× 10^–6^* mm*^2^/s)0.751 [0.701–0.823]Reader 11.042 ± 0.4071.010 ± 0.421 < 0.001ICC (3, 1) [95% CI]0.911 [0.903–0.920]0.921 [0.913–0.933]Reader 21.105 ± 0.4911.079 ± 0.510 < 0.001ICC (3, 1) [95% CI]0.961 [0.943–0.968]0.963 [0.965–0.989]–Hepatic hemangioma (n = 33)**CNR0.852 [0.735–0.915]Reader 122.58 ± 18.9327.09 ± 22.37 < 0.001ICC (3, 1) [95% CI]0.925 [0.920–0.933]0.929 [0.921–0.938]Reader 223.03 ± 16.6928.20 ± 21.49 < 0.001ICC (3, 1) [95% CI]0.975 [0.961–0.985]0.981 [0.965–0.989]–ADC value (× 10^–6^* mm*^2^*/s)*0.793 [0.660–0.878]Reader 11.977 ± 0.4662.000 ± 0.479 < 0.001ICC (3, 1) [95% CI]0.920 [0.902–0.941]0.932 [0.914–0.974]Reader 21.792 ± 0.5541.776 ± 0.545 < 0.001ICC (3, 1) [95% CI]0.951 [0.938–0.963]0.950 [0.940–0.961]–Data are expressed as mean score ± standard deviationCS-DWI, diffusion-weighted imaging (DWI) using single-shot echo planar imaging with compressed sensing (CS); DL-DWI, DWI using echo planar imaging with model-based deep learning (DL) reconstruction-enhanced CS; ICC, intraclass correlation coefficient; CI, confidence interval; SNR, signal-to-noise ratio; ADC, apparent diffusion coefficient; CNR, contrast-to-noise ratioFig. 4Receiver operating characteristic curves for differentiating liver metastases from hepatic hemangiomas by ADC values. No significant differences were observed in AUC, sensitivity, or specificity between CS-DWI and DL-DWI in differentiating liver metastases from hemangiomas based on ADC values. DWI = diffusion-weighted imaging, CS-DWI = DWI using single-shot echo planar imaging with compressed sensing, DL-DWI = DWI using echo planar imaging with model-based deep learning (DL) reconstruction-enhanced CS, ADC = apparent diffusion coefficient, AUC = area under the receiver operating characteristic curve
Qualitative image analysis
DL-DWI showed significantly high image qualities regarding liver contour and distortion, image noise, artifact, and overall image quality compared with CS-DWI (p < 0.05). Concerning the lesion conspicuity, no significant differences were observed in hepatic hemangioma conspicuity between DL-DWI and CS-DWI in Reader 1 (p = 0.028). In others, the conspicuity of lesions on DL-DWI images was significantly higher than that on CS-DWI images (p < 0.05). Particularly, lesions within 1 cm from the liver surface showed significantly higher conspicuity on DL-DWI compared to CS-DWI for Readers 1 and 2 (p < 0.05). The results showed fair to moderate agreement (Cohen’s κ; 0.58–0.69) (Table 5 and Fig. 5).Table 5. Qualitative image analyses resultsCS-DWIDL-DWIP valueκ-valueDistortion of the liver0.67Reader 13 (3–4)4 (3–4) < 0.001Reader 23 (2–4)4 (4–4) < 0.001Liver contour0.59Reader 14 (3–5)4 (4–5)0.005Reader 23 (3–4)4 (4–4) < 0.001Artifact0.68Reader 13 (3–4)4 (3–4)0.001Reader 23 (3–3)4 (3–4) < 0.001Image noise0.61Reader 13 (2–4)4 (3–5)0.001Reader 23 (3–4)4 (4–4) < 0.001Overall image quality0.69Reader 13 (3–4)4 (3–5) < 0.001Reader 23 (3–4)4 (4–4) < 0.001Lesion conspicuity of all lesions (n = 137)0.63Reader 14 (3–5)4 (4–5)0.028Reader 25 (3–5)5 (4–5) < 0.001Liver metastasis (n = 78)0.62Reader 15 (3–5)5 (4–5)0.028Reader 25 (3.25–5)5 (4–5)0.002Hepatic hemangioma (n = 59)0.63Reader 14 (3–5)4 (3–5)0.637Reader 24 (3–5)4 (3–5)0.382Lesions within 1 cm from the liver surface (n = 51)0.62Reader 14 (3–4)5 (4–5)0.028Reader 24 (3–4)5 (4–5)0.002*Liver metastasis (n = 28)0.62Reader 14 (3–4)5 (4–5)0.019Reader 24 (3–4)5 (4–5)0.002Hepatic hemangioma (n = 23)0.62Reader 14 (3–5)5 (4–5)0.003Reader 24 (3–4)5 (4–5)0.035Data are expressed as medians with interquartile range in parenthesesCS-DWI, diffusion-weighted imaging (DWI) using single-shot echo planar imaging with compressed sensing (CS); DL-DWI, DWI using echo planar imaging with model-based deep learning (DL) reconstruction-enhanced CSFig. 5Stacked bar charts for qualitative image assessment stratified. a. Distortion of the liver, b. Liver contour, c. Artifact, d. Image noise, e. Overall image quality, f. All lesion conspicuity, g. Conspicuity of liver metastasis, h. Conspicuity of hepatic hemangioma, i. Lesion conspicuity within 1 cm from the liver surface, j. Liver metastasis conspicuity within 1 cm from the liver surface, k. Hepatic hemangioma conspicuity within 1 cm from the liver surface. An asterisk () indicates a significant difference (p < 0.05)
Discussion
This study demonstrated that DL-DWI significantly improved image quality and enhanced the conspicuity of hepatic lesions, including hemangiomas and liver metastases, compared to conventional CS-DWI, particularly for nodules located within 1 cm of the liver surface. These findings represent a significant advancement in abdominal DWI technology with important clinical implications. Our method of comparing techniques within single acquisitions rather than using separate scans strengthens the validity of our findings by eliminating potential confounding factors, such as patient movement, physiological variations, and scanner stability issues.
The superior performance of model-based DL reconstruction addresses the inherent limitations of conventional CS reconstruction. Single-shot EPI is the most frequently used technique in clinical DWI; however, combining it with PI methods to reduce geometric image distortions often decreases SNR in regions with high g-factors, particularly in the central area of the image. This reduction in SNR poses challenges in the evaluation of upper abdominal lesions. While the compressed SENSE framework applied to EPI (CS-DWI) reduces noise-like artifacts and improves image quality without further optimization of the EPI sampling scheme [17, 18], we speculate that noise and artifact reduction by L1 regularization might be limited, as it is an algorithm-based method with a set threshold value in the wavelet transformation process. The Adaptive-CS-Net architecture effectively addresses these limitations by leveraging learned features from large training datasets to achieve more advanced denoising than conventional wavelet-based approaches. The model-based DL reconstruction approach efficiently utilizes, processes, and integrates large amounts of signal data during iterative image processing, enabling effective noise reduction [23, 24] while preserving clinically relevant signal characteristics.
Our findings revealed that DL-DWI demonstrated significant improvements in lesion conspicuity as well as in liver contours, reduction of liver distortion, artifact, image noise, and overall image quality compared with CS-DWI. The scores for pancreatic distortion did not differ significantly, and this may be attributed to the fact that CS-DWI images had already achieved improved image quality through the compressed SENSE framework. The enhanced image quality resulted in improved lesion conspicuity for hemangiomas and liver metastases, particularly for lesions located within 1 cm from the liver surface. This selective improvement indicates enhanced image quality in regions susceptible to geometric distortion caused by B0 inhomogeneity, particularly areas in adjacent to air-containing structures, such as the lungs and regions prone to gastrointestinal artifacts. The lack of overall significant difference is likely due to the markedly high signal intensity of hemangiomas on b800 DWI due to T2-shine through effect, making them inherently conspicuous and rarely compromised to the extent that detection becomes challenging. The improvement in lesion visibility has direct clinical implications, potentially enhancing diagnostic confidence, reducing interpretation time, and facilitating earlier detection of pathological changes. The ability to better visualize hepatic lesions is particularly relevant in oncological applications where accurate lesion detection and characterization are critical for treatment planning and monitoring.
These quantitative improvements were further supported by the significantly higher SNR of the liver parenchyma and CNR of the focal liver lesions on DL-DWI than on CS-DWI. A crucial quantitative observation was that the ADC values of the liver and focal liver lesions on DL-DWI were significantly lower than those on CS-DWI images. Notably, some studies have reported no significant difference in ADC values between DL-DWI and conventional DWI in liver imaging [34], while others have reported lower ADC values in DL-DWI compared to conventional DWI in liver imaging [35]. However, these comparisons involved PI-DWI, and no studies have compared CS and AICS similar to our study. The underlying mechanisms remain unclear, although we speculate that the significant difference in background noise levels between DL-DWI and CS-DWI may have contributed to the observed differences in ADC value. Nevertheless, theoretically, ADC bias and random measurement error in DWI can be effectively reduced by model-based deep DL reconstruction [36].
The influence of model-based DL reconstruction on ADC values suggests that the cutoff ADC values used to differentiate between the two lesion types could be altered. Previous studies reported ADC cutoff values for differentiating liver metastases and hepatic hemangiomas ranging from 1.33 × 10⁻^3^ to 1.70 × 10⁻^3^ mm^2^/s [18, 37–39]. In our study, the ADC cutoff values of 1.693 × 10⁻^3^ mm^2^/s and 1.411 × 10⁻^3^ mm^2^/s measured on DL-DWI fell within this range. In addition, in this study, despite slight variations in the ADC cutoff values for distinguishing liver metastases from hepatic hemangiomas, no significant difference was observed in the discriminative ability between DL-DWI and CS-DWI. However, despite being within the reported range, the observed variations in cutoff values highlight that ADC measurements can be influenced by various factors, including equipment specifications, CS techniques, and different DL reconstruction methods. This variability underscores the critical need for continuous data updates as imaging equipment and reconstruction technologies evolve. In the ROC analysis using the cutoff values obtained in this study, the discriminative ability for differentiating liver metastases from hepatic hemangiomas was 0.956 DL-DWIand 0.959 for DL-DWI and CS-DWI, respectively, showing no significant difference between them. However, both values were higher than the AUC of 0.89 reported by Kaga et al. using CS-DWI analysis. We determined that ADC measurements could be obtained from images of nodules measuring ≥ 5 mm, which included smaller nodules compared to Kaga et al.'s study, which targeted nodules of ≥ 10 mm. We attribute these findings to the high specifications of the imaging equipment used in our study. Furthermore, establishing universally applicable diagnostic criteria that maintain reliability across different institutions would require extensive multi-center data collection and standardization efforts to account for these technical variations.
The present study had some limitations. First, the sample size was relatively small, and the study was conducted at a single center, which may have introduced selection bias. Therefore, our results should be regarded as preliminary. Second, MRI parameters such as acceleration factor, b-values, and acquisition time have not yet been optimized, requiring further trials for developing a standard methodology. Third, we did not directly compare our model-based DL reconstruction with other DL-based reconstruction techniques. Fourth, while we observed improved conspicuity of hemangiomas and liver metastases, quantitative assessment revealed no significant difference in the discriminative ability between hepatic metastases and hepatic hemangiomas. Further qualitative and quantitative investigations are needed for other types of focal liver lesions and lesions in other upper abdominal organs, such as gallbladder, pancreas, and spleen, to evaluate the feasibility of AICS, which may pose the risk of suppressing small anatomical details.
Future research should focus on a comprehensive evaluation of diagnostic performance for specific pathological conditions with histopathological correlation. Investigation across different b-values, optimization of acquisition parameters, and larger multi-center studies are needed to validate these preliminary findings and establish standardized protocols for clinical implementation.
In conclusion, this study provides compelling evidence that model-based DL reconstruction with Adaptive-CS-Net significantly enhances the clinical utility of abdominal DWI by improving image quality and lesion conspicuity, particularly for hemangiomas and liver metastases. These findings suggest that integrating advanced reconstruction techniques represent a promising direction for optimizing diffusion-weighted imaging in clinical practice, though further validation studies are required to establish its full clinical potential.
