Reverse Imaging for Wide-spectrum Generalization of Cardiac MRI Segmentation
Yidong Zhao, Peter Kellman, Hui Xue, Tongyun Yang, Yi Zhang, Yuchi Han, Orlando Simonetti, Qian Tao

TL;DR
This paper introduces Reverse Imaging, a physics-driven data augmentation method that infers underlying spin properties from cardiac MRI images, enabling improved segmentation across diverse imaging sequences and protocols.
Contribution
The paper presents a novel inverse problem approach using a learned spin prior to synthesize images and enhance generalization in cardiac MRI segmentation.
Findings
Achieves accurate segmentation across different MRI contrast protocols.
Enables meaningful spin-property estimation from images.
Supports wide-spectrum generalization of segmentation models.
Abstract
Pretrained segmentation models for cardiac magnetic resonance imaging (MRI) struggle to generalize across different imaging sequences due to significant variations in image contrast. These variations arise from changes in imaging protocols, yet the same fundamental spin properties, including proton density, T1, and T2 values, govern all acquired images. With this core principle, we introduce Reverse Imaging, a novel physics-driven method for cardiac MRI data augmentation and domain adaptation to fundamentally solve the generalization problem. Our method reversely infers the underlying spin properties from observed cardiac MRI images, by solving ill-posed nonlinear inverse problems regularized by the prior distribution of spin properties. We acquire this "spin prior" by learning a generative diffusion model from the multiparametric SAturation-recovery single-SHot acquisition sequence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced MRI Techniques and Applications · Medical Image Segmentation Techniques · Cardiac Imaging and Diagnostics
11institutetext: Department of Imaging Physics, Delft University of Technology, Delft, The Netherlands 22institutetext: National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland, USA 33institutetext: Microsoft Research 44institutetext: Cardiovascular Division, The Ohio State University Wexner Medical Center, Columbus, Ohio, USA
44email: [email protected]
Reverse Imaging for Wide-spectrum Generalization of Cardiac MRI Segmentation
Yidong Zhao 11
Peter Kellman 22
Hui Xue 33
Tongyun Yang 11
Yi Zhang 11
Yuchi Han 44
Orlando Simonetti 44
Qian Tao 11
Abstract
Pretrained segmentation models for cardiac magnetic resonance imaging (MRI) struggle to generalize across different imaging sequences due to significant variations in image contrast. These variations arise from changes in imaging protocols, yet the same fundamental spin properties, including proton density, , and values, govern all acquired images. With this core principle, we introduce Reverse Imaging, a novel physics-driven method for cardiac MRI data augmentation and domain adaptation to fundamentally solve the generalization problem. Our method reversely infers the underlying spin properties from observed cardiac MRI images, by solving ill-posed nonlinear inverse problems regularized by the prior distribution of spin properties. We acquire this “spin prior” by learning a generative diffusion model from the multiparametric SAturation-recovery single-SHot acquisition sequence (mSASHA) dataset, which offers joint cardiac and maps. Our method enables approximate but meaningful spin-property estimates from MR images, which provide an interpretable “latent variable” that lead to highly flexible image synthesis of arbitrary novel sequences. We show that Reverse Imaging enables highly accurate segmentation across vastly different image contrasts and imaging protocols, realizing wide-spectrum generalization of cardiac MRI segmentation.
Keywords:
CMR Segmentation Reverse Imaging MR Physics.
1 Introduction
Segmentation is crucial for evaluating cardiac biomarkers from cardiac magnetic resonance imaging (MRI), for example, the ejection fraction from balanced steady-state free-precession (bSSFP) cine imaging [3, 4]. Learning-based methods have become the standard practice for cardiac MRI segmentation [15, 3, 4, 17], mostly trained on widely accessible bSSFP cine images with manual annotations [3, 4, 17]. However, models trained on bSSFP cine are prone to fail in test images with heterogeneous contrasts [4, 17]. This can be partially mitigated by data augmentation (e.g. blur, gamma correction, etc.) [15], which effectively generalizes pretrained models to bSSFP cine images cross vendors and centers [27, 4, 6].
However, a much stronger contrast change comes from different MRI sequences, which cannot be tackled by common image augmentation techniques. For example, an alternative to bSSFP is the gradient echo sequence (GRE), which is less affected by field inhomogeneities [23] and more suitable for patients with implanted devices [10]. However, GRE has much poorer contrasts and easily fails the models trained on bSSFP. An even stronger contrast change occurs in quantitative imaging, such as mapping by the modified Look-Locker Inversion Recovery Sequence (MOLLI) [18]. In MOLLI, the image readouts have substantial contrast variations during the spin relaxation process, alternating between bright and dark blood, with low myocardium-blood contrast. Consequently, segmentation models trained on bSSFP images (source domain) can fail completely on MOLLI images (target domain) even with extensive data augmentation.
To tackle the cross-sequence generalization challenge, previous works have focused on disentangling “content” and “style” [20, 19]. BayeSeg learns to remove the style and extracts the boundary for segmentation [11, 12]. Domain adaptation techniques strive to learn content and style embeddings [5, 28]. The learned “content” is supposed to be shared between sequences, while cross-sequence translation is achieved by interchanging the learned “style” [5, 28, 9]. However, such strategies require data from new sequences in order to learn the disentangling, and, importantly, the extracted content or style is barely interpretable. In this study, we argue that the fundamental but overlooked “content” is the underlying spin properties, including the voxel-wise proton density (PD), , and [13] of blood and tissue. Given this content, the style is fully governed by the MRI signal models of new pulse sequences. This motivates us to infer the spin properties from the observed image, for more interpretable and physics-grounded image translation to tackle the cross-sequence domain generalization challenge.
Unlike quantitative MRI that directly maps spin properties [18, 16, 7], retrieving the underlying spin properties from a qualitative image such as bSSFP is ill-posed because infinitely many solutions may explain the same image. This challenge parallels that of accelerated MRI, where partial measurements are insufficient for image reconstruction, yet the inverse problem can be solved by incorporating a regularizer that defines the prior distribution of plausible images [21]. Recent MRI reconstruction works use generative diffusion models [14] as a priori regularizers, allowing high-quality reconstruction from partial measurements [26, 8]. Following the same spirit, we introduce a diffusion-based generative prior on spin properties, called “spin prior”, to regularize the ill-posed inverse problem of spin property estimation. Specifically, we leverage the multiparametric SAturation-recovery single-SHot acquisition (mSASHA) datasets, which perform joint cardiac and mapping, to learn the spin prior. We term the process of inferring spin properties from observed images as Reverse Imaging. Reverse Imaging uniquely enables physics-grounded data augmentation and domain adaptation without the need for target domain data. We will show that Reverse Imaging leads to high-quality zero-shot generalization to a wide spectrum of MRI sequences. We make the following contributions:
- •
We propose an interpretable cross-sequence translation framework for cardiac MRI, which explicitly formulates the translation as applying MRI physics forward models to estimated tissue spin properties (i.e. , and ).
- •
We introduce Reverse Imaging, a novel approach that reversely infers spin properties from observed MR images using a physics-guided reverse diffusion process. The diffusion model serves as a generative prior that regularizes the estimation of the underlying spin properties.
- •
With Reverse Imaging, we achieve high-quality zero-shot generalization of cardiac MRI segmentation, from bSSFP to a wide spectrum of unseen contrasts and imaging protocols.
2 Method
2.1 Physics-based Cross-Sequence Translation
2.1.1 Physics-Prior Decomposition
Let and be images acquired using sequences and , on which we aim to perform cross-sequence translation. The direct translation from to is intractable. To solve this, we propose a decomposition of , by introducing an underlying physics as a “latent variable” :
[TABLE]
A probability graph model is shown in Fig. 1. Reverse Imaging can be formulated as posterior inference , expressed as:
[TABLE]
As a result of (1) and (2), the inference relies on two likelihoods and , which are directly accessible from the imaging physics forward model. The only missing piece for inference, therefore, is the prior distribution on the spin properties . Moreover, we can interchange and in (1) once the prior is known, allowing dual-direction translation of both and .
2.1.2 Likelihoods from MR Imaging Physics
The log-likelihood of observing image from spin properties under sequence is:
[TABLE]
where the signal equation (i.e. forward model) can be expressed by the Bloch equation of MR imaging [13]. Below, we list the signal equations for the sequences involved in this work, namely, bSSFP, MOLLI mapping, and GRE. We denote the flip angle (FA) as , the repetition and echo times as and . The bSSFP imaging equation [24] with short and is:
[TABLE]
For mapping, the phase-insensitive readout [25] at inversion time is:
[TABLE]
where the apparent is and the inversion factor . For GRE cine images from patients with implanted devices, let and , the GRE signal model [13] is described as
[TABLE]
2.2 Generative Prior on Spin Properties from mSASHA
To learn the spin prior , we leverage an advanced quantitative MRI sequence mSASHA, which enables joint and mapping [7]. mSASHA consists of a train of saturation recovery pulses (SR) with SR time , followed by a -prep pulse with duration and echo time . Its signal equation is:
[TABLE]
where the unknowns can be extracted from multiple acquisitions of with varying , and by least-square fitting [7]. encodes and sequence parameters, and the approximation can be used [1, 2].
To model the prior of spin properties , we propose to train a denoising diffusion probabilistic model (DDPM) [14] in the space. DDPM adds noise to the real ’s from mSASHA in the forward process for steps, where is the noise scheduler at and . The learned model can generate data by starting with a Gaussian distribution and reversing the diffusion process:
[TABLE]
where and . The diffusion model learns a denoising backbone parameterized by , , , as the generative prior for .
2.3 Reverse Imaging and Cross-Sequence Synthesis
Reverse Imaging searches for in the prior space that best explains the observation . Following conditional generation principles [8], we replace in (8) with :
[TABLE]
We approximate the time-step-dependent likelihood by: , where [8]. Starting from , the recursive process for reverse the imaging sequence is
[TABLE]
where is a step size parameter. The guided reverse diffusion process in (10) yields the estimated spin properties in the prior distribution that reconstructs . After Reverse Imaging, we can synthesize images of novel sequences using the imaging physics equations described in (4)-(6). Synthesizing other imaging sequences is also possible by sequence-specific Bloch equations. In addition, we can also use the image physics of to guide generation by replacing , with , in (10) and translating from target to source (T2S).
3 Experiments
3.0.1 Datasets
We used the ACDC dataset [3] training split with bSSFP cine images of 100 subjects as the source domain dataset. The mSASHA dataset for learning the spin prior consists of 63 subjects with 186 short-axis slices. For evaluation of cross-sequence generalizability, we included two datasets imaged with different sequences: 1) MOLLI mapping data: 49 subjects with 131 short-axis slices in total. Each slice contains 11 baseline images with different contrasts. 2) cine MRI of 25 subjects with implanted devices: 16 subjects with the bSSFP sequence (Dev-bSSPF) and 9 with GRE sequence (Dev-GRE). Both the MOLLI and the Device datasets have significantly different contrasts than those of the ACDC.
3.0.2 Experimental Settings
The DDPM is pretrained on ACDC images for steps and then fine-tuned with the real spin properties of from mSASHA for steps on the resolution of . We employ the hugging-face diffuser for the implementation of DDPM [22]. The estimated spin properties were resized to their original resolution after Reverse Imaging. We perform forward steps and reverse steps of diffusion in (10) for Reverse Imaging, with the step size . As the exact FA for ACDC scans is unknown, we approximate it with as typically used. We open source the estimated spin properties for the ACDC dataset and code111https://github.com/Ido-zh/cmr_reverse.git. For evaluating the generalization performance, we include the following methods:
- •
Baseline We train an nnUNet[15] with the ACDC data as the baseline;
- •
BayeSeg We compare our model against BayeSeg [11, 12] which extracts the boundary and contour information from images for generalization;
- •
RI-T2S We translate the MOLLI and GRE data into bSSFP cine by Reverse Imaging (RI) and perform segmentation on the translated (T2S) images with the baseline model;
- •
RI-Aug. We perform Reverse Imaging on the ACDC cine images and extend the nnUNet framework by synthesizing MOLLI and GRE images through the estimated spin properties for physics-based augmentation in (5) and (6).
4 Results and Discussion
4.1 Reverse Imaging
We first show an example of Reverse Imaging a bSSFP cine in Fig. 2-(a). The estimated spin properties accurately reconstruct the bSSFP image, indicating a high likelihood. The predicted PD has low contrast between tissues, while in and estimation, blood has a high and , myocardium has high but low , and fat has low but a high . These match the reference relative magnitude of the spin properties [29]. Based on estimated spin properties, images of various imaging protocols can be generated with large contrast variations, as shown in Fig. 2-(b). Reverse Imaging does not provide precise spin property estimation, but we note that they are not used for quantitative evaluation purposes w.r.t. or , instead, they serve to synthesize novel contrasts that are essential for wide-spectrum generalization. Reverse Imaging also enables the T2S translation from MOLLI to bSSFP cine (Fig. 2-(c)).
4.2 Segmentation Performance
We now demonstrate that Reverse Imaging can significantly improve the generalization of segmentation models. In Table 1, we list the segmentation accuracy measured by the Dice score of the left ventricle (LV), myocardium (MYO) and right ventricle (RV) in the challenging MOLLI and Device datasets. To further evaluate the robustness against contrast variation, we also report the Dice for each baseline image of MOLLI in Fig. 3, as MOLLI readouts show strong contrast variation during inversion recovery. Qualitative examples of segmentation results are shown in Fig. 4.
The baseline nnUNet generalizes poorly in the MOLLI images (Table 1), especially when the blood-myocardium contrast is inverted (see #4-#7 in Fig. 3 and LL-#2 in Fig. 4). BayeSeg can improve performance on dark blood images, yet it can still fail on low-contrast images (e.g. Fig. 4 LL-#3-#5). The device images have bright blood but lower contrast, and the baseline nnUNet works more robustly than BayeSeg with higher Dice scores (Table 1). Performing the T2S transform (RI-T2S) significantly improved the segmentation accuracy for MOLLI, but a performance drop is observed in the device dataset. This is because the short ( ms) in GRE minimizes weighting (cf. (6)). In this case, Reverse Imaging is limited by the little amount of information carried by the observed image. However, this can be solved by using Reverse Imaging for augmentation instead (RI-Aug.), which achieves the highest segmentation accuracy in both datasets, significantly better than all other methods (Table 1, ). With RI-Aug., the trained model consistently achieves highly accurate segmentation on images with various contrasts, including low-contrast, bright and dark blood, and GRE images (Fig. 4).
5 Conclusion
We introduced Reverse Imaging, a novel physics-grounded approach for cross-sequence generalization in cardiac MRI segmentation. Combining imaging physics and a generative diffusion prior, our method estimates underlying spin properties from observed images. Reverse Imaging provides an interpretable and physically grounded solution for domain adaptation, addressing the challenge of contrast variations across imaging protocols. Our experiments demonstrate that Reverse Imaging significantly boosts segmentation robustness, enabling zero-shot generalization to unseen sequences without requiring target-domain data. This approach offers a new paradigm for improving cardiac MRI segmentation generalizability, with potential applications to broader medical imaging tasks where domain shifts pose a significant challenge.
{credits}
5.0.1 Acknowledgements
We acknowledge the financial support from the TU Delft AI Initiative and the Dutch Research Council (NWO). This preprint has not undergone post-submission improvements or corrections.
5.0.2 \discintname
We the authors hereby declare no competing interests in this paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Akçakaya, M., Basha, T.A., Weingärtner, S., Roujol, S., Berg, S., Nezafat, R.: Improved quantitative myocardial T 2 mapping: Impact of the fitting model. Magnetic resonance in medicine 74 (1), 93–105 (2015)
- 2[2] Akçakaya, M., Weingärtner, S., Basha, T.A., Roujol, S., Bellm, S., Nezafat, R.: Joint myocardial T 1 and T 2 mapping using a combination of saturation recovery and t 2-preparation. Magnetic resonance in medicine 76 (3), 888–896 (2016)
- 3[3] Bernard, O., Lalande, A., Zotti, C., Cervenansky, F., Yang, X., Heng, P.A., Cetin, I., Lekadir, K., Camara, O., Ballester, M.A.G., et al.: Deep learning techniques for automatic MRI cardiac multi-structures segmentation and diagnosis: is the problem solved? IEEE transactions on medical imaging 37 (11), 2514–2525 (2018)
- 4[4] Campello, V.M., Gkontra, P., Izquierdo, C., Martin-Isla, C., Sojoudi, A., Full, P.M., Maier-Hein, K., Zhang, Y., He, Z., Ma, J., et al.: Multi-centre, multi-vendor and multi-disease cardiac segmentation: the M&Ms challenge. IEEE Transactions on Medical Imaging 40 (12), 3543–3554 (2021)
- 5[5] Chen, C., Ouyang, C., Tarroni, G., Schlemper, J., Qiu, H., Bai, W., Rueckert, D.: Unsupervised multi-modal style transfer for cardiac MR segmentation. In: Statistical Atlases and Computational Models of the Heart. Multi-Sequence CMR Segmentation, CRT-E Piggy and LV Full Quantification Challenges: 10th International Workshop, STACOM 2019, Held in Conjunction with MICCAI 2019, Shenzhen, China, October 13, 2019, Revised Selected Papers 10. pp. 209–219. Springer (2020)
- 6[6] Chen, C., Qin, C., Ouyang, C., Li, Z., Wang, S., Qiu, H., Chen, L., Tarroni, G., Bai, W., Rueckert, D.: Enhancing MR image segmentation with realistic adversarial data augmentation. Medical Image Analysis 82 , 102597 (2022)
- 7[7] Chow, K., Hayes, G., Flewitt, J.A., Feuchter, P., Lydell, C., Howarth, A., Pagano, J.J., Thompson, R.B., Kellman, P., White, J.A.: Improved accuracy and precision with three-parameter simultaneous myocardial T 1 and T 2 mapping using multiparametric SASHA. Magnetic Resonance in Medicine 87 (6), 2775–2791 (2022)
- 8[8] Chung, H., Kim, J., Mccann, M.T., Klasky, M.L., Ye, J.C.: Diffusion posterior sampling for general noisy inverse problems. ar Xiv preprint ar Xiv:2209.14687 (2022)
