CardioDiT: Latent Diffusion Transformers for 4D Cardiac MRI Synthesis
Marvin Seyfarth, Sarah Kaye M\"uller, Arman Ghanaat, Isabelle Ayx, Fabian Fastenrath, Philipp Wild, Alexander Hertel, Theano Papavassiliu, Salman Ul Hassan Dar, and Sandy Engelhardt

TL;DR
CardioDiT introduces a 4D latent diffusion transformer model that synthesizes temporally coherent 4D cardiac MRI images, improving consistency and realistic cardiac motion modeling over previous methods.
Contribution
It presents the first fully 4D diffusion transformer framework for cardiac MRI synthesis, jointly modeling space and time without architectural factorization.
Findings
Enhanced inter-slice consistency in generated images
Improved temporal coherence of cardiac motion
Realistic distribution of cardiac functions
Abstract
Latent diffusion models (LDMs) have recently achieved strong performance in 3D medical image synthesis. However, modalities like cine cardiac MRI (CMR), representing a temporally synchronized 3D volume across the cardiac cycle, add an additional dimension that most generative approaches do not model directly. Instead, they factorize space and time or enforce temporal consistency through auxiliary mechanisms such as anatomical masks. Such strategies introduce structural biases that may limit global context integration and lead to subtle spatiotemporal discontinuities or physiologically inconsistent cardiac dynamics. We investigate whether a unified 4D generative model can learn continuous cardiac dynamics without architectural factorization. We propose CardioDiT, a fully 4D latent diffusion framework for short-axis cine CMR synthesis based on diffusion transformers. A spatiotemporal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Functional Brain Connectivity Studies · Model Reduction and Neural Networks
