MulModSeg: Enhancing Unpaired Multi-Modal Medical Image Segmentation with Modality-Conditioned Text Embedding and Alternating Training
Chengyin Li, Hui Zhu, Rafi Ibn Sultan, Hassan Bagher Ebadian, Prashant, Khanduri, Chetty Indrin, Kundan Thind, Dongxiao Zhu

TL;DR
MulModSeg introduces a simple, modality-aware segmentation framework using text embeddings and alternating training, significantly improving multi-modal medical image segmentation without complex modifications.
Contribution
It proposes a novel modality-conditioned text embedding and alternating training approach to enhance multi-modal segmentation with minimal structural changes.
Findings
Outperforms previous methods in multi-organ and cardiac segmentation
Effective across both CT and MR modalities
Works with CNN and Transformer backbones
Abstract
In the diverse field of medical imaging, automatic segmentation has numerous applications and must handle a wide variety of input domains, such as different types of Computed Tomography (CT) scans and Magnetic Resonance (MR) images. This heterogeneity challenges automatic segmentation algorithms to maintain consistent performance across different modalities due to the requirement for spatially aligned and paired images. Typically, segmentation models are trained using a single modality, which limits their ability to generalize to other types of input data without employing transfer learning techniques. Additionally, leveraging complementary information from different modalities to enhance segmentation precision often necessitates substantial modifications to popular encoder-decoder designs, such as introducing multiple branched encoding or decoding paths for each modality. In this work,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Topic Modeling · Machine Learning in Healthcare
