Language-guided Medical Image Segmentation with Target-informed Multi-level Contrastive Alignments
Mingjian Li, Mingyuan Meng, Shuchang Ye, Michael Fulham, Lei Bi, Jinman Kim

TL;DR
This paper introduces TMCA, a novel framework that improves medical image segmentation by aligning image and text patterns at multiple levels, leveraging target information for more precise and detailed guidance.
Contribution
The study proposes a Target-informed Multi-level Contrastive Alignment framework that enhances visual-language integration in medical segmentation by addressing pattern gaps and providing fine-grained guidance.
Findings
TMCA outperforms existing methods on four benchmark datasets.
The framework effectively aligns image-text patterns at multiple levels.
Results show improved segmentation accuracy and detail recognition.
Abstract
Medical image segmentation is a fundamental task in numerous medical engineering applications. Recently, language-guided segmentation has shown promise in medical scenarios where textual clinical reports are readily available as semantic guidance. Clinical reports contain diagnostic information provided by clinicians, which can provide auxiliary textual semantics to guide segmentation. However, existing language-guided segmentation methods neglect the inherent pattern gaps between image and text modalities, resulting in sub-optimal visual-language integration. Contrastive learning is a well-recognized approach to align image-text patterns, but it has not been optimized for bridging the pattern gaps in medical language-guided segmentation that relies primarily on medical image details to characterize the underlying disease/targets. Current contrastive alignment techniques typically align…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Brain Tumor Detection and Classification · Advanced Neural Network Applications
MethodsSoftmax · Attention Is All You Need · Focus
