Enhancing Label-efficient Medical Image Segmentation with Text-guided Diffusion Models
Chun-Mei Feng

TL;DR
TextDiff leverages inexpensive medical text annotations to enhance semantic representations in diffusion models, significantly improving medical image segmentation performance with minimal training data.
Contribution
The paper introduces TextDiff, a novel method that integrates diagnostic text with diffusion models for improved label-efficient medical image segmentation.
Findings
Outperforms state-of-the-art methods on QaTa-COVID19 and MoNuSeg datasets.
Requires only training of cross-attention and pixel classifier, reducing annotation costs.
Achieves superior segmentation accuracy with few training samples.
Abstract
Aside from offering state-of-the-art performance in medical image generation, denoising diffusion probabilistic models (DPM) can also serve as a representation learner to capture semantic information and potentially be used as an image representation for downstream tasks, e.g., segmentation. However, these latent semantic representations rely heavily on labor-intensive pixel-level annotations as supervision, limiting the usability of DPM in medical image segmentation. To address this limitation, we propose an enhanced diffusion segmentation model, called TextDiff, that improves semantic representation through inexpensive medical text annotations, thereby explicitly establishing semantic representation and language correspondence for diffusion models. Concretely, TextDiff extracts intermediate activations of the Markov step of the reverse diffusion process in a pretrained diffusion model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Image Retrieval and Classification Techniques · Handwritten Text Recognition Techniques
MethodsDiffusion
