A Fast and Efficient Modern BERT based Text-Conditioned Diffusion Model for Medical Image Segmentation
Venkata Siddharth Dhara, Pawan Kumar

TL;DR
This paper introduces FastTextDiff, a diffusion-based medical image segmentation model that leverages ModernBERT for efficient, text-conditioned segmentation, significantly improving accuracy and training speed by integrating clinical notes and multi-modal attention.
Contribution
The paper presents a novel, scalable diffusion model using ModernBERT for text-conditioned medical image segmentation, replacing Clinical BioBERT and enhancing efficiency and accuracy.
Findings
ModernBERT effectively encodes clinical knowledge for segmentation.
FastTextDiff outperforms traditional diffusion models in accuracy.
Training efficiency is improved using FlashAttention 2 and large-scale pretraining.
Abstract
In recent times, denoising diffusion probabilistic models (DPMs) have proven effective for medical image generation and denoising, and as representation learners for downstream segmentation. However, segmentation performance is limited by the need for dense pixel-wise labels, which are expensive, time-consuming, and require expert knowledge. We propose FastTextDiff, a label-efficient diffusion-based segmentation model that integrates medical text annotations to enhance semantic representations. Our approach uses ModernBERT, a transformer capable of processing long clinical notes, to tightly link textual annotations with semantic content in medical images. Trained on MIMIC-III and MIMIC-IV, ModernBERT encodes clinical knowledge that guides cross-modal attention between visual and textual features. This study validates ModernBERT as a fast, scalable alternative to Clinical BioBERT in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · Advanced Neuroimaging Techniques and Applications · Medical Image Segmentation Techniques
