Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model
Kun Zhao, Jakub Prokop, Javier Montalt Tordera, and Sadegh Mohammadi

TL;DR
This paper introduces a novel approach using text-to-image diffusion models for panoptic segmentation of mammograms, significantly improving lesion delineation accuracy and aiding early breast cancer diagnosis.
Contribution
It leverages pretrained diffusion models and domain-specific encoders to enhance mammogram segmentation, bridging natural and medical imaging domains with state-of-the-art performance.
Findings
Achieved 40.25 AP0.1 in instance segmentation
Attained Dice scores around 39-41 for semantic segmentation
Demonstrated effective domain transfer with diffusion models
Abstract
Mammography is crucial for breast cancer surveillance and early diagnosis. However, analyzing mammography images is a demanding task for radiologists, who often review hundreds of mammograms daily, leading to overdiagnosis and overtreatment. Computer-Aided Diagnosis (CAD) systems have been developed to assist in this process, but their capabilities, particularly in lesion segmentation, remained limited. With the contemporary advances in deep learning their performance may be improved. Recently, vision-language diffusion models emerged, demonstrating outstanding performance in image generation and transferability to various downstream tasks. We aim to harness their capabilities for breast lesion segmentation in a panoptic setting, which encompasses both semantic and instance-level predictions. Specifically, we propose leveraging pretrained features from a Stable Diffusion model as inputs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection
MethodsDiffusion
