Diffusion-empowered AutoPrompt MedSAM

Peng Huang; Shu Hu; Bo Peng; Xun Gong; Penghang Yin; Hongtu Zhu; Xi; Wu; Xin Wang

arXiv:2502.06817·eess.IV·April 17, 2025

Diffusion-empowered AutoPrompt MedSAM

Peng Huang, Shu Hu, Bo Peng, Xun Gong, Penghang Yin, Hongtu Zhu, Xi, Wu, Xin Wang

PDF

Open Access 1 Repo

TL;DR

AutoMedSAM introduces an automated, diffusion-based prompt encoder to MedSAM, significantly reducing manual effort and embedding semantic labels, thereby enhancing usability and segmentation accuracy in medical imaging.

Contribution

The paper presents AutoMedSAM, a novel diffusion-guided prompt encoder that automates prompt generation and incorporates semantic information, improving MedSAM's clinical applicability.

Findings

01

Achieves superior segmentation performance across multiple datasets.

02

Enables fully automated prompts with semantic labels.

03

Broadens MedSAM's usability for non-expert users.

Abstract

MedSAM, a medical foundation model derived from the SAM architecture, has demonstrated notable success across diverse medical domains. However, its clinical application faces two major challenges: the dependency on labor-intensive manual prompt generation, which imposes a significant burden on clinicians, and the absence of semantic labeling in the generated segmentation masks for organs or lesions, limiting its practicality for non-expert users. To address these limitations, we propose AutoMedSAM, an end-to-end framework derived from SAM, designed to enhance usability and segmentation performance. AutoMedSAM retains MedSAM's image encoder and mask decoder structure while introducing a novel diffusion-based class prompt encoder. The diffusion-based encoder employs a dual-decoder structure to collaboratively generate prompt embeddings guided by sparse and dense prompt definitions. These…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hp-ml/autopromptmedsam
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBrain Tumor Detection and Classification · Dendrimers and Hyperbranched Polymers

MethodsSegment Anything Model