Diffusion-empowered AutoPrompt MedSAM
Peng Huang, Shu Hu, Bo Peng, Xun Gong, Penghang Yin, Hongtu Zhu, Xi, Wu, Xin Wang

TL;DR
AutoMedSAM introduces an automated, diffusion-based prompt encoder to MedSAM, significantly reducing manual effort and embedding semantic labels, thereby enhancing usability and segmentation accuracy in medical imaging.
Contribution
The paper presents AutoMedSAM, a novel diffusion-guided prompt encoder that automates prompt generation and incorporates semantic information, improving MedSAM's clinical applicability.
Findings
Achieves superior segmentation performance across multiple datasets.
Enables fully automated prompts with semantic labels.
Broadens MedSAM's usability for non-expert users.
Abstract
MedSAM, a medical foundation model derived from the SAM architecture, has demonstrated notable success across diverse medical domains. However, its clinical application faces two major challenges: the dependency on labor-intensive manual prompt generation, which imposes a significant burden on clinicians, and the absence of semantic labeling in the generated segmentation masks for organs or lesions, limiting its practicality for non-expert users. To address these limitations, we propose AutoMedSAM, an end-to-end framework derived from SAM, designed to enhance usability and segmentation performance. AutoMedSAM retains MedSAM's image encoder and mask decoder structure while introducing a novel diffusion-based class prompt encoder. The diffusion-based encoder employs a dual-decoder structure to collaboratively generate prompt embeddings guided by sparse and dense prompt definitions. These…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · Dendrimers and Hyperbranched Polymers
MethodsSegment Anything Model
