Med-PerSAM: One-Shot Visual Prompt Tuning for Personalized Segment Anything Model in Medical Domain

Hangyul Yoon; Doohyuk Jang; Jungeun Kim; Eunho Yang

arXiv:2411.16123 · cs.CV · November 26, 2024

TL;DR

Med-PerSAM introduces a novel one-shot visual prompt tuning framework that enhances medical image segmentation by automating prompt generation and refinement, eliminating the need for additional training or human intervention.

Contribution

The paper presents Med-PerSAM, a lightweight, automated prompt generation method that improves medical segmentation using SAM without extra training or expert input.

Findings

01 Outperforms existing models on diverse medical datasets
02 Automates prompt generation, reducing reliance on human expertise
03 Enhances segmentation accuracy in medical imaging

Abstract

Leveraging pre-trained models with tailored prompts for in-context learning has proven highly effective in NLP tasks. Building on this success, recent studies have applied a similar approach to the Segment Anything Model (SAM) within a "one-shot" framework, where only a single reference image and its label are employed. However, these methods face limitations in the medical domain, primarily due to SAM's essential requirement for visual prompts and the over-reliance on pixel similarity for generating them. This dependency may lead to (1) inaccurate prompt generation and (2) clustering of point prompts, resulting in suboptimal outcomes. To address these challenges, we introduce Med-PerSAM, a novel and straightforward one-shot framework designed for the medical domain. Med-PerSAM uses only visual prompt engineering and eliminates the need for additional training of the…
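
To make the limitation described in the abstract concrete, the following is a minimal sketch of similarity-based point prompting, the general kind of approach the abstract says prior one-shot methods over-rely on. It is not Med-PerSAM's method and not any specific paper's implementation. The function names `similarity_map` and `top_k_points`, the feature shapes, and the toy data are all assumptions made for illustration; in practice the features would come from an image encoder.

```python
import numpy as np

def similarity_map(ref_feats, ref_mask, tgt_feats):
    """Cosine similarity between the mean masked reference feature
    and every location of the target feature map.

    ref_feats, tgt_feats: (H, W, C) float arrays (hypothetical encoder outputs).
    ref_mask: (H, W) boolean array marking the labeled region in the reference.
    """
    # Average the reference features inside the mask to get one prototype vector.
    proto = ref_feats[ref_mask].mean(axis=0)
    proto = proto / (np.linalg.norm(proto) + 1e-8)
    # Normalize each target feature so the dot product is a cosine similarity.
    norms = np.linalg.norm(tgt_feats, axis=-1, keepdims=True) + 1e-8
    return (tgt_feats / norms) @ proto  # shape (H, W)

def top_k_points(sim, k):
    """Return the k (row, col) locations with the highest similarity."""
    flat = np.argsort(sim.ravel())[::-1][:k]
    return np.stack(np.unravel_index(flat, sim.shape), axis=1)

# Toy data (assumed): background feature [0, 1], foreground feature [1, 0].
ref = np.zeros((4, 4, 2))
ref[..., 1] = 1.0
mask = np.zeros((4, 4), dtype=bool)
mask[1:3, 1:3] = True
ref[mask] = [1.0, 0.0]

tgt = np.zeros((8, 8, 2))
tgt[..., 1] = 1.0
tgt[5:7, 1:3] = [1.0, 0.0]  # a small foreground patch in the target

sim = similarity_map(ref, mask, tgt)
points = top_k_points(sim, 3)
print(points)
```

In this toy setup the three selected points all fall inside the same 2x2 foreground patch, which illustrates the clustering of point prompts mentioned in the abstract: taking the most similar locations tends to pick neighbors rather than points spread across the structure.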

Code & Models

Repositories

facebookresearch/segment-anything
PyTorch · Official

Taxonomy

Topics: Data Visualization and Analytics

Methods: Segment Anything Model