Automating MedSAM by Learning Prompts with Weak Few-Shot Supervision

M\'elanie Gaillochet; Christian Desrosiers; Herv\'e Lombaert

arXiv:2409.20293·cs.CV·October 1, 2024

Automating MedSAM by Learning Prompts with Weak Few-Shot Supervision

M\'elanie Gaillochet, Christian Desrosiers, Herv\'e Lombaert

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method to automate image segmentation by learning prompts directly from images using weak supervision, reducing the need for manual prompt design and extensive labeled data, especially in medical imaging.

Contribution

It proposes a lightweight module to learn prompt embeddings from images, enabling automatic segmentation with minimal supervision and fine-tuning of foundation models like MedSAM for medical images.

Findings

01

Effective segmentation with only 10 samples and weak labels

02

Validated on three medical datasets in MR and ultrasound imaging

03

Reduces manual prompt engineering and annotation effort

Abstract

Foundation models such as the recently introduced Segment Anything Model (SAM) have achieved remarkable results in image segmentation tasks. However, these models typically require user interaction through handcrafted prompts such as bounding boxes, which limits their deployment to downstream tasks. Adapting these models to a specific task with fully labeled data also demands expensive prior user interaction to obtain ground-truth annotations. This work proposes to replace conditioning on input prompts with a lightweight module that directly learns a prompt embedding from the image embedding, both of which are subsequently used by the foundation model to output a segmentation mask. Our foundation models with learnable prompts can automatically segment any specific region by 1) modifying the input through a prompt embedding predicted by a simple module, and 2) using weak labels (tight…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

minimel/medsamweakfewshotpromptautomation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning

MethodsSegment Anything Model