PGP-SAM: Prototype-Guided Prompt Learning for Efficient Few-Shot Medical   Image Segmentation

Zhonghao Yan; Zijin Yin; Tianyu Lin; Xiangzhu Zeng; Kongming Liang and; Zhanyu Ma

arXiv:2501.06692·cs.CV·January 14, 2025

PGP-SAM: Prototype-Guided Prompt Learning for Efficient Few-Shot Medical Image Segmentation

Zhonghao Yan, Zijin Yin, Tianyu Lin, Xiangzhu Zeng, Kongming Liang and, Zhanyu Ma

PDF

1 Repo

TL;DR

PGP-SAM introduces a prototype-guided prompt learning method that efficiently adapts the Segment Anything Model for medical image segmentation using limited annotated samples, reducing manual prompt design.

Contribution

It proposes a novel prototype-based few-shot tuning approach with a contextual modulation and cross-attention mechanism for automatic prompt generation.

Findings

01

Achieves higher Dice scores than existing prompt-free SAM variants.

02

Uses only 10% of the 2D slices for training.

03

Demonstrates effectiveness on multi-organ and ventricle datasets.

Abstract

The Segment Anything Model (SAM) has demonstrated strong and versatile segmentation capabilities, along with intuitive prompt-based interactions. However, customizing SAM for medical image segmentation requires massive amounts of pixel-level annotations and precise point- or box-based prompt designs. To address these challenges, we introduce PGP-SAM, a novel prototype-based few-shot tuning approach that uses limited samples to replace tedious manual prompts. Our key idea is to leverage inter- and intra-class prototypes to capture class-specific knowledge and relationships. We propose two main components: (1) a plug-and-play contextual modulation module that integrates multi-scale information, and (2) a class-guided cross-attention mechanism that fuses prototypes and features for automatic prompt generation. Experiments on a public multi-organ dataset and a private ventricle dataset…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

PRIS-CV/PGP-SAM
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSegment Anything Model