Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation
Suho Park, SuBeen Lee, Hyun Seok Seong, Jaejoon Yoo, Jae-Pil Heo

TL;DR
This paper introduces a novel method for few-shot segmentation that leverages SAM and ResNet features with prototype matching and iterative cross-attention, achieving state-of-the-art results.
Contribution
It proposes a foreground-covering prototype generation and matching approach that effectively utilizes SAM and ResNet features with attention-guided pseudo-masks for improved FSS.
Findings
Achieves state-of-the-art performance on multiple datasets.
Effectively utilizes attention-based pseudo-masks for segmentation.
Demonstrates the benefit of combining SAM and ResNet features.
Abstract
We propose Foreground-Covering Prototype Generation and Matching to resolve Few-Shot Segmentation (FSS), which aims to segment target regions in unlabeled query images based on labeled support images. Unlike previous research, which typically estimates target regions in the query using support prototypes and query pixels, we utilize the relationship between support and query prototypes. To achieve this, we utilize two complementary features: SAM Image Encoder features for pixel aggregation and ResNet features for class consistency. Specifically, we construct support and query prototypes with SAM features and distinguish query prototypes of target regions based on ResNet features. For the query prototype construction, we begin by roughly guiding foreground regions within SAM features using the conventional pseudo-mask, then employ iterative cross-attention to aggregate foreground…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging Techniques and Applications · Geophysical Methods and Applications · Image Processing and 3D Reconstruction
MethodsAverage Pooling · Max Pooling · Global Average Pooling · Convolution · Kaiming Initialization · Focus · Segment Anything Model
