SAM-Aware Graph Prompt Reasoning Network for Cross-Domain Few-Shot Segmentation
Shi-Feng Peng, Guolei Sun, Yong Li, Hongsong Wang, Guo-Sen Xie

TL;DR
This paper introduces a SAM-aware graph prompt reasoning network that leverages large-scale pre-trained SAM to improve cross-domain few-shot segmentation by enhancing feature representation and semantic consistency across diverse domains.
Contribution
The work proposes a novel GPRN model with a SAM-aware prompt initialization, graph-based reasoning, and adaptive point selection to significantly improve CD-FSS performance.
Findings
Achieves state-of-the-art results on four CD-FSS datasets.
Effectively maintains semantic consistency across sub-region prompts.
Enhances segmentation accuracy with a feedback refinement module.
Abstract
The primary challenge of cross-domain few-shot segmentation (CD-FSS) is the domain disparity between the training and inference phases, which can exist in either the input data or the target classes. Previous models struggle to learn feature representations that generalize to various unknown domains from limited training domain samples. In contrast, the large-scale visual model SAM, pre-trained on tens of millions of images from various domains and classes, possesses excellent generalizability. In this work, we propose a SAM-aware graph prompt reasoning network (GPRN) that fully leverages SAM to guide CD-FSS feature representation learning and improve prediction accuracy. Specifically, we propose a SAM-aware prompt initialization module (SPI) to transform the masks generated by SAM into visual prompts enriched with high-level semantic information. Since SAM tends to divide an object…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Imaging Techniques and Applications · Advanced X-ray and CT Imaging
MethodsSegment Anything Model
