Unbiased Semantic Decoding with Vision Foundation Models for Few-shot Segmentation

Jin Wang; Bingfeng Zhang; Jian Pang; Weifeng Liu; Baodi Liu; Honglong Chen

arXiv:2511.15118·cs.CV·November 20, 2025

Unbiased Semantic Decoding with Vision Foundation Models for Few-shot Segmentation

Jin Wang, Bingfeng Zhang, Jian Pang, Weifeng Liu, Baodi Liu, Honglong Chen

PDF

Open Access

TL;DR

This paper introduces an Unbiased Semantic Decoding strategy for SAM that leverages both support and query sets along with CLIP to improve few-shot segmentation without retraining foundation models.

Contribution

It proposes a novel decoding approach that enhances SAM's semantic discrimination by integrating support and query information with CLIP, avoiding re-training of foundation models.

Findings

01

Enhanced segmentation accuracy demonstrated on benchmark datasets.

02

Effective bias reduction in semantic decoding process.

03

No need for re-training foundation models.

Abstract

Few-shot segmentation has garnered significant attention. Many recent approaches attempt to introduce the Segment Anything Model (SAM) to handle this task. With the strong generalization ability and rich object-specific extraction ability of the SAM model, such a solution shows great potential in few-shot segmentation. However, the decoding process of SAM highly relies on accurate and explicit prompts, making previous approaches mainly focus on extracting prompts from the support set, which is insufficient to activate the generalization ability of SAM, and this design is easy to result in a biased decoding process when adapting to the unknown classes. In this work, we propose an Unbiased Semantic Decoding (USD) strategy integrated with SAM, which extracts target information from both the support and query set simultaneously to perform consistent predictions guided by the semantics of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications