Beyond Mask: Rethinking Guidance Types in Few-shot Segmentation
Shijie Chang, Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu

TL;DR
This paper introduces UniFSS, a universal vision-language framework for few-shot segmentation that integrates multiple guidance types, significantly improving performance and surpassing state-of-the-art methods, including weakly annotated paradigms.
Contribution
The work generalizes guidance types in FSS by proposing seven paradigms and develops UniFSS, a framework leveraging vision-language models for improved segmentation performance.
Findings
UniFSS outperforms existing state-of-the-art methods.
Weakly annotated box guidance surpasses mask guidance.
High-level spatial correction enhances semantic understanding.
Abstract
Existing few-shot segmentation (FSS) methods mainly focus on prototype feature generation and the query-support matching mechanism. As a crucial prompt for generating prototype features, the pair of image-mask types in the support set has become the default setting. However, various types such as image, text, box, and mask all can provide valuable information regarding the objects in context, class, localization, and shape appearance. Existing work focuses on specific combinations of guidance, leading FSS into different research branches. Rethinking guidance types in FSS is expected to explore the efficient joint representation of the coupling between the support set and query set, giving rise to research trends in the weakly or strongly annotated guidance to meet the customized requirements of practical users. In this work, we provide the generalized FSS with seven guidance paradigms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Image and Object Detection Techniques
MethodsSparse Evolutionary Training · Focus
