Beyond Visual Cues: Leveraging General Semantics as Support for Few-Shot Segmentation
Jin Wang, Bingfeng Zhang, Jian Pang, Mengyu Liu, Honglong Chen, Weifeng Liu

TL;DR
This paper introduces a novel language-driven approach for few-shot segmentation that leverages large language models and multi-modal alignment to provide unbiased, detailed attribute guidance, outperforming existing methods.
Contribution
The paper proposes LDAG, a new architecture that uses language descriptions and multi-modal matching to improve few-shot segmentation, addressing intra-class variation issues.
Findings
Outperforms existing few-shot segmentation methods.
Achieves state-of-the-art performance on benchmark datasets.
Demonstrates the effectiveness of language-driven attribute guidance.
Abstract
Few-shot segmentation (FSS) aims to segment novel classes under the guidance of limited support samples by a meta-learning paradigm. Existing methods mainly mine references from support images as meta guidance. However, due to intra-class variations among visual representations, the meta information extracted from support images cannot produce accurate guidance to segment untrained classes. In this paper, we argue that the references from support images may not be essential, the key to the support role is to provide unbiased meta guidance for both trained and untrained classes. We then introduce a Language-Driven Attribute Generalization (LDAG) architecture to utilize inherent target property language descriptions to build robust support strategy. Specifically, to obtain an unbiased support representation, we design a Multi-attribute Enhancement (MaE) module, which produces multiple…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
