PartSeg: Few-shot Part Segmentation via Part-aware Prompt Learning
Mengya Han, Heliang Zheng, Chaoyue Wang, Yong Luo, Han Hu, Jing Zhang,, Yonggang Wen

TL;DR
PartSeg introduces a novel few-shot part segmentation approach leveraging part-aware prompt learning with CLIP, enabling effective generalization across object categories and achieving state-of-the-art results on benchmark datasets.
Contribution
The paper proposes a new multimodal learning method using part-aware prompts with CLIP for few-shot part segmentation, enhancing cross-category generalization.
Findings
Achieves state-of-the-art performance on PartImageNet and Pascal_Part datasets.
Effectively models part relationships across different object categories.
Demonstrates the benefit of textual space in few-shot visual segmentation.
Abstract
In this work, we address the task of few-shot part segmentation, which aims to segment the different parts of an unseen object using very few labeled examples. It is found that leveraging the textual space of a powerful pre-trained image-language model (such as CLIP) can be beneficial in learning visual features. Therefore, we develop a novel method termed PartSeg for few-shot part segmentation based on multimodal learning. Specifically, we design a part-aware prompt learning method to generate part-specific prompts that enable the CLIP model to better understand the concept of ``part'' and fully utilize its textual space. Furthermore, since the concept of the same part under different object categories is general, we establish relationships between these parts during the prompt learning process. We conduct extensive experiments on the PartImageNet and PascalPart datasets, and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
MethodsContrastive Language-Image Pre-training
