PartSeg: Few-shot Part Segmentation via Part-aware Prompt Learning

Mengya Han; Heliang Zheng; Chaoyue Wang; Yong Luo; Han Hu; Jing Zhang,; Yonggang Wen

arXiv:2308.12757·cs.CV·August 25, 2023·2 cites

PartSeg: Few-shot Part Segmentation via Part-aware Prompt Learning

Mengya Han, Heliang Zheng, Chaoyue Wang, Yong Luo, Han Hu, Jing Zhang,, Yonggang Wen

PDF

Open Access

TL;DR

PartSeg introduces a novel few-shot part segmentation approach leveraging part-aware prompt learning with CLIP, enabling effective generalization across object categories and achieving state-of-the-art results on benchmark datasets.

Contribution

The paper proposes a new multimodal learning method using part-aware prompts with CLIP for few-shot part segmentation, enhancing cross-category generalization.

Findings

01

Achieves state-of-the-art performance on PartImageNet and Pascal_Part datasets.

02

Effectively models part relationships across different object categories.

03

Demonstrates the benefit of textual space in few-shot visual segmentation.

Abstract

In this work, we address the task of few-shot part segmentation, which aims to segment the different parts of an unseen object using very few labeled examples. It is found that leveraging the textual space of a powerful pre-trained image-language model (such as CLIP) can be beneficial in learning visual features. Therefore, we develop a novel method termed PartSeg for few-shot part segmentation based on multimodal learning. Specifically, we design a part-aware prompt learning method to generate part-specific prompts that enable the CLIP model to better understand the concept of ``part'' and fully utilize its textual space. Furthermore, since the concept of the same part under different object categories is general, we establish relationships between these parts during the prompt learning process. We conduct extensive experiments on the PartImageNet and Pascal $_$ Part datasets, and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications

MethodsContrastive Language-Image Pre-training