Training-Free Class Purification for Open-Vocabulary Semantic Segmentation
Qi Chen, Lingxiao Yang, Yun Chen, Nailong Zhao, Jianhuang Lai, Jie Shao, Xiaohua Xie

TL;DR
This paper introduces FreeCP, a training-free class purification framework that improves open-vocabulary semantic segmentation by addressing class redundancy and ambiguity, leading to better segmentation accuracy without additional training.
Contribution
The paper presents a novel training-free method, FreeCP, that enhances OVSS by purifying class representations to handle redundancy and ambiguity challenges.
Findings
FreeCP significantly improves segmentation performance across eight benchmarks.
FreeCP acts as an effective plug-and-play module with existing OVSS methods.
Addressing class redundancy and ambiguity enhances class activation maps.
Abstract
Fine-tuning pre-trained vision-language models has emerged as a powerful approach for enhancing open-vocabulary semantic segmentation (OVSS). However, the substantial computational and resource demands associated with training on large datasets have prompted interest in training-free methods for OVSS. Existing training-free approaches primarily focus on modifying model architectures and generating prototypes to improve segmentation performance. However, they often neglect the challenges posed by class redundancy, where multiple categories are not present in the current test image, and visual-language ambiguity, where semantic similarities among categories create confusion in class activation. These issues can lead to suboptimal class activation maps and affinity-refined activation maps. Motivated by these observations, we propose FreeCP, a novel training-free class purification…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
