Prototypical Progressive Alignment and Reweighting for Generalizable Semantic Segmentation
Yuhang Zhang, Zhengyu Zhang, Muxin Liao, Shishun Tian, Wenbin Zou, Lu Zhang, Chen Xu

TL;DR
This paper introduces PPAR, a novel framework that uses CLIP-based prototypes, progressive alignment, and reweighting to improve the generalization of semantic segmentation models to unseen domains.
Contribution
The paper proposes a new method combining CLIP-derived prototypes, progressive alignment, and reweighting to enhance domain generalization in semantic segmentation.
Findings
PPAR achieves state-of-the-art results on multiple benchmarks.
The progressive alignment strategy effectively reduces domain gaps.
Prototypical reweighting mitigates negative transfer from irrelevant source data.
Abstract
Generalizable semantic segmentation aims to perform well on unseen target domains, a critical challenge due to real-world applications requiring high generalizability. Class-wise prototypes, representing class centroids, serve as domain-invariant cues that benefit generalization due to their stability and semantic consistency. However, this approach faces three challenges. First, existing methods often adopt coarse prototypical alignment strategies, which may hinder performance. Second, naive prototypes computed by averaging source batch features are prone to overfitting and may be negatively affected by unrelated source data. Third, most methods treat all source samples equally, ignoring the fact that different features have varying adaptation difficulties. To address these limitations, we propose a novel framework for generalizable semantic segmentation: Prototypical Progressive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Machine Learning and Data Classification
