Prototypical Progressive Alignment and Reweighting for Generalizable Semantic Segmentation

Yuhang Zhang; Zhengyu Zhang; Muxin Liao; Shishun Tian; Wenbin Zou; Lu Zhang; Chen Xu

arXiv:2507.11955·cs.CV·July 17, 2025

Prototypical Progressive Alignment and Reweighting for Generalizable Semantic Segmentation

Yuhang Zhang, Zhengyu Zhang, Muxin Liao, Shishun Tian, Wenbin Zou, Lu Zhang, Chen Xu

PDF

Open Access

TL;DR

This paper introduces PPAR, a novel framework that uses CLIP-based prototypes, progressive alignment, and reweighting to improve the generalization of semantic segmentation models to unseen domains.

Contribution

The paper proposes a new method combining CLIP-derived prototypes, progressive alignment, and reweighting to enhance domain generalization in semantic segmentation.

Findings

01

PPAR achieves state-of-the-art results on multiple benchmarks.

02

The progressive alignment strategy effectively reduces domain gaps.

03

Prototypical reweighting mitigates negative transfer from irrelevant source data.

Abstract

Generalizable semantic segmentation aims to perform well on unseen target domains, a critical challenge due to real-world applications requiring high generalizability. Class-wise prototypes, representing class centroids, serve as domain-invariant cues that benefit generalization due to their stability and semantic consistency. However, this approach faces three challenges. First, existing methods often adopt coarse prototypical alignment strategies, which may hinder performance. Second, naive prototypes computed by averaging source batch features are prone to overfitting and may be negatively affected by unrelated source data. Third, most methods treat all source samples equally, ignoring the fact that different features have varying adaptation difficulties. To address these limitations, we propose a novel framework for generalizable semantic segmentation: Prototypical Progressive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Machine Learning and Data Classification