Multi-Granularity Feature Calibration via VFM for Domain Generalized Semantic Segmentation
Xinhui Li, Xiaojie Guo

TL;DR
This paper introduces MGFC, a hierarchical feature calibration framework that leverages vision foundation models at multiple granularities to improve domain generalization in semantic segmentation tasks.
Contribution
It proposes a novel multi-granularity feature calibration method that aligns VFM features at coarse, medium, and fine levels for better domain robustness in semantic segmentation.
Findings
MGFC outperforms existing state-of-the-art DGSS methods on benchmark datasets.
Hierarchical feature calibration improves segmentation accuracy.
Multi-level feature adaptation improves robustness to domain shifts.
Abstract
Domain Generalized Semantic Segmentation (DGSS) aims to improve the generalization ability of models across unseen domains without access to target data during training. Recent advances in DGSS have increasingly exploited vision foundation models (VFMs) via parameter-efficient fine-tuning strategies. However, most existing approaches concentrate on global feature fine-tuning, while overlooking hierarchical adaptation across feature levels, which is crucial for precise dense prediction. In this paper, we propose Multi-Granularity Feature Calibration (MGFC), a novel framework that performs coarse-to-fine alignment of VFM features to enhance robustness under domain shifts. Specifically, MGFC first calibrates coarse-grained features to capture global contextual semantics and scene-level structure. Then, it refines medium-grained features by promoting category-level feature discriminability.…
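The abstract names a coarse stage (global context) and a medium stage (category-level discriminability) but does not say which operations implement them. The sketch below is therefore only an illustration and not the authors' method. The sigmoid channel gate, the class-prototype averaging, and every function name in it are assumptions chosen for simplicity. The stage that follows in the truncated abstract is not sketched.

```python
import math

def global_context(tokens):
    """Average all token features into one scene-level vector."""
    n = len(tokens)
    return [sum(t[c] for t in tokens) / n for c in range(len(tokens[0]))]

def coarse_calibrate(tokens):
    """Assumed coarse step: scale each channel by a sigmoid gate
    computed from the global context vector."""
    gate = [1.0 / (1.0 + math.exp(-g)) for g in global_context(tokens)]
    return [[x * g for x, g in zip(t, gate)] for t in tokens]

def class_prototypes(tokens, labels):
    """Assumed medium step: one mean feature vector per category."""
    groups = {}
    for t, y in zip(tokens, labels):
        groups.setdefault(y, []).append(t)
    return {y: global_context(ts) for y, ts in groups.items()}

def cosine(a, b):
    """Cosine similarity with a small epsilon to avoid division by zero."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb + 1e-8)

def nearest_class(token, prototypes):
    """Assign a token to the category whose prototype is most similar."""
    return max(prototypes, key=lambda y: cosine(token, prototypes[y]))

# Toy data: four 2-channel token features standing in for VFM outputs.
tokens = [[2.0, 0.1], [1.8, 0.2], [0.1, 2.2], [0.3, 1.9]]
labels = [0, 0, 1, 1]

calibrated = coarse_calibrate(tokens)
protos = class_prototypes(calibrated, labels)
predicted = [nearest_class(t, protos) for t in calibrated]
```

On this toy input the prototypes separate the two groups, so each token is assigned back to its own category. A real pipeline would operate on dense feature maps from a frozen VFM backbone and would train the calibration parameters, neither of which is shown here.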
Taxonomy
Topics: Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Face recognition and analysis
