Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation
Pan Zhang, Bo Zhang, Ting Zhang, Dong Chen, Yong Wang, Fang Wen

TL;DR
This paper introduces a prototype-based approach for unsupervised domain adaptation in semantic segmentation, utilizing feature distances for pseudo label correction and target structure learning, leading to significant performance improvements.
Contribution
It proposes a novel prototype and feature distance-based method for pseudo label denoising and target structure alignment in domain adaptive segmentation.
Findings
Outperforms state-of-the-art methods significantly
Effective pseudo label correction using feature distances
Enhanced target feature space compactness
Abstract
Self-training is a competitive approach in domain adaptive segmentation, which trains the network with the pseudo labels on the target domain. However inevitably, the pseudo labels are noisy and the target features are dispersed due to the discrepancy between source and target domains. In this paper, we rely on representative prototypes, the feature centroids of classes, to address the two issues for unsupervised domain adaptation. In particular, we take one step further and exploit the feature distances from prototypes that provide richer information than mere prototypes. Specifically, we use it to estimate the likelihood of pseudo labels to facilitate online correction in the course of training. Meanwhile, we align the prototypical assignments based on relative feature distances for two different views of the same target, producing a more compact target feature space. Moreover, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Machine Learning and Data Classification
MethodsSpatial Pyramid Pooling · Batch Normalization · 1x1 Convolution · Dilated Convolution · Atrous Spatial Pyramid Pooling · DeepLabv3
