Balanced Learning for Domain Adaptive Semantic Segmentation
Wangkai Li, Rui Sun, Bohao Liao, Zhaoyang Li, Tianzhu Zhang

TL;DR
This paper introduces BLDA, a novel method for unsupervised domain adaptation in semantic segmentation that balances class predictions by analyzing logits distributions, leading to improved performance especially on under-predicted classes.
Contribution
BLDA is a new approach that directly assesses and alleviates class bias in UDA for semantic segmentation without prior distribution knowledge.
Findings
BLDA improves segmentation performance on benchmark datasets.
BLDA effectively balances class predictions during domain adaptation.
BLDA enhances under-predicted class accuracy.
Abstract
Unsupervised domain adaptation (UDA) for semantic segmentation aims to transfer knowledge from a labeled source domain to an unlabeled target domain. Despite the effectiveness of self-training techniques in UDA, they struggle to learn each class in a balanced manner due to inherent class imbalance and distribution shift in both data and label space between domains. To address this issue, we propose Balanced Learning for Domain Adaptation (BLDA), a novel approach to directly assess and alleviate class bias without requiring prior knowledge about the distribution shift. First, we identify over-predicted and under-predicted classes by analyzing the distribution of predicted logits. Subsequently, we introduce a post-hoc approach to align the logits distributions across different classes using shared anchor distributions. To further consider the network's need to generate unbiased…
Peer Reviews
Decision·Submitted to ICLR 2025
1. This paper provide a new way to measure the class distribution changes in semantic segmentation by the logits distribution. 2. The proposed module could easily be applied to existing UDA for semantic segmentation methods, potentially have a broad use in this area. 3. The proposed module is generally effective on most of the classes in the two benckmark tasks. 4. The visual aid is good, provide an intuition of the motivation, also demostrates the effectiveness of the proposed module.
1. The proposed method relies on the logits distribution. However, this distribution can be affected by data quality and model architecture, which can affect the accuracy of bias assessment. 2. As a DA for segmantation task, a very severe issue is its efficiency concern. Adaptation process already cost a lot of time and computational resources, the proposed method seems exacerbated this issue by multiple GMMs. An efficiency study including wall-clock time or other efficiency measurement will be
- The motivation is clear, with a thorough statistical analysis of the class bias issue in unsupervised domain adaptation (UDA) for semantic segmentation (Figures 1 and 2). - The paper is generally well-written, well-structured, and easy to follow. - The proposed method comprises four modules. Although each module is simple and widely used in the machine learning field (e.g., GMM and alignment with anchor distributions), these techniques are effective in addressing issues found in this task. - T
1. The proposed method is computationally heavy, as it includes an additional regression head with extra training objectives and requires GMM updates via EM algorithms. Consequently, this approach may incur significantly more computation time and memory usage than baseline methods. 2. In Tables 1, 2, and 4, all existing methods equipped with BLDA are outdated. It remains questionable whether current SOTA methods (in 2023 and 2024) are sufficient to address prediction bias issues.
1. The paper is well-written and easy to follow. The figures clearly show the distribution trends to help understand the core idea. 2. There are many formula languages to describe the proposed method precisely. 3. The experiments on the GTAv/SYNTHIA/Cityscapes benchmark show clear improvements over baseline methods.
1. The novelty is limited. The data distribution problem is not newly recognized, and the proposed method adopting anchor distributions for alignment and GMM for unbiased generation is also explored by previous methods. For example, the following papers [a-d] also adopt anchors and/or GMM methods to cross-domain alignment. Please consider providing more discussion with these related works. 2. The method is only verified on a relatively small-scale benchmark. The compared works are from two years
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Topic Modeling
