Learning Semantic Segmentation from Multiple Datasets with Label Shifts
Dongwan Kim, Yi-Hsuan Tsai, Yumin Suh, Masoud Faraki, Sparsh Garg,, Manmohan Chandraker, Bohyung Han

TL;DR
This paper introduces UniSeg, a novel method for training semantic segmentation models across multiple datasets with different label spaces, effectively handling label conflicts and relationships to improve generalization, especially on unseen datasets.
Contribution
UniSeg presents a new training framework with two specialized loss functions to automatically handle label conflicts and relationships across datasets without manual relabeling.
Findings
Achieves over 8% IoU gain on KITTI dataset.
Improves generalization to unseen datasets.
Outperforms multi-dataset baselines in experiments.
Abstract
With increasing applications of semantic segmentation, numerous datasets have been proposed in the past few years. Yet labeling remains expensive, thus, it is desirable to jointly train models across aggregations of datasets to enhance data volume and diversity. However, label spaces differ across datasets and may even be in conflict with one another. This paper proposes UniSeg, an effective approach to automatically train models across multiple datasets with differing label spaces, without any manual relabeling efforts. Specifically, we propose two losses that account for conflicting and co-occurring labels to achieve better generalization performance in unseen domains. First, a gradient conflict in training due to mismatched label spaces is identified and a class-independent binary cross-entropy loss is proposed to alleviate such label conflicts. Second, a loss function that considers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Infrastructure Maintenance and Monitoring · Multimodal Machine Learning Applications
