Multichannel Semantic Segmentation with Unsupervised Domain Adaptation
Kohei Watanabe, Kuniaki Saito, Yoshitaka Ushiku, Tatsuya Harada

TL;DR
This paper introduces two novel methods leveraging multichannel inputs and unsupervised domain adaptation to improve semantic segmentation from synthetic to real RGBD images, reducing labeling effort.
Contribution
It proposes fusion-based and multitask learning approaches with UDA for better synthetic-to-real semantic segmentation, and establishes a new benchmark.
Findings
Multitask learning with post-processing improves segmentation accuracy.
The proposed methods outperform baseline models on the benchmark.
Unsupervised domain adaptation effectively bridges synthetic and real image domains.
Abstract
Most contemporary robots have depth sensors, and research on semantic segmentation with RGBD images has shown that depth images boost the accuracy of segmentation. Since it is time-consuming to annotate images with semantic labels per pixel, it would be ideal if we could avoid this laborious work by utilizing an existing dataset or a synthetic dataset which we can generate on our own. Robot motions are often tested in a synthetic environment, where multichannel (eg, RGB + depth + instance boundary) images plus their pixel-level semantic labels are available. However, models trained simply on synthetic images tend to demonstrate poor performance on real images. In order to address this, we propose two approaches that can efficiently exploit multichannel inputs combined with an unsupervised domain adaptation (UDA) algorithm. One is a fusion-based approach that uses depth images as inputs.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Robotics and Sensor-Based Localization · Advanced Vision and Imaging
