Multichannel Semantic Segmentation with Unsupervised Domain Adaptation

Kohei Watanabe; Kuniaki Saito; Yoshitaka Ushiku; Tatsuya Harada

arXiv:1812.04351 · cs.CV · December 12, 2018 · 1 citation

TL;DR

This paper proposes two methods that combine multichannel inputs with unsupervised domain adaptation (UDA) so that semantic segmentation models trained on labeled synthetic images transfer to real RGBD images, reducing the need for per-pixel annotation.

Contribution

The paper proposes a fusion-based approach and a multitask learning approach, each combined with UDA, for synthetic-to-real semantic segmentation, and establishes a new benchmark.

Findings

01. Multitask learning with post-processing improves segmentation accuracy.

02. The proposed methods outperform baseline models on the benchmark.

03. Unsupervised domain adaptation effectively bridges synthetic and real image domains.

Abstract

Most contemporary robots have depth sensors, and research on semantic segmentation with RGBD images has shown that depth images boost the accuracy of segmentation. Since it is time-consuming to annotate images with semantic labels per pixel, it would be ideal if we could avoid this laborious work by utilizing an existing dataset or a synthetic dataset which we can generate on our own. Robot motions are often tested in a synthetic environment, where multichannel (e.g., RGB + depth + instance boundary) images plus their pixel-level semantic labels are available. However, models trained simply on synthetic images tend to demonstrate poor performance on real images. In order to address this, we propose two approaches that can efficiently exploit multichannel inputs combined with an unsupervised domain adaptation (UDA) algorithm. One is a fusion-based approach that uses depth images as inputs.…

Code & Models

Repositories

LittleWat/multichannel-semseg-with-uda
PyTorch · Official

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Robotics and Sensor-Based Localization · Advanced Vision and Imaging