What Synthesis is Missing: Depth Adaptation Integrated with Weak Supervision for Indoor Scene Parsing
Keng-Chi Liu, Yi-Ting Shen, Jan P. Klopp, Liang-Gee Chen

TL;DR
This paper introduces a novel depth-based transfer learning approach combined with weak supervision for indoor scene parsing, significantly reducing annotation effort and domain gap, and outperforming previous methods.
Contribution
It proposes a depth transfer domain method combined with weak supervision in a teacher-student framework, improving indoor scene parsing accuracy with less manual annotation.
Findings
Depth transfer reduces domain discrepancy more effectively.
The combined approach halves the gap to fully supervised methods.
Outperforms previous transfer learning approaches on SUN RGB-D.
Abstract
Scene Parsing is a crucial step to enable autonomous systems to understand and interact with their surroundings. Supervised deep learning methods have made great progress in solving scene parsing problems, however, come at the cost of laborious manual pixel-level annotation. To alleviate this effort synthetic data as well as weak supervision have both been investigated. Nonetheless, synthetically generated data still suffers from severe domain shift while weak labels are often imprecise. Moreover, most existing works for weakly supervised scene parsing are limited to salient foreground objects. The aim of this work is hence twofold: Exploit synthetic data where feasible and integrate weak supervision where necessary. More concretely, we address this goal by utilizing depth as transfer domain because its synthetic-to-real discrepancy is much lower than for color. At the same time, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
