Real-Time Joint Semantic Segmentation and Depth Estimation Using   Asymmetric Annotations

Vladimir Nekrasov; Thanuja Dharmasiri; Andrew Spek; Tom Drummond,; Chunhua Shen; Ian Reid

arXiv:1809.04766·cs.CV·February 28, 2019

Real-Time Joint Semantic Segmentation and Depth Estimation Using Asymmetric Annotations

Vladimir Nekrasov, Thanuja Dharmasiri, Andrew Spek, Tom Drummond,, Chunhua Shen, Ian Reid

PDF

4 Repos

TL;DR

This paper presents a real-time, efficient deep learning model that jointly performs semantic segmentation and depth estimation, handling asymmetric datasets and enabling dense 3D scene reconstruction.

Contribution

It introduces a modified real-time segmentation network with reduced computational cost and employs knowledge distillation to manage asymmetric annotations, enabling multi-task learning in a single model.

Findings

01

Achieves state-of-the-art performance with 13ms inference time and 6.5 GFLOPs.

02

Successfully handles indoor and outdoor scenes with a single model.

03

Enables dense 3D semantic reconstruction using raw network predictions.

Abstract

Deployment of deep learning models in robotics as sensory information extractors can be a daunting task to handle, even using generic GPU cards. Here, we address three of its most prominent hurdles, namely, i) the adaptation of a single model to perform multiple tasks at once (in this work, we consider depth estimation and semantic segmentation crucial for acquiring geometric and semantic understanding of the scene), while ii) doing it in real-time, and iii) using asymmetric datasets with uneven numbers of annotations per each modality. To overcome the first two issues, we adapt a recently proposed real-time semantic segmentation network, making changes to further reduce the number of floating point operations. To approach the third issue, we embrace a simple solution based on hard knowledge distillation under the assumption of having access to a powerful `teacher' network. We showcase…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsKnowledge Distillation