Predicting Depth from Semantic Segmentation using Game Engine Dataset

Mohammad Amin Kashi

arXiv:2106.15257·cs.CV·June 30, 2021

Predicting Depth from Semantic Segmentation using Game Engine Dataset

Mohammad Amin Kashi

PDF

Open Access

TL;DR

This paper introduces a novel depth estimation neural network that incorporates semantic labels, demonstrating a 52% reduction in relative error and improved robustness using synthetic game engine datasets.

Contribution

The paper proposes new network structures that integrate semantic labels with images for depth estimation, enhancing accuracy and robustness, especially when trained on synthetic datasets.

Findings

01

Semantic labels improve depth estimation accuracy.

02

Synthetic datasets can effectively train depth networks.

03

Semantic labels increase robustness against domain shift.

Abstract

Depth perception is fundamental for robots to understand the surrounding environment. As the view of cognitive neuroscience, visual depth perception methods are divided into three categories, namely binocular, active, and pictorial. The first two categories have been studied for decades in detail. However, research for the exploration of the third category is still in its infancy and has got momentum by the advent of deep learning methods in recent years. In cognitive neuroscience, it is known that pictorial depth perception mechanisms are dependent on the perception of seen objects. Inspired by this fact, in this thesis, we investigated the relation of perception of objects and depth estimation convolutional neural networks. For this purpose, we developed new network structures based on a simple depth estimation network that only used a single image at its input. Our proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Image and Video Retrieval Techniques