Predicting Depth from Semantic Segmentation using Game Engine Dataset
Mohammad Amin Kashi

TL;DR
This paper introduces a novel depth estimation neural network that incorporates semantic labels, demonstrating a 52% reduction in relative error and improved robustness using synthetic game engine datasets.
Contribution
The paper proposes new network structures that integrate semantic labels with images for depth estimation, enhancing accuracy and robustness, especially when trained on synthetic datasets.
Findings
Semantic labels improve depth estimation accuracy.
Synthetic datasets can effectively train depth networks.
Semantic labels increase robustness against domain shift.
Abstract
Depth perception is fundamental for robots to understand the surrounding environment. As the view of cognitive neuroscience, visual depth perception methods are divided into three categories, namely binocular, active, and pictorial. The first two categories have been studied for decades in detail. However, research for the exploration of the third category is still in its infancy and has got momentum by the advent of deep learning methods in recent years. In cognitive neuroscience, it is known that pictorial depth perception mechanisms are dependent on the perception of seen objects. Inspired by this fact, in this thesis, we investigated the relation of perception of objects and depth estimation convolutional neural networks. For this purpose, we developed new network structures based on a simple depth estimation network that only used a single image at its input. Our proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Image and Video Retrieval Techniques
