Deep Robust Single Image Depth Estimation Neural Network Using Scene Understanding
Haoyu Ren, Mostafa El-khamy, Jungwon Lee

TL;DR
This paper introduces a two-stage deep learning framework for single image depth estimation that combines scene understanding with specialized depth networks, achieving high accuracy and efficiency across indoor and outdoor scenes.
Contribution
The paper presents a novel two-stage framework with scene classification and multi-task depth networks, improving robustness and accuracy in diverse environments.
Findings
Achieves competitive accuracy on NYU dataset.
Demonstrates good performance on ScanNet and KITTI datasets.
Outperforms ROB 2018 submissions in robustness.
Abstract
Single image depth estimation (SIDE) plays a crucial role in 3D computer vision. In this paper, we propose a two-stage robust SIDE framework that can perform blind SIDE for both indoor and outdoor scenes. At the first stage, the scene understanding module will categorize the RGB image into different depth-ranges. We introduce two different scene understanding modules based on scene classification and coarse depth estimation respectively. At the second stage, SIDE networks trained by the images of specific depth-range are applied to obtain an accurate depth map. In order to improve the accuracy, we further design a multi-task encoding-decoding SIDE network DS-SIDENet based on depthwise separable convolutions. DS-SIDENet is optimized to minimize both depth classification and depth regression losses. This improves the accuracy compared to a single-task SIDE network. Experimental results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Advanced Image Processing Techniques
