Monocular Depth Prediction through Continuous 3D Loss
Minghan Zhu, Maani Ghaffari, Yuanxin Zhong, Pingping Lu, Zhong Cao,, Ryan M. Eustice, Huei Peng

TL;DR
This paper introduces a novel continuous 3D loss function for monocular depth prediction that leverages sparse LIDAR data to improve accuracy and 3D structure consistency across multiple state-of-the-art models.
Contribution
The paper proposes a continuous 3D loss function that enhances monocular depth estimation by effectively utilizing sparse LIDAR points during training.
Findings
Improved depth prediction accuracy across tested models.
Produced more geometrically consistent 3D point clouds.
Demonstrated general applicability of the loss function.
Abstract
This paper reports a new continuous 3D loss function for learning depth from monocular images. The dense depth prediction from a monocular image is supervised using sparse LIDAR points, which enables us to leverage available open source datasets with camera-LIDAR sensor suites during training. Currently, accurate and affordable range sensor is not readily available. Stereo cameras and LIDARs measure depth either inaccurately or sparsely/costly. In contrast to the current point-to-point loss evaluation approach, the proposed 3D loss treats point clouds as continuous objects; therefore, it compensates for the lack of dense ground truth depth due to LIDAR's sparsity measurements. We applied the proposed loss in three state-of-the-art monocular depth prediction approaches DORN, BTS, and Monodepth2. Experimental evaluation shows that the proposed loss improves the depth prediction accuracy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Robotics and Sensor-Based Localization
