LiDARTouch: Monocular metric depth estimation with a few-beam LiDAR
Florent Bartoccioni, \'Eloi Zablocki, Patrick P\'erez, Matthieu Cord,, Karteek Alahari

TL;DR
LiDARTouch introduces a self-supervised framework combining monocular vision with sparse, automotive-grade LiDAR to improve dense depth estimation, addressing scale ambiguity and infinite-depth issues without dense ground-truth data.
Contribution
The paper presents a novel self-supervised method integrating minimal LiDAR data with monocular images for accurate depth estimation, achieving state-of-the-art results.
Findings
Achieves new state-of-the-art in self-supervised depth estimation on KITTI.
Using sparse LiDAR alleviates scale ambiguity and infinite-depth problems.
Adapts supervised depth-completion methods to a self-supervised setting with minimal LiDAR.
Abstract
Vision-based depth estimation is a key feature in autonomous systems, which often relies on a single camera or several independent ones. In such a monocular setup, dense depth is obtained with either additional input from one or several expensive LiDARs, e.g., with 64 beams, or camera-only methods, which suffer from scale-ambiguity and infinite-depth problems. In this paper, we propose a new alternative of densely estimating metric depth by combining a monocular camera with a light-weight LiDAR, e.g., with 4 beams, typical of today's automotive-grade mass-produced laser scanners. Inspired by recent self-supervised methods, we introduce a novel framework, called LiDARTouch, to estimate dense depth maps from monocular images with the help of ``touches'' of LiDAR, i.e., without the need for dense ground-truth depth. In our setup, the minimal LiDAR input contributes on three different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Image Processing Techniques and Applications
