LiDARTouch: Monocular metric depth estimation with a few-beam LiDAR

Florent Bartoccioni; \'Eloi Zablocki; Patrick P\'erez; Matthieu Cord,; Karteek Alahari

arXiv:2109.03569·cs.CV·November 28, 2022

LiDARTouch: Monocular metric depth estimation with a few-beam LiDAR

Florent Bartoccioni, \'Eloi Zablocki, Patrick P\'erez, Matthieu Cord,, Karteek Alahari

PDF

Open Access 1 Repo

TL;DR

LiDARTouch introduces a self-supervised framework combining monocular vision with sparse, automotive-grade LiDAR to improve dense depth estimation, addressing scale ambiguity and infinite-depth issues without dense ground-truth data.

Contribution

The paper presents a novel self-supervised method integrating minimal LiDAR data with monocular images for accurate depth estimation, achieving state-of-the-art results.

Findings

01

Achieves new state-of-the-art in self-supervised depth estimation on KITTI.

02

Using sparse LiDAR alleviates scale ambiguity and infinite-depth problems.

03

Adapts supervised depth-completion methods to a self-supervised setting with minimal LiDAR.

Abstract

Vision-based depth estimation is a key feature in autonomous systems, which often relies on a single camera or several independent ones. In such a monocular setup, dense depth is obtained with either additional input from one or several expensive LiDARs, e.g., with 64 beams, or camera-only methods, which suffer from scale-ambiguity and infinite-depth problems. In this paper, we propose a new alternative of densely estimating metric depth by combining a monocular camera with a light-weight LiDAR, e.g., with 4 beams, typical of today's automotive-grade mass-produced laser scanners. Inspired by recent self-supervised methods, we introduce a novel framework, called LiDARTouch, to estimate dense depth maps from monocular images with the help of ``touches'' of LiDAR, i.e., without the need for dense ground-truth depth. In our setup, the minimal LiDAR input contributes on three different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

F-Barto/LiDARTouch
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Image Processing Techniques and Applications