Scale-aware direct monocular odometry

Carlos Campos; Juan D. Tard\'os

arXiv:2109.10077·cs.RO·July 25, 2022

Scale-aware direct monocular odometry

Carlos Campos, Juan D. Tard\'os

PDF

Open Access

TL;DR

This paper introduces a scale-aware monocular odometry framework that integrates multi-view depth prediction and robust optimization, significantly improving accuracy and eliminating scale drift in monocular SLAM.

Contribution

It proposes a novel depth residual formulation and a robust cost function, enabling effective multi-view depth integration without relying on specific neural networks.

Findings

01

Outperforms classic monocular SLAM by 5-9 times in accuracy.

02

Achieves accuracy comparable to stereo systems.

03

Validates the approach on the KITTI dataset with multiple neural networks.

Abstract

We present a generic framework for scale-aware direct monocular odometry based on depth prediction from a deep neural network. In contrast with previous methods where depth information is only partially exploited, we formulate a novel depth prediction residual which allows us to incorporate multi-view depth information. In addition, we propose to use a truncated robust cost function which prevents considering inconsistent depth estimations. The photometric and depth-prediction measurements are integrated into a tightly-coupled optimization leading to a scale-aware monocular system which does not accumulate scale drift. Our proposal does not particularize for a concrete neural network, being able to work along with the vast majority of the existing depth prediction solutions. We demonstrate the validity and generality of our proposal evaluating it on the KITTI odometry dataset, using two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques