BEV-ODOM: Reducing Scale Drift in Monocular Visual Odometry with BEV Representation

Yufei Wei; Sha Lu; Fuzhang Han; Rong Xiong; Yue Wang

arXiv:2411.10195·cs.RO·November 18, 2024

TL;DR

BEV-ODOM introduces a novel Bird's Eye View-based monocular visual odometry framework that effectively reduces scale drift and improves long-term motion estimation accuracy without requiring depth supervision.

Contribution

The paper presents BEV-ODOM, a monocular visual odometry (MVO) approach that uses a BEV representation, built by a depth-based perspective-view (PV) to BEV encoder, to improve scale accuracy without complex optimization.

Findings

01 Reduces scale drift in long-term sequences

02 Achieves higher accuracy than existing MVO methods

03 Estimates motion accurately across the NCLT, Oxford, and KITTI datasets

Abstract

Monocular visual odometry (MVO) is vital in autonomous navigation and robotics, providing a cost-effective and flexible motion tracking solution, but the inherent scale ambiguity in monocular setups often leads to cumulative errors over time. In this paper, we present BEV-ODOM, a novel MVO framework leveraging the Bird's Eye View (BEV) Representation to address scale drift. Unlike existing approaches, BEV-ODOM integrates a depth-based perspective-view (PV) to BEV encoder, a correlation feature extraction neck, and a CNN-MLP-based decoder, enabling it to estimate motion across three degrees of freedom without the need for depth supervision or complex optimization techniques. Our framework reduces scale drift in long-term sequences and achieves accurate motion estimation across various datasets, including NCLT, Oxford, and KITTI. The results indicate that BEV-ODOM outperforms current MVO…
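To make the notions of three-degree-of-freedom motion and scale drift concrete, the sketch below chains planar relative motions (forward, lateral, yaw) into a trajectory and shows how a constant scale error grows with distance travelled. It is an illustration only and not the authors' code: the function names `compose`, `integrate`, and `scale_deltas`, the body-frame convention, and the 2% scale error are all assumptions chosen for the example.

```python
import math


def compose(pose, delta):
    """Apply a body-frame relative motion (dx, dy, dyaw) to a global pose (x, y, yaw)."""
    x, y, yaw = pose
    dx, dy, dyaw = delta
    c, s = math.cos(yaw), math.sin(yaw)
    new_x = x + c * dx - s * dy
    new_y = y + s * dx + c * dy
    # Wrap the heading to (-pi, pi] so it stays bounded over long sequences.
    new_yaw = math.atan2(math.sin(yaw + dyaw), math.cos(yaw + dyaw))
    return (new_x, new_y, new_yaw)


def integrate(deltas, start=(0.0, 0.0, 0.0)):
    """Chain frame-to-frame motions into a list of global poses."""
    trajectory = [start]
    for delta in deltas:
        trajectory.append(compose(trajectory[-1], delta))
    return trajectory


def scale_deltas(deltas, scale):
    """Simulate a translation scale error; rotation is left untouched."""
    return [(scale * dx, scale * dy, dyaw) for dx, dy, dyaw in deltas]


# Hypothetical data: 100 frames, 1 m straight ahead per frame.
true_deltas = [(1.0, 0.0, 0.0)] * 100
# Assumed estimator that overestimates translation by 2% on every frame.
estimated_deltas = scale_deltas(true_deltas, 1.02)

true_end = integrate(true_deltas)[-1]
est_end = integrate(estimated_deltas)[-1]
end_error = math.hypot(est_end[0] - true_end[0], est_end[1] - true_end[1])

# Sanity check of the composition: four "1 m forward, turn 90 degrees" steps
# trace a closed square.
square_end = integrate([(1.0, 0.0, math.pi / 2)] * 4)[-1]
```

Because each per-frame error is integrated rather than corrected, the endpoint error here grows linearly with path length; this is the kind of accumulation that a metrically consistent scale estimate is meant to limit.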

Taxonomy

Topics: Advanced Vision and Imaging · Robotics and Sensor-Based Localization · Image and Object Detection Techniques