BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground
Yufei Wei, Sha Lu, Wangtao Lu, Rong Xiong, Yue Wang

TL;DR
This paper introduces BEV-DWPVO, a ground-plane assumption-based monocular visual odometry system that uses Bird's-Eye View features and a differentiable Procrustes solver to reduce scale drift and improve pose estimation accuracy.
Contribution
It proposes a novel BEV-based approach with a differentiable weighted Procrustes method, simplifying pose estimation and enabling end-to-end training without auxiliary tasks.
Findings
Outperforms existing MVO methods on NCLT, Oxford, and KITTI datasets.
Effectively reduces scale drift in long-sequence visual odometry.
Simplifies pose estimation by reducing DoF from 6 to 3.
Abstract
Monocular Visual Odometry (MVO) provides a cost-effective, real-time positioning solution for autonomous vehicles. However, MVO systems face the common issue of lacking inherent scale information from monocular cameras. Traditional methods have good interpretability but can only obtain relative scale and suffer from severe scale drift in long-distance tasks. Learning-based methods under perspective view leverage large amounts of training data to acquire prior knowledge and estimate absolute scale by predicting depth values. However, their generalization ability is limited due to the need to accurately estimate the depth of each point. In contrast, we propose a novel MVO system called BEV-DWPVO. Our approach leverages the common assumption of a ground plane, using Bird's-Eye View (BEV) feature maps to represent the environment in a grid-based structure with a unified scale. This enables…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Soft Robotics and Applications
