RAUM-VO: Rotational Adjusted Unsupervised Monocular Visual Odometry
Claudio Cimarelli, Hriday Bavle, Jose Luis Sanchez-Lopez, Holger Voos

TL;DR
RAUM-VO introduces a novel unsupervised monocular visual odometry method that adjusts rotational estimates using a model-free epipolar constraint, improving accuracy over existing pose networks on KITTI dataset.
Contribution
The paper proposes RAUM-VO, a new approach that combines deep learning with epipolar geometry to enhance rotational accuracy in unsupervised monocular visual odometry.
Findings
Significant accuracy improvement over other unsupervised pose networks.
Reduces complexity compared to hybrid or traditional methods.
Achieves state-of-the-art results on KITTI dataset.
Abstract
Unsupervised learning for monocular camera motion and 3D scene understanding has gained popularity over traditional methods, relying on epipolar geometry or non-linear optimization. Notably, deep learning can overcome many issues of monocular vision, such as perceptual aliasing, low-textured areas, scale-drift, and degenerate motions. Also, concerning supervised learning, we can fully leverage video streams data without the need for depth or motion labels. However, in this work, we note that rotational motion can limit the accuracy of the unsupervised pose networks more than the translational component. Therefore, we present RAUM-VO, an approach based on a model-free epipolar constraint for frame-to-frame motion estimation (F2F) to adjust the rotation during training and online inference. To this end, we match 2D keypoints between consecutive frames using pre-trained deep networks,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
