Deep Direct Visual Odometry
Chaoqiang Zhao, Yang Tang, Qiyu Sun, Athanasios V. Vasilakos

TL;DR
This paper introduces TrajNet, a deep neural network with geometric constraints that enhances scale-consistency in monocular visual odometry, and integrates it into a new DVO architecture called DDSO for improved robustness and accuracy.
Contribution
It proposes a novel scale-to-trajectory constraint for unsupervised training and embeds TrajNet into a new DVO architecture, DDSO, to improve monocular odometry performance.
Findings
TrajNet significantly improves scale-consistency over previous methods.
DDSO achieves more robust and accurate initialization and tracking.
Experimental results on KITTI validate the effectiveness of the proposed approach.
Abstract
Traditional monocular direct visual odometry (DVO) is one of the most famous methods to estimate the ego-motion of robots and map environments from images simultaneously. However, DVO heavily relies on high-quality images and accurate initial pose estimation during tracking. With the outstanding performance of deep learning, previous works have shown that deep neural networks can effectively learn 6-DoF (Degree of Freedom) poses between frames from monocular image sequences in the unsupervised manner. However, these unsupervised deep learning-based frameworks cannot accurately generate the full trajectory of a long monocular video because of the scale-inconsistency between each pose. To address this problem, we use several geometric constraints to improve the scale-consistency of the pose network, including improving the previous loss function and proposing a novel scale-to-trajectory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
