DeepAVO: Efficient Pose Refining with Feature Distilling for Deep Visual   Odometry

Ran Zhu; Mingkun Yang; Wang Liu; Rujun Song; Bo Yan; Zhuoling Xiao

arXiv:2105.09899·cs.CV·May 21, 2021·1 cites

DeepAVO: Efficient Pose Refining with Feature Distilling for Deep Visual Odometry

Ran Zhu, Mingkun Yang, Wang Liu, Rujun Song, Bo Yan, Zhuoling Xiao

PDF

Open Access

TL;DR

DeepAVO introduces a novel deep learning framework with feature distilling and attention mechanisms to improve monocular visual odometry accuracy, outperforming existing methods in diverse scenarios.

Contribution

The paper proposes a four-branch CNN architecture with channel-spatial attention for enhanced motion-specific feature extraction in monocular VO.

Findings

01

DeepAVO outperforms state-of-the-art monocular VO methods.

02

Achieves competitive results with stereo VO algorithms.

03

Demonstrates strong generalization across outdoor and indoor datasets.

Abstract

The technology for Visual Odometry (VO) that estimates the position and orientation of the moving object through analyzing the image sequences captured by on-board cameras, has been well investigated with the rising interest in autonomous driving. This paper studies monocular VO from the perspective of Deep Learning (DL). Unlike most current learning-based methods, our approach, called DeepAVO, is established on the intuition that features contribute discriminately to different motion patterns. Specifically, we present a novel four-branch network to learn the rotation and translation by leveraging Convolutional Neural Networks (CNNs) to focus on different quadrants of optical flow input. To enhance the ability of feature selection, we further introduce an effective channel-spatial attention mechanism to force each branch to explicitly distill related information for specific Frame to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Video Surveillance and Tracking Methods