Deep Patch Visual Odometry
Zachary Teed, Lahav Lipson, Jia Deng

TL;DR
Deep Patch Visual Odometry (DPVO) introduces a recurrent patch-based matching system that achieves superior accuracy and efficiency in monocular VO, outperforming dense flow-based methods with less computational cost.
Contribution
DPVO demonstrates that sparse patch-based matching can surpass dense flow approaches in accuracy and efficiency for visual odometry, challenging previous assumptions.
Findings
DPVO outperforms prior VO systems on benchmarks.
DPVO uses a third of the memory of dense flow methods.
DPVO runs three times faster than previous state-of-the-art.
Abstract
We propose Deep Patch Visual Odometry (DPVO), a new deep learning system for monocular Visual Odometry (VO). DPVO uses a novel recurrent network architecture designed for tracking image patches across time. Recent approaches to VO have significantly improved the state-of-the-art accuracy by using deep networks to predict dense flow between video frames. However, using dense flow incurs a large computational cost, making these previous methods impractical for many use cases. Despite this, it has been assumed that dense flow is important as it provides additional redundancy against incorrect matches. DPVO disproves this assumption, showing that it is possible to get the best accuracy and efficiency by exploiting the advantages of sparse patch-based matching over dense flow. DPVO introduces a novel recurrent update operator for patch based correspondence coupled with differentiable bundle…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques
