WGANVO: Monocular Visual Odometry based on Generative Adversarial Networks

Javier Cremona; Lucas Uzal; Taihú Pire

arXiv:2007.13704 · cs.CV · July 28, 2020 · 5 cites

TL;DR

WGANVO is a deep learning method for monocular visual odometry that estimates absolute scale without prior knowledge. It is trained with a semi-supervised approach and runs in real time with promising accuracy on the KITTI dataset.

Contribution

It presents WGANVO, a novel neural network approach for monocular visual odometry that recovers absolute scale without additional information.

Findings

01

Operates in real time on the KITTI dataset.

02

Achieves encouraging accuracy in pose estimation.

03

Does not require prior knowledge of the scene scale.

Abstract

In this work we present WGANVO, a Deep Learning-based monocular Visual Odometry method. In particular, a neural network is trained to regress a pose estimate from an image pair. The training is performed using a semi-supervised approach. Unlike geometry-based monocular methods, the proposed method can recover the absolute scale of the scene without either prior knowledge or extra information. The system is evaluated on the well-known KITTI dataset, where it is shown to run in real time with accuracy that encourages further development of Deep Learning-based methods.
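
The abstract describes regressing a pose from each image pair. As a rough illustration of how such per-pair estimates are typically turned into a trajectory in visual odometry, the sketch below chains relative transforms into global camera poses. It is not taken from the WGANVO code: the helper names `pose_to_matrix` and `accumulate_trajectory`, the 4x4 homogeneous representation, and the frame convention are all assumptions made for this example, and the network output is replaced by a hand-written stand-in.

```python
import numpy as np


def pose_to_matrix(rotation, translation):
    """Build a 4x4 homogeneous transform from a 3x3 rotation and a 3-vector.

    Hypothetical helper for this sketch, not part of any WGANVO API.
    """
    T = np.eye(4)
    T[:3, :3] = rotation
    T[:3, 3] = translation
    return T


def accumulate_trajectory(relative_poses):
    """Chain per-pair relative transforms into global camera poses.

    Assumed convention: relative_poses[k] expresses frame k+1 in the
    coordinates of frame k, so global[k+1] = global[k] @ relative_poses[k].
    The first camera frame is taken as the world origin.
    """
    current = np.eye(4)
    trajectory = [current.copy()]
    for T_rel in relative_poses:
        current = current @ T_rel
        trajectory.append(current.copy())
    return trajectory


# Stand-in for network outputs: three identical steps of 1 m along the
# camera z axis with no rotation. A real system would obtain one such
# transform per consecutive image pair.
step = pose_to_matrix(np.eye(3), np.array([0.0, 0.0, 1.0]))
trajectory = accumulate_trajectory([step, step, step])
final_position = trajectory[-1][:3, 3]
```

Because each relative translation here is expressed in metres, the accumulated positions are metric as well. With a scale-ambiguous estimate, the same chaining would only yield a trajectory up to an unknown factor, which is the limitation the abstract contrasts against.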

Code & Models

Repositories

CIFASIS/wganvo · tf · Official

Taxonomy

Topics: Robotics and Sensor-Based Localization · Advanced Vision and Imaging · Image and Object Detection Techniques