GANVO: Unsupervised Deep Monocular Visual Odometry and Depth Estimation with Generative Adversarial Networks
Yasin Almalioglu, Muhamad Risqi U. Saputra, Pedro P. B. de Gusmao,, Andrew Markham, Niki Trigoni

TL;DR
This paper introduces an unsupervised deep learning framework using GANs for monocular visual odometry and depth estimation from unlabelled image sequences, outperforming existing methods on standard datasets.
Contribution
It presents a novel generative adversarial network-based approach for unsupervised monocular VO and depth estimation, eliminating the need for labeled data.
Findings
Outperforms traditional and existing unsupervised VO methods
Achieves better pose estimation accuracy
Provides superior depth recovery results
Abstract
In the last decade, supervised deep learning approaches have been extensively employed in visual odometry (VO) applications, which is not feasible in environments where labelled data is not abundant. On the other hand, unsupervised deep learning approaches for localization and mapping in unknown environments from unlabelled data have received comparatively less attention in VO research. In this study, we propose a generative unsupervised learning framework that predicts 6-DoF pose camera motion and monocular depth map of the scene from unlabelled RGB image sequences, using deep convolutional Generative Adversarial Networks (GANs). We create a supervisory signal by warping view sequences and assigning the re-projection minimization to the objective loss function that is adopted in multi-view pose estimation and single-view depth generation network. Detailed quantitative and qualitative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
