Unsupervised Learning of Depth and Ego-Motion from Video

Tinghui Zhou; Matthew Brown; Noah Snavely; David G. Lowe

arXiv:1704.07813 · cs.CV · August 2, 2017 · 222 cites

TL;DR

This paper introduces an unsupervised learning approach for estimating depth and camera motion from monocular video, achieving results comparable to supervised methods without requiring ground-truth labels.

Contribution

The authors propose an unsupervised framework that jointly trains depth and camera pose estimation networks, using view synthesis as the supervisory signal. The two networks are coupled only during training and can be applied independently at test time.

Findings

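01 Monocular depth estimation on KITTI performs comparably with supervised methods that train on ground-truth pose or depth.

02 Pose estimation compares favorably with established SLAM systems under comparable input settings.

03 The framework trains on unstructured monocular video sequences.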

Abstract

We present an unsupervised learning framework for the task of monocular depth and camera motion estimation from unstructured video sequences. We achieve this by simultaneously training depth and camera pose estimation networks using the task of view synthesis as the supervisory signal. The networks are thus coupled via the view synthesis objective during training, but can be applied independently at test time. Empirical evaluation on the KITTI dataset demonstrates the effectiveness of our approach: 1) monocular depth performing comparably with supervised methods that use either ground-truth pose or depth for training, and 2) pose estimation performing favorably with established SLAM systems under comparable input settings.
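The view synthesis objective described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: it assumes a pinhole camera with known intrinsics `K`, a predicted per-pixel depth map for the target frame, and a predicted 4x4 relative pose `T_t2s` from the target to the source camera. The function names are hypothetical, and nearest-neighbor sampling is used here for brevity, whereas a trainable system would need a differentiable sampler (for example, bilinear interpolation).

```python
import numpy as np

def project_to_source(depth, K, T_t2s):
    """Map every target pixel to (x, y) coordinates in the source image.

    depth : (H, W) depth of the target frame (assumed positive)
    K     : (3, 3) camera intrinsics
    T_t2s : (4, 4) rigid transform, target camera -> source camera
    """
    H, W = depth.shape
    ys, xs = np.meshgrid(np.arange(H), np.arange(W), indexing="ij")
    # Homogeneous pixel coordinates, shape (3, H*W)
    pix = np.stack([xs, ys, np.ones_like(xs)], axis=-1).reshape(-1, 3).T
    pix = pix.astype(np.float64)
    # Back-project to 3D points in the target camera frame
    cam = (np.linalg.inv(K) @ pix) * depth.reshape(1, -1)
    cam_h = np.vstack([cam, np.ones((1, cam.shape[1]))])
    # Move points into the source camera frame and project with K
    src = K @ (T_t2s @ cam_h)[:3]
    xy = src[:2] / src[2:3]  # assumes points stay in front of the camera
    return xy.T.reshape(H, W, 2)

def photometric_loss(target, source, depth, K, T_t2s):
    """Mean absolute difference between the target image and the source
    image warped into the target view (nearest-neighbor sampling)."""
    H, W = depth.shape
    xy = np.rint(project_to_source(depth, K, T_t2s)).astype(int)
    x, y = xy[..., 0], xy[..., 1]
    # Ignore pixels that project outside the source image
    valid = (x >= 0) & (x < W) & (y >= 0) & (y < H)
    warped = source[np.clip(y, 0, H - 1), np.clip(x, 0, W - 1)]
    return np.abs(target - warped)[valid].mean()
```

With an identity pose and identical target and source images, the loss is zero. Errors in either the depth map or the pose increase the loss, which is why a single objective of this kind can supervise both networks at once.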

Code & Models

Videos

Unsupervised Learning of Depth and Ego-Motion From Video · YouTube

Taxonomy

Topics: Advanced Vision and Imaging · Robotics and Sensor-Based Localization · Optical measurement and interference techniques