Unsupervised Monocular Depth Learning in Dynamic Scenes

Hanhan Li; Ariel Gordon; Hang Zhao; Vincent Casser; Anelia Angelova

arXiv:2010.16404·cs.CV·November 10, 2020·57 cites

Unsupervised Monocular Depth Learning in Dynamic Scenes

Hanhan Li, Ariel Gordon, Hang Zhao, Vincent Casser, Anelia Angelova

PDF

Open Access 5 Repos

TL;DR

This paper introduces a novel unsupervised method for monocular depth estimation in dynamic scenes by jointly learning depth, ego-motion, and object translation fields using photometric consistency and prior regularization, outperforming previous methods.

Contribution

It proposes a new unsupervised approach that leverages prior knowledge about 3D translation sparsity and rigidity to improve depth prediction in dynamic scenes without semantic input.

Findings

01

Achieves higher accuracy than prior methods on dynamic scenes

02

Uses only monocular images and photometric consistency for supervision

03

Regularization based on sparsity and rigidity is effective

Abstract

We present a method for jointly training the estimation of depth, ego-motion, and a dense 3D translation field of objects relative to the scene, with monocular photometric consistency being the sole source of supervision. We show that this apparently heavily underdetermined problem can be regularized by imposing the following prior knowledge about 3D translation fields: they are sparse, since most of the scene is static, and they tend to be constant for rigid moving objects. We show that this regularization alone is sufficient to train monocular depth prediction models that exceed the accuracy achieved in prior work for dynamic scenes, including methods that require semantic input. Code is at https://github.com/google-research/google-research/tree/master/depth_and_motion_learning .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Human Pose and Action Recognition