Region Deformer Networks for Unsupervised Depth Estimation from   Unconstrained Monocular Videos

Haofei Xu; Jianmin Zheng; Jianfei Cai; Juyong Zhang

arXiv:1902.09907·cs.CV·May 24, 2019·1 cites

Region Deformer Networks for Unsupervised Depth Estimation from Unconstrained Monocular Videos

Haofei Xu, Jianmin Zheng, Jianfei Cai, Juyong Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces Region Deformer Networks (RDN) for unsupervised depth estimation from unconstrained monocular videos, effectively handling dynamic scenes with rigid and non-rigid motions, and achieving state-of-the-art results.

Contribution

The novel RDN architecture models individual object motions using a deformation-based representation, enabling depth estimation in complex, real-world scenarios without ground truth supervision.

Findings

01

Achieves state-of-the-art results on KITTI and Cityscapes benchmarks.

02

Demonstrates effectiveness on crowded pedestrian tracking datasets.

03

Handles diverse object motions in unconstrained videos.

Abstract

While learning based depth estimation from images/videos has achieved substantial progress, there still exist intrinsic limitations. Supervised methods are limited by a small amount of ground truth or labeled data and unsupervised methods for monocular videos are mostly based on the static scene assumption, not performing well on real world scenarios with the presence of dynamic objects. In this paper, we propose a new learning based method consisting of DepthNet, PoseNet and Region Deformer Networks (RDN) to estimate depth from unconstrained monocular videos without ground truth supervision. The core contribution lies in RDN for proper handling of rigid and non-rigid motions of various objects such as rigidly moving cars and deformable humans. In particular, a deformation based motion representation is proposed to model individual object motion on 2D images. This representation enables…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

haofeixu/rdn4depth
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Video Surveillance and Tracking Methods · Advanced Image Processing Techniques