Learning Residual Flow as Dynamic Motion from Stereo Videos

Seokju Lee; Sunghoon Im; Stephen Lin; In So Kweon

arXiv:1909.06999·cs.CV·September 17, 2019

Learning Residual Flow as Dynamic Motion from Stereo Videos

Seokju Lee, Sunghoon Im, Stephen Lin, In So Kweon

PDF

TL;DR

This paper introduces an unsupervised learning framework that decomposes scene flow from stereo videos into static and dynamic components, improving motion understanding and scene reconstruction.

Contribution

It proposes a joint reasoning approach with three networks for stereo matching, camera motion, and residual flow, leveraging geometry for accurate 3D scene flow estimation.

Findings

01

Outperforms state-of-the-art on KITTI for optical flow

02

Effective separation of static and dynamic scene components

03

Accurate 3D motion estimation of dynamic objects

Abstract

We present a method for decomposing the 3D scene flow observed from a moving stereo rig into stationary scene elements and dynamic object motion. Our unsupervised learning framework jointly reasons about the camera motion, optical flow, and 3D motion of moving objects. Three cooperating networks predict stereo matching, camera motion, and residual flow, which represents the flow component due to object motion and not from camera motion. Based on rigid projective geometry, the estimated stereo depth is used to guide the camera motion estimation, and the depth and camera motion are used to guide the residual flow estimation. We also explicitly estimate the 3D scene flow of dynamic objects based on the residual flow and scene depth. Experiments on the KITTI dataset demonstrate the effectiveness of our approach and show that our method outperforms other state-of-the-art algorithms on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.