Learning Optical Flow, Depth, and Scene Flow without Real-World Labels
Vitor Guizilini, Kuan-Hui Lee, Rares Ambrus, Adrien Gaidon

TL;DR
This paper introduces DRAFT, a self-supervised method that jointly learns depth, optical flow, and scene flow from monocular videos, effectively handling dynamic scenes and achieving state-of-the-art results on KITTI.
Contribution
DRAFT combines synthetic data with geometric self-supervision to jointly learn depth, optical flow, and scene flow, addressing the ill-posed nature of the problem.
Findings
Achieves state-of-the-art performance on KITTI benchmark.
Effectively models dynamic scenes with explicit scene flow estimation.
Leverages optical flow as an intermediate task to improve depth and scene flow learning.
Abstract
Self-supervised monocular depth estimation enables robots to learn 3D perception from raw video streams. This scalable approach leverages projective geometry and ego-motion to learn via view synthesis, assuming the world is mostly static. Dynamic scenes, which are common in autonomous driving and human-robot interaction, violate this assumption. Therefore, they require modeling dynamic objects explicitly, for instance via estimating pixel-wise 3D motion, i.e. scene flow. However, the simultaneous self-supervised learning of depth and scene flow is ill-posed, as there are infinitely many combinations that result in the same 3D point. In this paper we propose DRAFT, a new method capable of jointly learning depth, optical flow, and scene flow by combining synthetic data with geometric self-supervision. Building upon the RAFT architecture, we learn optical flow as an intermediate task to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Optical measurement and interference techniques
