TL;DR
This paper introduces a deep learning method for estimating 3D scene flow in autonomous driving scenes, leveraging object-level reasoning and weak supervision to improve accuracy and generalization.
Contribution
The method enables rigid 3D scene flow estimation with minimal supervision by integrating object-level reasoning and test-time optimization.
Findings
Effective on four autonomous driving datasets
Requires only binary segmentation and ego-motion annotations
Generalizes well to different datasets
Abstract
We propose a data-driven scene flow estimation algorithm exploiting the observation that many 3D scenes can be explained by a collection of agents moving as rigid bodies. At the core of our method lies a deep architecture able to reason at the \textbf{object-level} by considering 3D scene flow in conjunction with other 3D tasks. This object level abstraction, enables us to relax the requirement for dense scene flow supervision with simpler binary background segmentation mask and ego-motion annotations. Our mild supervision requirements make our method well suited for recently released massive data collections for autonomous driving, which do not contain dense scene flow annotations. As output, our model provides low-level cues like pointwise flow and higher-level cues such as holistic scene understanding at the level of rigid objects. We further propose a test-time optimization refining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
