What Matters for 3D Scene Flow Network
Guangming Wang, Yunzhe Hu, Zhe Liu, Yiyang Zhou, Masayoshi Tomizuka,, Wei Zhan, and Hesheng Wang

TL;DR
This paper introduces a novel all-to-all flow embedding layer with backward validation for 3D scene flow estimation from point clouds, significantly improving accuracy and achieving state-of-the-art results on benchmark datasets.
Contribution
The paper proposes a new all-to-all flow embedding layer with backward reliability validation and systematically compares key design choices, leading to a superior 3D scene flow network.
Findings
Achieves state-of-the-art performance on FlyingThings3D and KITTI datasets.
Surpasses existing methods by at least 38.2% and 24.7% in EPE3D metric.
Demonstrates the effectiveness of the proposed all-to-all flow embedding and validation approach.
Abstract
3D scene flow estimation from point clouds is a low-level 3D motion perception task in computer vision. Flow embedding is a commonly used technique in scene flow estimation, and it encodes the point motion between two consecutive frames. Thus, it is critical for the flow embeddings to capture the correct overall direction of the motion. However, previous works only search locally to determine a soft correspondence, ignoring the distant points that turn out to be the actual matching ones. In addition, the estimated correspondence is usually from the forward direction of the adjacent point clouds, and may not be consistent with the estimated correspondence acquired from the backward direction. To tackle these problems, we propose a novel all-to-all flow embedding layer with backward reliability validation during the initial scene flow estimation. Besides, we investigate and compare…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Advanced Image Processing Techniques
