What Matters for 3D Scene Flow Network

Guangming Wang; Yunzhe Hu; Zhe Liu; Yiyang Zhou; Masayoshi Tomizuka,; Wei Zhan; and Hesheng Wang

arXiv:2207.09143·cs.CV·July 20, 2022

What Matters for 3D Scene Flow Network

Guangming Wang, Yunzhe Hu, Zhe Liu, Yiyang Zhou, Masayoshi Tomizuka,, Wei Zhan, and Hesheng Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel all-to-all flow embedding layer with backward validation for 3D scene flow estimation from point clouds, significantly improving accuracy and achieving state-of-the-art results on benchmark datasets.

Contribution

The paper proposes a new all-to-all flow embedding layer with backward reliability validation and systematically compares key design choices, leading to a superior 3D scene flow network.

Findings

01

Achieves state-of-the-art performance on FlyingThings3D and KITTI datasets.

02

Surpasses existing methods by at least 38.2% and 24.7% in EPE3D metric.

03

Demonstrates the effectiveness of the proposed all-to-all flow embedding and validation approach.

Abstract

3D scene flow estimation from point clouds is a low-level 3D motion perception task in computer vision. Flow embedding is a commonly used technique in scene flow estimation, and it encodes the point motion between two consecutive frames. Thus, it is critical for the flow embeddings to capture the correct overall direction of the motion. However, previous works only search locally to determine a soft correspondence, ignoring the distant points that turn out to be the actual matching ones. In addition, the estimated correspondence is usually from the forward direction of the adjacent point clouds, and may not be consistent with the estimated correspondence acquired from the backward direction. To tackle these problems, we propose a novel all-to-all flow embedding layer with backward reliability validation during the initial scene flow estimation. Besides, we investigate and compare…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

irmvlab/3dflow
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Advanced Image Processing Techniques