Self-Supervised Scene Flow Estimation with Point-Voxel Fusion and Surface Representation
Xuezhi Xiang, Xi Wang, Lei Zhang, Denis Ombati, Himaloy Himu and, Xiantong Zhen

TL;DR
This paper introduces a novel self-supervised scene flow estimation method combining point-voxel fusion and surface representation to improve accuracy and efficiency in capturing 3D motion from point clouds.
Contribution
It proposes a new point-voxel fusion approach with surface feature encoding, outperforming existing self-supervised methods and rivaling fully supervised techniques.
Findings
Outperforms all other self-supervised methods on benchmark datasets.
Achieves significant reduction in endpoint error (EPE) by over 8% on KITTI datasets.
Demonstrates effectiveness of surface features in capturing complex 3D structures.
Abstract
Scene flow estimation aims to generate the 3D motion field of points between two consecutive frames of point clouds, which has wide applications in various fields. Existing point-based methods ignore the irregularity of point clouds and have difficulty capturing long-range dependencies due to the inefficiency of point-level computation. Voxel-based methods suffer from the loss of detail information. In this paper, we propose a point-voxel fusion method, where we utilize a voxel branch based on sparse grid attention and the shifted window strategy to capture long-range dependencies and a point branch to capture fine-grained features to compensate for the information loss in the voxel branch. In addition, since xyz coordinates are difficult to describe the geometric structure of complex 3D objects in the scene, we explicitly encode the local surface information of the point cloud through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · Advanced Image Processing Techniques
MethodsSoftmax · Attention Is All You Need
