TL;DR
This paper introduces a new deep learning-based local 3D descriptor, DIPs, that is rotation-invariant, robust to occlusions, and effective for point cloud registration across various sensors and environments.
Contribution
The paper proposes DIPs, a novel end-to-end learned 3D local descriptor that is rotation-invariant and generalizes well across different sensor modalities and scene types.
Findings
DIPs achieve comparable performance to state-of-the-art on indoor RGB-D datasets.
DIPs outperform existing descriptors on outdoor laser-scanner datasets.
DIPs successfully generalize to scenes reconstructed with ARCore.
Abstract
We present a simple but yet effective method for learning distinctive 3D local deep descriptors (DIPs) that can be used to register point clouds without requiring an initial alignment. Point cloud patches are extracted, canonicalised with respect to their estimated local reference frame and encoded into rotation-invariant compact descriptors by a PointNet-based deep neural network. DIPs can effectively generalise across different sensor modalities because they are learnt end-to-end from locally and randomly sampled points. Because DIPs encode only local geometric information, they are robust to clutter, occlusions and missing regions. We evaluate and compare DIPs against alternative hand-crafted and deep descriptors on several indoor and outdoor datasets consisting of point clouds reconstructed using different sensors. Results show that DIPs (i) achieve comparable results to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
