TL;DR
MinkLoc3D introduces a novel, efficient 3D point cloud descriptor using sparse voxelization and 3D convolutions, significantly improving large-scale place recognition accuracy over existing methods.
Contribution
The paper proposes MinkLoc3D, a new approach leveraging sparse voxelization and 3D convolutions for discriminative point cloud descriptors, outperforming prior methods.
Findings
MinkLoc3D outperforms state-of-the-art descriptors on standard benchmarks.
The method is simple, efficient, and suitable for large-scale place recognition.
Code is publicly available for reproducibility.
Abstract
The paper presents a learning-based method for computing a discriminative 3D point cloud descriptor for place recognition purposes. Existing methods, such as PointNetVLAD, are based on unordered point cloud representation. They use PointNet as the first processing step to extract local features, which are later aggregated into a global descriptor. The PointNet architecture is not well suited to capture local geometric structures. Thus, state-of-the-art methods enhance vanilla PointNet architecture by adding different mechanism to capture local contextual information, such as graph convolutional networks or using hand-crafted features. We present an alternative approach, dubbed MinkLoc3D, to compute a discriminative 3D point cloud descriptor, based on a sparse voxelized point cloud representation and sparse 3D convolutions. The proposed method has a simple and efficient architecture.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGraph Convolutional Networks · eToro Customer Care Number +1-833-534-1729
