TL;DR
This paper introduces submanifold sparse convolutional networks (SSCNs) that efficiently process sparse 3D data like point clouds, achieving state-of-the-art results in 3D semantic segmentation tasks.
Contribution
The paper presents novel sparse convolutional operations and networks tailored for sparse 3D data, significantly improving efficiency and accuracy over previous methods.
Findings
SSCNs outperform prior state-of-the-art in 3D semantic segmentation
The new sparse convolutions are more efficient for sparse data
Models achieve strong performance on benchmark datasets
Abstract
Convolutional networks are the de-facto standard for analyzing spatio-temporal data such as images, videos, and 3D shapes. Whilst some of this data is naturally dense (e.g., photos), many other data sources are inherently sparse. Examples include 3D point clouds that were obtained using a LiDAR scanner or RGB-D camera. Standard "dense" implementations of convolutional networks are very inefficient when applied on such sparse data. We introduce new sparse convolutional operations that are designed to process spatially-sparse data more efficiently, and use them to develop spatially-sparse convolutional networks. We demonstrate the strong performance of the resulting models, called submanifold sparse convolutional networks (SSCNs), on two tasks involving semantic segmentation of 3D point clouds. In particular, our models outperform all prior state-of-the-art on the test set of a recent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSubmanifold Convolution
