ShellNet: Efficient Point Cloud Convolutional Neural Networks using Concentric Shells Statistics
Zhiyuan Zhang, Binh-Son Hua, Sai-Kit Yeung

TL;DR
ShellNet introduces ShellConv, a permutation-invariant convolution using concentric shells, enabling efficient, fast training of deep neural networks directly on point cloud data for various 3D scene understanding tasks.
Contribution
The paper proposes ShellConv, a novel convolution operator based on concentric shells, and ShellNet, an efficient neural network architecture for point cloud analysis.
Findings
Achieves state-of-the-art results in object classification and segmentation.
Maintains fast training speed and simple architecture.
Effectively handles large receptive fields on point clouds.
Abstract
Deep learning with 3D data has progressed significantly since the introduction of convolutional neural networks that can handle point order ambiguity in point cloud data. While being able to achieve good accuracies in various scene understanding tasks, previous methods often have low training speed and complex network architecture. In this paper, we address these problems by proposing an efficient end-to-end permutation invariant convolution for point cloud deep learning. Our simple yet effective convolution operator named ShellConv uses statistics from concentric spherical shells to define representative features and resolve the point order ambiguity, allowing traditional convolution to perform on such features. Based on ShellConv we further build an efficient neural network named ShellNet to directly consume the point clouds with larger receptive fields while maintaining less layers.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · 3D Surveying and Cultural Heritage · Remote Sensing and LiDAR Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Convolution
