Point-Voxel CNN for Efficient 3D Deep Learning

Zhijian Liu; Haotian Tang; Yujun Lin; Song Han

arXiv:1907.03739 · cs.CV · December 11, 2019 · 337 citations

TL;DR

PVCNN combines point-based and voxel-based methods to create a memory- and computation-efficient 3D deep learning model that outperforms existing approaches in accuracy and speed.

Contribution

The paper introduces PVCNN, a novel hybrid architecture that reduces memory and computation costs while improving accuracy in 3D deep learning tasks.

Findings

01

Achieves higher accuracy than the voxel-based baseline with a 10x reduction in GPU memory.

02

Outperforms state-of-the-art point-based models in accuracy, with a 7x measured speedup on average.

03

A narrower version of PVCNN achieves a 2x speedup over PointNet with higher accuracy.

Abstract

We present Point-Voxel CNN (PVCNN) for efficient, fast 3D deep learning. Previous work processes 3D data using either voxel-based or point-based NN models. However, both approaches are computationally inefficient. The computation cost and memory footprints of the voxel-based models grow cubically with the input resolution, making it memory-prohibitive to scale up the resolution. As for point-based networks, up to 80% of the time is wasted on structuring the sparse data which have rather poor memory locality, not on the actual feature extraction. In this paper, we propose PVCNN that represents the 3D input data in points to reduce the memory consumption, while performing the convolutions in voxels to reduce the irregular, sparse data access and improve the locality. Our PVCNN model is both memory and computation efficient. Evaluated on semantic and part segmentation datasets, it achieves…
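The point-voxel idea in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation. The function names (`voxelize`, `devoxelize`, `point_voxel_fuse`, `dense_voxel_cells`) are hypothetical, features are single scalars for brevity, devoxelization uses a nearest-voxel lookup, the voxel branch takes an arbitrary stand-in function where a real model would apply 3D convolutions, and fusing the two branches by addition is an assumption made for illustration.

```python
from collections import defaultdict


def dense_voxel_cells(r):
    """Number of cells in a dense r x r x r grid (grows cubically with r)."""
    return r ** 3


def voxelize(points, feats, r):
    """Average scalar point features into an r x r x r grid.

    Returns a sparse dict {voxel index: mean feature} plus the voxel
    index of every point, so features can be mapped back later.
    """
    mins = [min(p[d] for p in points) for d in range(3)]
    maxs = [max(p[d] for p in points) for d in range(3)]
    # Use one scale for all axes so the shape is not distorted.
    span = max(maxs[d] - mins[d] for d in range(3)) or 1.0

    sums = defaultdict(float)
    counts = defaultdict(int)
    indices = []
    for p, f in zip(points, feats):
        v = tuple(min(int((p[d] - mins[d]) / span * r), r - 1) for d in range(3))
        indices.append(v)
        sums[v] += f
        counts[v] += 1
    grid = {v: sums[v] / counts[v] for v in sums}
    return grid, indices


def devoxelize(grid, indices):
    """Map voxel features back to points (nearest-voxel lookup, an assumption)."""
    return [grid[v] for v in indices]


def point_voxel_fuse(points, feats, r, point_fn, voxel_fn):
    """Combine a coarse voxel branch with a per-point branch.

    voxel_fn is a stand-in for convolutions on the regular grid;
    point_fn is a stand-in for a per-point feature transform.
    Fusion by addition is an illustrative assumption.
    """
    grid, indices = voxelize(points, feats, r)
    grid = voxel_fn(grid)
    coarse = devoxelize(grid, indices)
    fine = [point_fn(f) for f in feats]
    return [c + f for c, f in zip(coarse, fine)]
```

The data stays stored as points throughout, and the grid exists only inside the voxel branch, which is where the regular, cache-friendly access pattern comes from. `dense_voxel_cells` makes the cubic growth mentioned in the abstract concrete: doubling the resolution multiplies the number of cells in a dense grid by eight.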

Code & Models

Videos

[NeurIPS 2019 Spotlight] Point-Voxel CNN for Efficient 3D Deep Learning · YouTube

Taxonomy

Topics: 3D Shape Modeling and Analysis · Advanced Neural Network Applications · Robotics and Sensor-Based Localization
