TL;DR
PointAcc is a specialized hardware accelerator that significantly improves the speed and energy efficiency of deep learning on sparse 3D point clouds, enabling real-time applications on edge devices.
Contribution
The paper introduces PointAcc, a novel accelerator design that efficiently handles sparse point cloud operations, outperforming existing hardware in speed and energy consumption.
Findings
Achieves 3.7X speedup over RTX 2080Ti GPU
Reduces energy consumption by 22X compared to GPU
Rivals prior accelerators with 100X speedup and higher accuracy
Abstract
Deep learning on point clouds plays a vital role in a wide range of applications such as autonomous driving and AR/VR. These applications interact with people in real-time on edge devices and thus require low latency and low energy. Compared to projecting the point cloud to 2D space, directly processing the 3D point cloud yields higher accuracy and lower #MACs. However, the extremely sparse nature of point cloud poses challenges to hardware acceleration. For example, we need to explicitly determine the nonzero outputs and search for the nonzero neighbors (mapping operation), which is unsupported in existing accelerators. Furthermore, explicit gather and scatter of sparse features are required, resulting in large data movement overhead. In this paper, we comprehensively analyze the performance bottleneck of modern point cloud networks on CPU/GPU/TPU. To address the challenges, we then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Micro'21 PointAcc: Efficient Point Cloud Accelerator· youtube
Lightning Talk Micro' 21 PointAcc: Efficient Point Cloud Accelerator· youtube
