TL;DR
Spira is a GPU-optimized sparse convolution engine for 3D point cloud networks that exploits voxel properties to significantly improve efficiency and reduce overhead.
Contribution
It introduces a voxel-property-aware engine with a one-shot kernel map construction, packed-native processing, and flexible dataflow for enhanced performance.
Findings
Spira outperforms prior engines by 1.68x on average.
Achieves up to 3.04x speedup in end-to-end inference.
Provides a flexible, network-wide parallelization strategy.
Abstract
Sparse Convolution (SpC) powers 3D point cloud networks widely used in autonomous driving and augmented/virtual reality. SpC builds a kernel map that stores mappings between input voxel coordinates, output coordinates, and weight offsets, then uses this map to compute feature vectors for output coordinates. Our work identifies three key properties of voxel coordinates: they are integer-valued, bounded within a limited spatial range, and geometrically continuous, i.e., neighboring voxels on the same object surface are highly likely to exist at small spatial offsets from each other. Prior SpC engines do not fully exploit these properties and suffer from high pre-processing and post-processing overheads during kernel map construction. To address this, we design Spira, the first voxel-property-aware SpC engine for GPUs. Spira proposes (i) a high-performance one-shot search algorithm that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
