FuseFPS: Accelerating Farthest Point Sampling with Fusing KD-tree Construction for Point Clouds
Meng Han, Liang Wang, Limin Xiao, Hao Zhang, Chenhao Zhang, Xilong, Xie, Shuai Zheng, Jin Dong

TL;DR
FuseFPS introduces a hardware-efficient approach that fuses KD-tree construction with point sampling, significantly accelerating and reducing power consumption in real-time point cloud processing.
Contribution
It proposes a novel algorithm and hardware architecture that fuse KD-tree construction with sampling, optimizing performance and energy efficiency for bucket-based FPS.
Findings
Achieves 4.3× speedup over QuickFPS
Reduces power consumption by 6.1×
Effectively accelerates real-time point cloud processing
Abstract
Point cloud analytics has become a critical workload for embedded and mobile platforms across various applications. Farthest point sampling (FPS) is a fundamental and widely used kernel in point cloud processing. However, the heavy external memory access makes FPS a performance bottleneck for real-time point cloud processing. Although bucket-based farthest point sampling can significantly reduce unnecessary memory accesses during the point sampling stage, the KD-tree construction stage becomes the predominant contributor to execution time. In this paper, we present FuseFPS, an architecture and algorithm co-design for bucket-based farthest point sampling. We first propose a hardware-friendly sampling-driven KD-tree construction algorithm. The algorithm fuses the KD-tree construction stage into the point sampling stage, further reducing memory accesses. Then, we design an efficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Optical measurement and interference techniques · 3D Surveying and Cultural Heritage
