IC-FPS: Instance-Centroid Faster Point Sampling Module for 3D Point-base Object Detection
Hu Haotian, Wang Fanyi, Su Jingwen, Gao Shiyu, Zhang Zhiwang

TL;DR
This paper introduces IC-FPS, a novel point sampling module that significantly enhances the efficiency and speed of 3D object detection models on large-scale point clouds, enabling real-time performance.
Contribution
The paper proposes IC-FPS, a new sampling module combining background filtering and centroid-based sampling, which replaces the computationally expensive FPS in point-based models.
Findings
Speeds up inference by 3.8 times on Waymo dataset
Enables real-time detection in large-scale point clouds
Improves baseline model performance significantly
Abstract
3D object detection is one of the most important tasks in autonomous driving and robotics. Our research focuses on tackling low efficiency issue of point-based methods on large-scale point clouds. Existing point-based methods adopt farthest point sampling (FPS) strategy for downsampling, which is computationally expensive in terms of inference time and memory consumption when the number of point cloud increases. In order to improve efficiency, we propose a novel Instance-Centroid Faster Point Sampling Module (IC-FPS) , which effectively replaces the first Set Abstraction (SA) layer that is extremely tedious. IC-FPS module is comprised of two methods, local feature diffusion based background point filter (LFDBF) and Centroid-Instance Sampling Strategy (CISS). LFDBF is constructed to exclude most invalid background points, while CISS substitutes FPS strategy by fast sampling centroids and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Neural Network Applications · Remote Sensing and LiDAR Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Diffusion
