RBGNet: Ray-based Grouping for 3D Object Detection
Haiyang Wang, Shaoshuai Shi, Ze Yang, Rongyao Fang, Qi Qian, Hongsheng, Li, Bernt Schiele, Liwei Wang

TL;DR
RBGNet introduces a ray-based feature grouping and foreground biased sampling to improve 3D object detection accuracy from point clouds, leveraging surface geometry for better shape representation and detection performance.
Contribution
The paper proposes a novel ray-based grouping module and a foreground biased sampling strategy, enhancing 3D detection by utilizing surface geometry and focused point sampling.
Findings
Achieves state-of-the-art results on ScanNet V2 and SUN RGB-D datasets.
Significant performance improvements over previous methods.
Effective surface-aware feature aggregation enhances 3D box prediction.
Abstract
As a fundamental problem in computer vision, 3D object detection is experiencing rapid growth. To extract the point-wise features from the irregularly and sparsely distributed points, previous methods usually take a feature grouping module to aggregate the point features to an object candidate. However, these methods have not yet leveraged the surface geometry of foreground objects to enhance grouping and 3D box generation. In this paper, we propose the RBGNet framework, a voting-based 3D detector for accurate 3D object detection from point clouds. In order to learn better representations of object shape to enhance cluster features for predicting 3D boxes, we propose a ray-based feature grouping module, which aggregates the point-wise features on object surfaces using a group of determined rays uniformly emitted from cluster centers. Considering the fact that foreground points are more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Human Pose and Action Recognition
