Deformable PV-RCNN: Improving 3D Object Detection with Learned Deformations
Prarthana Bhattacharyya, Krzysztof Czarnecki

TL;DR
Deformable PV-RCNN introduces a novel proposal refinement module with learned deformations and context gating, significantly enhancing 3D object detection accuracy in point clouds, especially under varying object scales and densities.
Contribution
The paper proposes a deformable refinement module and context gating mechanism, enabling adaptive feature gathering for improved 3D detection performance.
Findings
Achieves state-of-the-art results on KITTI dataset.
Effectively handles scale variation and point-cloud density differences.
Improves detection accuracy through learned deformations.
Abstract
We present Deformable PV-RCNN, a high-performing point-cloud based 3D object detector. Currently, the proposal refinement methods used by the state-of-the-art two-stage detectors cannot adequately accommodate differing object scales, varying point-cloud density, part-deformation and clutter. We present a proposal refinement module inspired by 2D deformable convolution networks that can adaptively gather instance-specific features from locations where informative content exists. We also propose a simple context gating mechanism which allows the keypoints to select relevant context information for the refinement stage. We show state-of-the-art results on the KITTI dataset.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · 3D Surveying and Cultural Heritage · Human Pose and Action Recognition
MethodsConvolution · Deformable Convolution
