PAI3D: Painting Adaptive Instance-Prior for 3D Object Detection
Hao Liu, Zhuoran Xu, Dan Wang, Baofeng Zhang, Guan Wang, Bo Dong, Xin, Wen, and Xinyu Xu

TL;DR
PAI3D introduces a novel multi-modal fusion framework that leverages instance-level image semantics to significantly enhance 3D object detection accuracy in autonomous driving scenarios.
Contribution
The paper proposes PAI3D, a flexible method that fuses instance-level image semantics with point cloud data to improve 3D detection performance.
Findings
Achieves 71.4 mAP and 74.2 NDS on nuScenes dataset.
Instance-level semantics contribute most to performance gains.
Compatible with various segmentation models and 3D encoders.
Abstract
3D object detection is a critical task in autonomous driving. Recently multi-modal fusion-based 3D object detection methods, which combine the complementary advantages of LiDAR and camera, have shown great performance improvements over mono-modal methods. However, so far, no methods have attempted to utilize the instance-level contextual image semantics to guide the 3D object detection. In this paper, we propose a simple and effective Painting Adaptive Instance-prior for 3D object detection (PAI3D) to fuse instance-level image semantics flexibly with point cloud features. PAI3D is a multi-modal sequential instance-level fusion framework. It first extracts instance-level semantic information from images, the extracted information, including objects categorical label, point-to-object membership and object position, are then used to augment each LiDAR point in the subsequent 3D detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Robotics and Sensor-Based Localization
MethodsTest
