LDRFusion: A LiDAR-Dominant multimodal refinement framework for 3D object detection
Jijun Wang, Yan Wu, Yujian Mo, Junqiao Zhao, Jun Yan, Yinghao Hu

TL;DR
LDRFusion introduces a LiDAR-dominant two-stage refinement framework for 3D object detection that effectively combines LiDAR and pseudo point clouds, improving accuracy by addressing noise and leveraging modality reliability.
Contribution
The paper proposes a novel LiDAR-dominant multi-sensor fusion framework with hierarchical pseudo point residual encoding, enhancing 3D detection accuracy over existing methods.
Findings
Consistently outperforms previous methods on KITTI dataset
Effective in detecting challenging instances with pseudo point clouds
Improves local structure representation in pseudo points
Abstract
Existing LiDAR-Camera fusion methods have achieved strong results in 3D object detection. To address the sparsity of point clouds, previous approaches typically construct spatial pseudo point clouds via depth completion as auxiliary input and adopts a proposal-refinement framework to generate detection results. However, introducing pseudo points inevitably brings noise, potentially resulting in inaccurate predictions. Considering the differing roles and reliability levels of each modality, we propose LDRFusion, a novel Lidar-dominant two-stage refinement framework for multi-sensor fusion. The first stage soley relies on LiDAR to produce accurately localized proposals, followed by a second stage where pseudo point clouds are incorporated to detect challenging instances. The instance-level results from both stages are subsequently merged. To further enhance the representation of local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Image Processing and 3D Reconstruction · 3D Surveying and Cultural Heritage
