Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection
Yiran Yang, Xu Gao, Tong Wang, Xin Hao, Yifeng Shi, Xiao Tan, Xiaoqing, Ye, Jingdong Wang

TL;DR
This paper introduces a dynamic adjustment fusion method for LiDAR and camera data to improve 3D object detection in autonomous driving, addressing modality gaps through domain alignment and adaptive learning.
Contribution
It proposes a triphase domain aligning module and adaptive learning techniques to enhance multimodal fusion for 3D detection, which is a novel approach in this context.
Findings
Achieves competitive results on nuScenes dataset
Effectively reduces modality distribution gaps
Improves 3D detection accuracy
Abstract
Camera and LiDAR serve as informative sensors for accurate and robust autonomous driving systems. However, these sensors often exhibit heterogeneous natures, resulting in distributional modality gaps that present significant challenges for fusion. To address this, a robust fusion technique is crucial, particularly for enhancing 3D object detection. In this paper, we introduce a dynamic adjustment technology aimed at aligning modal distributions and learning effective modality representations to enhance the fusion process. Specifically, we propose a triphase domain aligning module. This module adjusts the feature distributions from both the camera and LiDAR, bringing them closer to the ground truth domain and minimizing differences. Additionally, we explore improved representation acquisition methods for dynamic fusion, which includes modal interaction and specialty enhancement. Finally,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrared Target Detection Methodologies · Industrial Vision Systems and Defect Detection · Robotics and Sensor-Based Localization
