Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble
Juhan Cha, Minseok Joo, Jihwan Park, Sanghyeok Lee, Injae, Kim, Hyunwoo J. Kim

TL;DR
MEFormer introduces a modality-agnostic decoding framework and proximity-based ensemble to enhance 3D object detection robustness and performance across various sensor conditions.
Contribution
It presents MOAD and PME modules that improve multi-modal fusion, reduce LiDAR over-reliance, and enhance robustness against sensor noise and environmental changes.
Findings
Achieves 73.9% NDS and 71.5% mAP on nuScenes.
Improves robustness in sensor malfunction scenarios.
Outperforms existing methods in multi-modal 3D detection.
Abstract
Recent advancements in 3D object detection have benefited from multi-modal information from the multi-view cameras and LiDAR sensors. However, the inherent disparities between the modalities pose substantial challenges. We observe that existing multi-modal 3D object detection methods heavily rely on the LiDAR sensor, treating the camera as an auxiliary modality for augmenting semantic details. This often leads to not only underutilization of camera data but also significant performance degradation in scenarios where LiDAR data is unavailable. Additionally, existing fusion methods overlook the detrimental impact of sensor noise induced by environmental changes, on detection performance. In this paper, we propose MEFormer to address the LiDAR over-reliance problem by harnessing critical information for 3D object detection from every available modality while concurrently safeguarding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Image and Object Detection Techniques · Industrial Vision Systems and Defect Detection
