Uncertainty-Encoded Multi-Modal Fusion for Robust Object Detection in Autonomous Driving
Yang Lou, Qun Song, Qian Xu, Rui Tan, Jianping Wang

TL;DR
This paper introduces UMoE, a novel multi-modal fusion method that explicitly incorporates sensor uncertainties, significantly improving object detection robustness in autonomous driving under adverse conditions.
Contribution
The paper proposes UMoE, a new uncertainty-aware fusion framework that effectively combines LiDAR and camera data by encoding uncertainties, enhancing robustness against challenging scenarios.
Findings
UMoE outperforms state-of-the-art detectors under extreme weather.
UMoE improves detection accuracy during adversarial attacks.
UMoE enhances robustness against sensor blinding.
Abstract
Multi-modal fusion has shown initial promising results for object detection of autonomous driving perception. However, many existing fusion schemes do not consider the quality of each fusion input and may suffer from adverse conditions on one or more sensors. While predictive uncertainty has been applied to characterize single-modal object detection performance at run time, incorporating uncertainties into the multi-modal fusion still lacks effective solutions due primarily to the uncertainty's cross-modal incomparability and distinct sensitivities to various adverse conditions. To fill this gap, this paper proposes Uncertainty-Encoded Mixture-of-Experts (UMoE) that explicitly incorporates single-modal uncertainties into LiDAR-camera fusion. UMoE uses individual expert network to process each sensor's detection result together with encoded uncertainty. Then, the expert networks' outputs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Infrared Target Detection Methodologies
