Diffusion-Based Restoration for Multi-Modal 3D Object Detection in Adverse Weather

Zhijian He; Feifei Liu; Yuwei Li; Zhanpeng Luo; Jintao Cheng; Xieyuanli Chen; Xiaoyu Tang

arXiv:2512.13107·cs.CV·December 19, 2025

Diffusion-Based Restoration for Multi-Modal 3D Object Detection in Adverse Weather

Zhijian He, Feifei Liu, Yuwei Li, Zhanpeng Luo, Jintao Cheng, Xieyuanli Chen, Xiaoyu Tang

PDF

Open Access

TL;DR

DiffFusion is a diffusion-based framework that enhances multi-modal 3D object detection robustness in adverse weather by restoring degraded data and aligning modalities, outperforming existing methods in challenging conditions.

Contribution

The paper introduces DiffFusion, a novel diffusion-based approach with adaptive fusion and alignment modules for robust multi-modal 3D detection in adverse weather.

Findings

01

Achieves state-of-the-art robustness under adverse weather conditions.

02

Maintains high performance on clean data.

03

Demonstrates strong generalization in zero-shot real-world tests.

Abstract

Multi-modal 3D object detection is important for reliable perception in robotics and autonomous driving. However, its effectiveness remains limited under adverse weather conditions due to weather-induced distortions and misalignment between different data modalities. In this work, we propose DiffFusion, a novel framework designed to enhance robustness in challenging weather through diffusion-based restoration and adaptive cross-modal fusion. Our key insight is that diffusion models possess strong capabilities for denoising and generating data that can adapt to various weather conditions. Building on this, DiffFusion introduces Diffusion-IR restoring images degraded by weather effects and Point Cloud Restoration (PCR) compensating for corrupted LiDAR data using image object cues. To tackle misalignments between two modalities, we develop Bidirectional Adaptive Fusion and Alignment Module…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Image Enhancement Techniques