WeatherOcc3D: VLM-Assisted Adverse Weather Aware 3D Semantic Occupancy Prediction
A. Enes Doruk, Abdelaziz Hussein, Hasan F. Ates

TL;DR
WeatherOcc3D introduces a VLM-assisted framework that adaptively fuses camera and LiDAR data for 3D semantic occupancy prediction under adverse weather conditions, guided by environmental cues.
Contribution
It leverages pre-trained CLIP embeddings and a gating strategy to dynamically re-weight sensor inputs based on weather-related environmental factors.
Findings
Achieves 26.3 mIoU on nuScenes with OccMamba architecture.
Outperforms traditional baselines significantly.
Effectively handles adverse weather variability in sensor fusion.
Abstract
While multi-modal 3D semantic occupancy prediction typically enhances robustness by fusing camera and LiDAR inputs, its effectiveness is fundamentally constrained by environmental variability. Specifically, camera sensors suffer from severe low-light degradation, while LiDAR sensors encounter significant backscatter noise during heavy precipitation. These adverse conditions create a modality trust problem, as static fusion strategies fail to adaptively re-weight inputs when a specific sensor becomes unreliable. To address this, we propose a VLM-assisted framework leveraging the pre-trained CLIP latent space to guide multi-sensor integration via linguistic environmental cues. We utilize a parameter-efficient adapter to align weather-specific text embeddings with sensor features, coupled with a gating strategy that decomposes environmental uncertainty into two factors: visibility and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
