WeatherOcc3D: VLM-Assisted Adverse Weather Aware 3D Semantic Occupancy Prediction

A. Enes Doruk; Abdelaziz Hussein; Hasan F. Ates

arXiv:2605.16127·cs.CV·May 18, 2026

WeatherOcc3D: VLM-Assisted Adverse Weather Aware 3D Semantic Occupancy Prediction

A. Enes Doruk, Abdelaziz Hussein, Hasan F. Ates

PDF

TL;DR

WeatherOcc3D introduces a VLM-assisted framework that adaptively fuses camera and LiDAR data for 3D semantic occupancy prediction under adverse weather conditions, guided by environmental cues.

Contribution

It leverages pre-trained CLIP embeddings and a gating strategy to dynamically re-weight sensor inputs based on weather-related environmental factors.

Findings

01

Achieves 26.3 mIoU on nuScenes with OccMamba architecture.

02

Outperforms traditional baselines significantly.

03

Effectively handles adverse weather variability in sensor fusion.

Abstract

While multi-modal 3D semantic occupancy prediction typically enhances robustness by fusing camera and LiDAR inputs, its effectiveness is fundamentally constrained by environmental variability. Specifically, camera sensors suffer from severe low-light degradation, while LiDAR sensors encounter significant backscatter noise during heavy precipitation. These adverse conditions create a modality trust problem, as static fusion strategies fail to adaptively re-weight inputs when a specific sensor becomes unreliable. To address this, we propose a VLM-assisted framework leveraging the pre-trained CLIP latent space to guide multi-sensor integration via linguistic environmental cues. We utilize a parameter-efficient adapter to align weather-specific text embeddings with sensor features, coupled with a gating strategy that decomposes environmental uncertainty into two factors: visibility and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.