3D Smoke Scene Reconstruction Guided by Vision Priors from Multimodal Large Language Models

Xinye Zheng; Fei Wang; Yiqi Nie; Kun Li; Junjie Chen; Jiaqi Zhao; Yanyan Wei; Zhiliang Wu

arXiv:2604.05687·cs.CV·April 23, 2026

3D Smoke Scene Reconstruction Guided by Vision Priors from Multimodal Large Language Models

Xinye Zheng, Fei Wang, Yiqi Nie, Kun Li, Junjie Chen, Jiaqi Zhao, Yanyan Wei, Zhiliang Wu

PDF

TL;DR

This paper introduces a novel framework for 3D smoke scene reconstruction that combines visual priors with efficient 3D modeling, improving robustness and clarity in challenging smoke environments.

Contribution

It integrates Nano-Banana-Pro for enhanced image clarity and develops Smoke-GS, a medium-aware 3D Gaussian Splatting method with a view-dependent branch for better smoke scene reconstruction.

Findings

01

Effective in generating consistent novel views in smoke environments.

02

Preserves rendering efficiency of Gaussian Splatting.

03

Improves robustness to smoke-induced degradation.

Abstract

Reconstructing 3D scenes from smoke-degraded multi-view images is particularly difficult because smoke introduces strong scattering effects, view-dependent appearance changes, and severe degradation of cross-view consistency. To address these issues, we propose a framework that integrates visual priors with efficient 3D scene modeling. We employ Nano-Banana-Pro to enhance smoke-degraded images and provide clearer visual observations for reconstruction and develop Smoke-GS, a medium-aware 3D Gaussian Splatting framework for smoke scene reconstruction and restoration-oriented novel view synthesis. Smoke-GS models the scene using explicit 3D Gaussians and introduces a lightweight view-dependent medium branch to capture direction-dependent appearance variations caused by smoke. Our method preserves the rendering efficiency of 3D Gaussian Splatting while improving robustness to smoke-induced…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.