Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting

Jiangnan Ye; Jiedong Zhuang; Lianrui Mu; Wenjie Zheng; Jiaqi Hu; Xingze Zou; Jing Wang; Haoji Hu

arXiv:2511.13684 · cs.CV · November 18, 2025

TL;DR

GS-Light is a training-free, multi-view, text-guided relighting pipeline for 3D scenes that combines large vision-language models, geometry estimators, and diffusion models to produce high-fidelity relit scenes reflecting user prompts.

Contribution

It introduces a novel training-free extension of diffusion models for multi-view scene relighting guided by textual prompts, integrating lighting priors and view-geometry constraints.

Findings

01. Outperforms state-of-the-art baselines in multi-view consistency and image quality.

02. Produces high-fidelity, artistically relit 3D scenes from user prompts.

03. Demonstrates effectiveness on both indoor and outdoor scenes.

Abstract

We introduce GS-Light, an efficient, textual position-aware pipeline for text-guided relighting of 3D scenes represented via Gaussian Splatting (3DGS). GS-Light implements a training-free extension of a single-input diffusion model to handle multi-view inputs. Given a user prompt that may specify lighting direction, color, intensity, or reference objects, we employ a large vision-language model (LVLM) to parse the prompt into lighting priors. Using off-the-shelf estimators for geometry and semantics (depth, surface normals, and semantic segmentation), we fuse these lighting priors with view-geometry constraints to compute illumination maps and generate initial latent codes for each view. These meticulously derived initial latents guide the diffusion model to generate relighting outputs that more accurately reflect user expectations, especially in terms of lighting direction. By feeding…
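
To make the idea of an illumination map concrete, the sketch below shades a per-pixel normal map with a single directional light. It is an illustration only, not the method of the paper: the abstract does not specify how illumination maps are computed, so the Lambertian model, the `illumination_map` function, and the `intensity` and `ambient` parameters are all assumptions. In the described pipeline the light direction would come from the LVLM-parsed prompt and the normals from an off-the-shelf estimator; here both are hard-coded toy values.

```python
import numpy as np


def illumination_map(normals, light_dir, intensity=1.0, ambient=0.1):
    """Hypothetical helper: Lambertian shading of an (H, W, 3) normal map.

    Returns an (H, W) map in [0, 1] computed as ambient + intensity * max(n . l, 0).
    This is an assumed shading model, not the formulation used by GS-Light.
    """
    # Normalize the light direction (vector pointing from the surface toward the light).
    l = np.asarray(light_dir, dtype=np.float64)
    l = l / np.linalg.norm(l)

    # Normalize each per-pixel normal, guarding against zero-length vectors.
    norms = np.linalg.norm(normals, axis=-1, keepdims=True)
    n = normals / np.clip(norms, 1e-8, None)

    # Cosine term, clamped so surfaces facing away from the light get no direct light.
    shading = np.clip(n @ l, 0.0, None)
    return np.clip(ambient + intensity * shading, 0.0, 1.0)


# Toy 2x2 normal map: every pixel faces the camera (+z) except one that faces +x.
normals = np.zeros((2, 2, 3))
normals[..., 2] = 1.0
normals[0, 0] = [1.0, 0.0, 0.0]

# Assumed parse of a prompt such as "light coming from the right": direction +x.
light_from_right = [1.0, 0.0, 0.0]
illum = illumination_map(normals, light_from_right)
```

Under these assumptions, the pixel facing the light saturates while the camera-facing pixels receive only the ambient term. A map of this kind could then be used to bias the initial latent for a view, though how GS-Light performs that step is not stated in the text above.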

Taxonomy

Topics: Computer Graphics and Visualization Techniques · Interactive and Immersive Displays · 3D Shape Modeling and Analysis