UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback
Ropeway Liu, Hangjie Yuan, Bo Dong, Jiazheng Xing, Jinwang Wang, Rui Zhao, Yan Xing, Weihua Chen, Fan Wang

TL;DR
UniLumos is a unified framework for fast, physically plausible image and video relighting that incorporates scene geometry feedback and structured attribute control, outperforming previous methods in quality and speed.
Contribution
The paper introduces UniLumos, a novel relighting method that combines RGB-space geometry feedback with path consistency learning for enhanced realism and efficiency.
Findings
Achieves state-of-the-art relighting quality with improved physical consistency.
Provides a 20x speedup in relighting tasks for images and videos.
Introduces LumosBench for evaluating lighting controllability.
Abstract
Relighting is a crucial task with both practical demand and artistic value, and recent diffusion models have shown strong potential by enabling rich and controllable lighting effects. However, as they are typically optimized in semantic latent space, where proximity does not guarantee physical correctness in visual space, they often produce unrealistic results, such as overexposed highlights, misaligned shadows, and incorrect occlusions. We address this with UniLumos, a unified relighting framework for both images and videos that brings RGB-space geometry feedback into a flow matching backbone. By supervising the model with depth and normal maps extracted from its outputs, we explicitly align lighting effects with the scene structure, enhancing physical plausibility. Nevertheless, this feedback requires high-quality outputs for supervision in visual space, making standard multi-step…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image Enhancement Techniques · Computer Graphics and Visualization Techniques
