CamLit: Unified Video Diffusion with Explicit Camera and Lighting Control
Zhiyi Kuang, Chengan He, Egor Zakharov, Yuxuan Xue, Shunsuke Saito, Olivier Maury, Timur Bagautdinov, Youyi Zheng, Giljoo Nam

TL;DR
CamLit introduces a unified video diffusion model capable of synthesizing temporally coherent videos with novel viewpoints and relighting from a single image, enabling high-quality control over camera and lighting conditions.
Contribution
This work is the first to unify view synthesis and relighting in a single diffusion model for videos, providing integrated control and high-quality outputs.
Findings
Achieves high-fidelity novel view synthesis and relighting
Produces temporally coherent and spatially aligned videos
Maintains competitive performance with state-of-the-art methods
Abstract
We present CamLit, the first unified video diffusion model that jointly performs novel view synthesis (NVS) and relighting from a single input image. Given one reference image, a user-defined camera trajectory, and an environment map, CamLit synthesizes a video of the scene from new viewpoints under the specified illumination. Within a single generative process, our model produces temporally coherent and spatially aligned outputs, including relit novel-view frames and corresponding albedo frames, enabling high-quality control of both camera pose and lighting. Qualitative and quantitative experiments demonstrate that CamLit achieves high-fidelity outputs on par with state-of-the-art methods in both novel view synthesis and relighting, without sacrificing visual quality in either task. We show that a single generative model can effectively integrate camera and lighting control,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Enhancement Techniques · Generative Adversarial Networks and Image Synthesis
