Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering
Jieying Chen, Jeffrey Hu, Joan Lasenby, Ayush Tewari

TL;DR
This paper introduces SRENDER, a method that generates static scene videos efficiently by combining sparse keyframe diffusion with 3D rendering, achieving over 40x speedup while maintaining quality.
Contribution
The paper proposes a novel approach that combines sparse diffusion-based keyframe generation with 3D reconstruction to produce videos efficiently and adaptively for static scenes.
Findings
Over 40 times faster video generation compared to baseline.
Maintains high visual fidelity and temporal stability.
Adaptive keyframe prediction improves efficiency for different camera trajectories.
Abstract
Modern video generative models based on diffusion models can produce very realistic clips, but they are computationally inefficient, often requiring minutes of GPU time for just a few seconds of video. This inefficiency poses a critical barrier to deploying generative video in applications that require real-time interactions, such as embodied AI and VR/AR. This paper explores a new strategy for camera-conditioned video generation of static scenes: using diffusion-based generative models to generate a sparse set of keyframes, and then synthesizing the full video through 3D reconstruction and rendering. By lifting keyframes into a 3D representation and rendering intermediate views, our approach amortizes the generation cost across hundreds of frames while enforcing geometric consistency. We further introduce a model that predicts the optimal number of keyframes for a given camera…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Human Motion and Animation · 3D Shape Modeling and Analysis
