Cinematographic Camera Diffusion Model
Hongda Jiang, Xi Wang, Marc Christie, Libin Liu, Baoquan Chen

TL;DR
This paper introduces a transformer-based diffusion model for generating diverse, high-quality virtual camera trajectories conditioned on textual descriptions, with enhanced control features like keyframing and motion blending.
Contribution
It presents a novel text-to-camera motion generation method using diffusion models, integrating keyframing and latent interpolation for improved control and diversity.
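As a rough illustration of what latent interpolation for motion blending could look like, the sketch below blends two latent vectors with spherical interpolation. This summary does not say how the authors interpolate, so the choice of spherical interpolation, the function name `slerp`, and the flat latent vectors are all assumptions made for illustration, not the paper's method.

```python
import numpy as np

def slerp(z0, z1, t):
    """Spherically interpolate between latents z0 and z1 at fraction t in [0, 1].

    Hypothetical helper: the latent shape and blending rule are assumptions,
    not taken from the paper.
    """
    z0 = np.asarray(z0, dtype=float)
    z1 = np.asarray(z1, dtype=float)
    # Angle between the two latents, computed on the flattened vectors.
    cos_omega = np.dot(z0.ravel(), z1.ravel()) / (
        np.linalg.norm(z0) * np.linalg.norm(z1)
    )
    omega = np.arccos(np.clip(cos_omega, -1.0, 1.0))
    if np.isclose(np.sin(omega), 0.0):
        # Nearly parallel (or antiparallel) latents: fall back to linear blending.
        return (1.0 - t) * z0 + t * z1
    return (np.sin((1.0 - t) * omega) * z0 + np.sin(t * omega) * z1) / np.sin(omega)

# Blend two stand-in latents; in a real pipeline each would be decoded or
# denoised into a camera trajectory.
rng = np.random.default_rng(0)
z_a = rng.standard_normal(64)
z_b = rng.standard_normal(64)
z_mid = slerp(z_a, z_b, 0.5)
```

Spherical rather than linear blending is a common choice for Gaussian latents because it keeps the blended vector at a norm comparable to the endpoints; whether that matters for camera trajectories is not something this summary establishes.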
Findings
Generated camera trajectories are diverse and qualitatively convincing.
The model effectively incorporates high-level textual descriptions.
Professional artists provided positive feedback on the system's usability.
Abstract
Designing effective camera trajectories in virtual 3D environments is a challenging task even for experienced animators. Despite an elaborate film grammar, forged through years of experience, that enables the specification of camera motions through cinematographic properties (framing, shot sizes, angles, motions), there are endless possibilities in deciding how to place and move cameras with characters. Dealing with these possibilities is part of the complexity of the problem. While numerous techniques have been proposed in the literature (optimization-based solving, encoding of empirical rules, learning from real examples, ...), the results either lack variety or ease of control. In this paper, we propose a cinematographic camera diffusion model using a transformer-based architecture to handle temporality and exploit the stochasticity of diffusion models to generate diverse and…
Taxonomy
Topics: Digital Media and Visual Art · Infrared Target Detection Methodologies · Satellite Image Processing and Photogrammetry
