CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation
Xinran Wang, Songyu Xu, Xiangxuan Shan, Yuxuan Zhang, Muxi Diao, Xueyan Duan, Yanhua Huang, Kongming Liang, Zhanyu Ma

TL;DR
CineTechBench is a new benchmark with expert annotations designed to evaluate and improve multimodal models' ability to understand and generate cinematographic techniques in film, covering key visual aspects with over 600 images and 120 clips.
Contribution
It introduces a comprehensive, expert-annotated benchmark for cinematography, enabling systematic evaluation of models' understanding and generation of film techniques.
Findings
Current models show limitations in understanding cinematographic nuances.
The benchmark reveals gaps in models' ability to generate cinema-quality camera movements.
Insights guide future research in film-related AI applications.
Abstract
Cinematography is a cornerstone of film production and appreciation, shaping mood, emotion, and narrative through visual elements such as camera movement, shot composition, and lighting. Despite recent progress in multimodal large language models (MLLMs) and video generation models, the capacity of current models to grasp and reproduce cinematographic techniques remains largely uncharted, hindered by the scarcity of expert-annotated data. To bridge this gap, we present CineTechBench, a pioneering benchmark founded on precise, manual annotation by seasoned cinematography experts across key cinematography dimensions. Our benchmark covers seven essential aspects-shot scale, shot angle, composition, camera movement, lighting, color, and focal length-and includes over 600 annotated movie images and 120 movie clips with clear cinematographic techniques. For the understanding task, we design…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsCinema and Media Studies · Multimedia Communication and Technology · Data Visualization and Analytics
