MovieCuts: A New Dataset and Benchmark for Cut Type Recognition
Alejandro Pardo, Fabian Caba Heilbron, Juan Le\'on Alc\'azar, Ali, Thabet, Bernard Ghanem

TL;DR
This paper introduces MovieCuts, a large dataset for recognizing different types of video cuts in movies, and benchmarks various models, revealing the task's complexity and potential for advancing video editing technology.
Contribution
The paper presents a new dataset, MovieCuts, for cut type recognition, and provides baseline benchmarks demonstrating the challenge and importance of this task in video analysis.
Findings
Best model achieves 47.7% mAP, indicating difficulty of the task.
Multi-modal approaches improve cut recognition performance.
The dataset enables future research in automated video editing.
Abstract
Understanding movies and their structural patterns is a crucial task in decoding the craft of video editing. While previous works have developed tools for general analysis, such as detecting characters or recognizing cinematography properties at the shot level, less effort has been devoted to understanding the most basic video edit, the Cut. This paper introduces the Cut type recognition task, which requires modeling multi-modal information. To ignite research in this new task, we construct a large-scale dataset called MovieCuts, which contains 173,967 video clips labeled with ten cut types defined by professionals in the movie industry. We benchmark a set of audio-visual approaches, including some dealing with the problem's multi-modal nature. Our best model achieves 47.7% mAP, which suggests that the task is challenging and that attaining highly accurate Cut type recognition is an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Digital Media Forensic Detection · Cinema and Media Studies
