AutoTransition: Learning to Recommend Video Transition Effects
Yaojie Shen, Libo Zhang, Kai Xu, Xiaojie Jin

TL;DR
AutoTransition introduces a novel multi-modal framework for automatically recommending video transition effects, significantly aiding non-professionals and improving editing efficiency by 300 times.
Contribution
This work is the first to formulate automatic video transition recommendation as a multi-modal retrieval task using vision and audio cues, with a new dataset and a transformer-based model.
Findings
Achieves comparable scores to professional editors in user studies.
Improves video editing efficiency by 300 times.
Demonstrates effectiveness through quantitative and qualitative experiments.
Abstract
Video transition effects are widely used in video editing to connect shots for creating cohesive and visually appealing videos. However, it is challenging for non-professionals to choose best transitions due to the lack of cinematographic knowledge and design skills. In this paper, we present the premier work on performing automatic video transitions recommendation (VTR): given a sequence of raw video shots and companion audio, recommend video transitions for each pair of neighboring shots. To solve this task, we collect a large-scale video transition dataset using publicly available video templates on editing softwares. Then we formulate VTR as a multi-modal retrieval problem from vision/audio to video transitions and propose a novel multi-modal matching framework which consists of two parts. First we learn the embedding of video transitions through a video transition classification…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Cinema and Media Studies · Video Coding and Compression Technologies
