AutoMatch: A Large-scale Audio Beat Matching Benchmark for Boosting Deep Learning Assistant Video Editing
Sen Pei, Jingya Yu, Qi Chen, Wozhou He

TL;DR
AutoMatch introduces a large-scale dataset and a novel model for audio beat matching to assist video editing, aiming to reduce manual effort and enhance creative workflows in short video production.
Contribution
This paper presents the first large-scale dataset and a new model for audio beat matching, addressing a practical challenge in video editing with innovative evaluation and training techniques.
Findings
AutoMatch dataset contains over 87,000 annotated music samples.
BeatX model effectively predicts transition timestamps for video editing.
Proposed label scope improves training by handling data imbalance.
Abstract
The explosion of short videos has dramatically reshaped the manners people socialize, yielding a new trend for daily sharing and access to the latest information. These rich video resources, on the one hand, benefited from the popularization of portable devices with cameras, but on the other, they can not be independent of the valuable editing work contributed by numerous video creators. In this paper, we investigate a novel and practical problem, namely audio beat matching (ABM), which aims to recommend the proper transition time stamps based on the background music. This technique helps to ease the labor-intensive work during video editing, saving energy for creators so that they can focus more on the creativity of video content. We formally define the ABM problem and its evaluation protocol. Meanwhile, a large-scale audio dataset, i.e., the AutoMatch with over 87k finely annotated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Video Analysis and Summarization · Music Technology and Sound Studies
