A Study of Annotation and Alignment Accuracy for Performance Comparison in Complex Orchestral Music
Thassilo Gadermaier, Gerhard Widmer

TL;DR
This paper investigates the accuracy of human annotations and audio features for aligning complex orchestral music recordings, providing insights into annotation consistency and optimal alignment methods for computational musicology.
Contribution
It offers a systematic analysis of human annotation variability and evaluates various audio features for improving automatic alignment accuracy in orchestral music recordings.
Findings
Quantified timing deviations among human annotators.
Provided a ground truth for alignment accuracy evaluation.
Identified effective audio features for precise audio-to-audio alignment.
Abstract
Quantitative analysis of commonalities and differences between recorded music performances is an increasingly common task in computational musicology. A typical scenario involves manual annotation of different recordings of the same piece along the time dimension, for comparative analysis of, e.g., the musical tempo, or for mapping other performance-related information between performances. This can be done by manually annotating one reference performance, and then automatically synchronizing other performances, using audio-to-audio alignment algorithms. In this paper we address several questions related to those tasks. First, we analyze different annotations of the same musical piece, quantifying timing deviations between the respective human annotators. A statistical evaluation of the marker time stamps will provide (a) an estimate of the expected timing precision of human annotations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech and Audio Processing
