TL;DR
This paper introduces efficient real-time algorithms for audio-to-score alignment that handle errors, tempo changes, and arbitrary repeats/skips in music performances, enabling practical automatic accompaniment systems.
Contribution
It proposes two hidden Markov models and algorithms that efficiently align monophonic performances with complex errors and repeats, suitable for scores of practical length.
Findings
Real-time alignment of scores with ~10,000 notes achieved on a laptop.
Tracking within 0.7 seconds after repeats/skips in clarinet performances.
Algorithms extend previous methods to handle errors and arbitrary repeats/skips efficiently.
Abstract
This paper discusses real-time alignment of audio signals of music performance to the corresponding score (a.k.a. score following) which can handle tempo changes, errors and arbitrary repeats and/or skips (repeats/skips) in performances. This type of score following is particularly useful in automatic accompaniment for practices and rehearsals, where errors and repeats/skips are often made. Simple extensions of the algorithms previously proposed in the literature are not applicable in these situations for scores of practical length due to the problem of large computational complexity. To cope with this problem, we present two hidden Markov models of monophonic performance with errors and arbitrary repeats/skips, and derive efficient score-following algorithms with an assumption that the prior probability distributions of score positions before and after repeats/skips are independent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
