Musical Rhythm Transcription Based on Bayesian Piece-Specific Score Models Capturing Repetitions
Eita Nakamura, Kazuyoshi Yoshii

TL;DR
This paper introduces Bayesian Markov models that capture global repetitive structures in musical scores, improving rhythm transcription accuracy by inferring piece-specific models from MIDI data, especially for unseen scores.
Contribution
The paper presents a novel Bayesian approach to model global repetitions in musical scores, enabling piece-specific rhythm transcription without prior score annotations.
Findings
Bayesian models improved transcription accuracy for most tested types.
Explicit modeling of approximate repetitions enhanced performance.
Effective data representation maximized accuracy and efficiency.
Abstract
Most work on musical score models (a.k.a. musical language models) for music transcription has focused on describing the local sequential dependence of notes in musical scores and failed to capture their global repetitive structure, which can be a useful guide for transcribing music. Focusing on rhythm, we formulate several classes of Bayesian Markov models of musical scores that describe repetitions indirectly using the sparse transition probabilities of notes or note patterns. This enables us to construct piece-specific models for unseen scores with an unfixed repetitive structure and to derive tractable inference algorithms. Moreover, to describe approximate repetitions, we explicitly incorporate a process for modifying the repeated notes/note patterns. We apply these models as prior musical score models for rhythm transcription, where piece-specific score models are inferred from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech Recognition and Synthesis
