Reconstructing the Charlie Parker Omnibook using an audio-to-score automatic transcription pipeline
Xavier Riley, Simon Dixon

TL;DR
This paper presents a new audio-to-score transcription pipeline specifically designed for jazz saxophone recordings, aiming to automatically reconstruct Charlie Parker's solos from audio with high accuracy.
Contribution
It introduces a novel modular transcription pipeline combining source separation, MIDI transcription, and score reconstruction, along with an enhanced dataset for benchmarking.
Findings
Achieved improved transcription accuracy on jazz saxophone recordings.
Provided a new benchmark dataset with aligned score-audio pairs.
Demonstrated potential for automatic jazz transcription to aid music education.
Abstract
The Charlie Parker Omnibook is a cornerstone of jazz music education, described by pianist Ethan Iverson as "the most important jazz education text ever published". In this work we propose a new transcription pipeline and explore the extent to which state of the art music technology is able to reconstruct these scores directly from the audio without human intervention. Our pipeline includes: a newly trained source separation model for saxophone, a new MIDI transcription model for solo saxophone and an adaptation of an existing MIDI-to-score method for monophonic instruments. To assess this pipeline we also provide an enhanced dataset of Charlie Parker transcriptions as score-audio pairs with accurate MIDI alignments and downbeat annotations. This represents a challenging new benchmark for automatic audio-to-score transcription that we hope will advance research into areas beyond…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Video Analysis and Summarization
