An End-to-End Neural Network for Polyphonic Piano Music Transcription
Siddharth Sigtia, Emmanouil Benetos, Simon Dixon

TL;DR
This paper introduces a neural network-based system for transcribing polyphonic piano music, combining acoustic and language models with efficient inference to improve accuracy and enable real-time performance.
Contribution
It presents a novel end-to-end neural network architecture for polyphonic music transcription that integrates acoustic and music language models with an efficient beam search.
Findings
Convolutional neural network acoustic models outperform other architectures.
Music language models improve transcription accuracy.
Proposed beam search variant enables real-time processing.
Abstract
We present a supervised neural network model for polyphonic piano music transcription. The architecture of the proposed model is analogous to speech recognition systems and comprises an acoustic model and a music language model. The acoustic model is a neural network used for estimating the probabilities of pitches in a frame of audio. The language model is a recurrent neural network that models the correlations between pitch combinations over time. The proposed model is general and can be used to transcribe polyphonic music without imposing any constraints on the polyphony. The acoustic and language model predictions are combined using a probabilistic graphical model. Inference over the output variables is performed using the beam search algorithm. We perform two sets of experiments. We investigate various neural network architectures for the acoustic models and also investigate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies
