Machine Learning Techniques in Automatic Music Transcription: A Systematic Survey
Fatemeh Jamshidi, Gary Pike, Amit Das, Richard Chapman

TL;DR
This paper systematically reviews machine learning methods in automatic music transcription, highlighting progress, challenges, and future directions to improve accuracy in converting complex audio signals into musical notation.
Contribution
It provides a comprehensive analysis of existing ML techniques in AMT, identifies current limitations, and suggests future research pathways for fully automated, accurate music transcription systems.
Findings
Current systems lag behind human transcription accuracy
Deep learning models have advanced AMT but still face challenges with complex harmonies
Future research should focus on reducing manual intervention and improving model robustness
Abstract
In the domain of Music Information Retrieval (MIR), Automatic Music Transcription (AMT) emerges as a central challenge, aiming to convert audio signals into symbolic notations like musical notes or sheet music. This systematic review accentuates the pivotal role of AMT in music signal analysis, emphasizing its importance due to the intricate and overlapping spectral structure of musical harmonies. Through a thorough examination of existing machine learning techniques utilized in AMT, we explore the progress and constraints of current models and methodologies. Despite notable advancements, AMT systems have yet to match the accuracy of human experts, largely due to the complexities of musical harmonies and the need for nuanced interpretation. This review critically evaluates both fully automatic and semi-automatic AMT systems, emphasizing the importance of minimal user intervention and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Diverse Musicological Studies
