Investigation on the use of Hidden-Markov Models in automatic transcription of music
D. Cazau, G. Nuel

TL;DR
This paper explores the application of Hidden Markov Models in automatic music transcription, integrating them with Probabilistic Latent Component Analysis to improve note segmentation and harmonic transition modeling across different instruments.
Contribution
It introduces novel HMM-based modules within a PLCA framework for enhanced note segmentation and harmonic transition modeling in AMT systems.
Findings
HMM integration improves transcription accuracy.
Performance varies with instrument repertoire.
Higher-order HMMs better model note durations.
Abstract
Hidden Markov Models (HMMs) are a ubiquitous tool to model time series data, and have been widely used in two main tasks of Automatic Music Transcription (AMT): note segmentation, i.e. identifying the played notes after a multi-pitch estimation, and sequential post-processing, i.e. correcting note segmentation using training data. In this paper, we employ the multi-pitch estimation method called Probabilistic Latent Component Analysis (PLCA), and develop AMT systems by integrating different HMM-based modules in this framework. For note segmentation, we use two different twostate on/o? HMMs, including a higher-order one for duration modeling. For sequential post-processing, we focused on a musicological modeling of polyphonic harmonic transitions, using a first- and second-order HMMs whose states are defined through candidate note mixtures. These different PLCA plus HMM systems have been…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech and Audio Processing
