Investigation on the use of Hidden-Markov Models in automatic   transcription of music

D. Cazau; G. Nuel

arXiv:1704.03711·stat.ML·April 13, 2017·1 cites

Investigation on the use of Hidden-Markov Models in automatic transcription of music

D. Cazau, G. Nuel

PDF

Open Access

TL;DR

This paper explores the application of Hidden Markov Models in automatic music transcription, integrating them with Probabilistic Latent Component Analysis to improve note segmentation and harmonic transition modeling across different instruments.

Contribution

It introduces novel HMM-based modules within a PLCA framework for enhanced note segmentation and harmonic transition modeling in AMT systems.

Findings

01

HMM integration improves transcription accuracy.

02

Performance varies with instrument repertoire.

03

Higher-order HMMs better model note durations.

Abstract

Hidden Markov Models (HMMs) are a ubiquitous tool to model time series data, and have been widely used in two main tasks of Automatic Music Transcription (AMT): note segmentation, i.e. identifying the played notes after a multi-pitch estimation, and sequential post-processing, i.e. correcting note segmentation using training data. In this paper, we employ the multi-pitch estimation method called Probabilistic Latent Component Analysis (PLCA), and develop AMT systems by integrating different HMM-based modules in this framework. For note segmentation, we use two different twostate on/o? HMMs, including a higher-order one for duration modeling. For sequential post-processing, we focused on a musicological modeling of polyphonic harmonic transitions, using a first- and second-order HMMs whose states are defined through candidate note mixtures. These different PLCA plus HMM systems have been…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech and Audio Processing