Calibration of a two-state pitch-wise HMM method for note segmentation   in Automatic Music Transcription systems

Dorian Cazau; Yuancheng Wang; Olivier Adam; Qiao Wang and; Gr\'egory Nuel

arXiv:1704.08729·stat.ME·May 1, 2017·2 cites

Calibration of a two-state pitch-wise HMM method for note segmentation in Automatic Music Transcription systems

Dorian Cazau, Yuancheng Wang, Olivier Adam, Qiao Wang and, Gr\'egory Nuel

PDF

Open Access

TL;DR

This paper introduces a two-state Hidden Markov Model for note segmentation in automatic music transcription, demonstrating that HMM-based soft thresholding with optimized parameters improves transcription accuracy across various scenarios.

Contribution

The study presents a novel parametrization of the HMM sigmoid function and compares fixed thresholding with HMM-based soft thresholding for note segmentation.

Findings

01

HMM soft thresholding outperforms fixed thresholding in transcription accuracy.

02

Optimized parameters {alpha, beta} significantly enhance segmentation performance.

03

Evaluation follows MIREX standards using the MAPS dataset.

Abstract

Many methods for automatic music transcription involves a multi-pitch estimation method that estimates an activity score for each pitch. A second processing step, called note segmentation, has to be performed for each pitch in order to identify the time intervals when the notes are played. In this study, a pitch-wise two-state on/off firstorder Hidden Markov Model (HMM) is developed for note segmentation. A complete parametrization of the HMM sigmoid function is proposed, based on its original regression formulation, including a parameter alpha of slope smoothing and beta? of thresholding contrast. A comparative evaluation of different note segmentation strategies was performed, differentiated according to whether they use a fixed threshold, called "Hard Thresholding" (HT), or a HMM-based thresholding method, called "Soft Thresholding" (ST). This evaluation was done following MIREX…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech and Audio Processing