Calibration of a two-state pitch-wise HMM method for note segmentation in Automatic Music Transcription systems
Dorian Cazau, Yuancheng Wang, Olivier Adam, Qiao Wang and, Gr\'egory Nuel

TL;DR
This paper introduces a two-state Hidden Markov Model for note segmentation in automatic music transcription, demonstrating that HMM-based soft thresholding with optimized parameters improves transcription accuracy across various scenarios.
Contribution
The study presents a novel parametrization of the HMM sigmoid function and compares fixed thresholding with HMM-based soft thresholding for note segmentation.
Findings
HMM soft thresholding outperforms fixed thresholding in transcription accuracy.
Optimized parameters {alpha, beta} significantly enhance segmentation performance.
Evaluation follows MIREX standards using the MAPS dataset.
Abstract
Many methods for automatic music transcription involves a multi-pitch estimation method that estimates an activity score for each pitch. A second processing step, called note segmentation, has to be performed for each pitch in order to identify the time intervals when the notes are played. In this study, a pitch-wise two-state on/off firstorder Hidden Markov Model (HMM) is developed for note segmentation. A complete parametrization of the HMM sigmoid function is proposed, based on its original regression formulation, including a parameter alpha of slope smoothing and beta? of thresholding contrast. A comparative evaluation of different note segmentation strategies was performed, differentiated according to whether they use a fixed threshold, called "Hard Thresholding" (HT), or a HMM-based thresholding method, called "Soft Thresholding" (ST). This evaluation was done following MIREX…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech and Audio Processing
