Uncertainty Quantification in Melody Estimation using Histogram Representation
Kavya Ranjan Saxena, Vipul Arora

TL;DR
This paper introduces a regression-based approach using histogram representations for melody estimation uncertainty quantification, outperforming classification methods in reliability and accuracy.
Contribution
It reformulates melody estimation as a regression problem and proposes three novel methods, including a Bayesian approach, for better uncertainty estimation from histogram data.
Findings
Regression-based methods provide more reliable uncertainty estimates.
The Bayesian method outperforms other approaches in accuracy.
Proposed methods effectively distinguish correct and incorrect predictions.
Abstract
Confidence estimation can improve the reliability of melody estimation by indicating which predictions are likely incorrect. The existing classification-based approach provides confidence for predicted pitch classes but fails to capture the magnitude of deviation from the ground truth. To address this limitation, we reformulate melody estimation as a regression problem and propose a novel approach to estimate uncertainty directly from the histogram representation of the pitch values, which correlates well with the deviation between the prediction and the ground-truth. We design three methods to model pitch on a continuous support range of histogram, which introduces the challenge of handling the discontinuity of unvoiced from the voiced pitch values. The first two methods address the abrupt discontinuity by mapping the pitch values to a continuous range, while the third adopts a fully…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Neuroscience and Music Perception
