A Comprehensive Trainable Error Model for Sung Music Queries

W. P. Birmingham; C. J. Meek

arXiv:1107.0054·cs.AI·July 4, 2011

A Comprehensive Trainable Error Model for Sung Music Queries

W. P. Birmingham, C. J. Meek

PDF

TL;DR

This paper introduces a trainable error model based on HMMs for accurately measuring similarity between sung queries and musical targets, improving query-by-humming retrieval systems.

Contribution

It presents a comprehensive, trainable error model that captures various types of errors in sung queries, enhancing music information retrieval accuracy.

Findings

01

Model effectively captures diverse error types

02

Simulations show high discriminatory power

03

Real query tests demonstrate practical relevance

Abstract

We propose a model for errors in sung queries, a variant of the hidden Markov model (HMM). This is a solution to the problem of identifying the degree of similarity between a (typically error-laden) sung query and a potential target in a database of musical works, an important problem in the field of music information retrieval. Similarity metrics are a critical component of query-by-humming (QBH) applications which search audio and multimedia databases for strong matches to oral queries. Our model comprehensively expresses the types of error or variation between target and query: cumulative and non-cumulative local errors, transposition, tempo and tempo changes, insertions, deletions and modulation. The model is not only expressive, but automatically trainable, or able to learn and generalize from query examples. We present results of simulations, designed to assess the discriminatory…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.