IncompeBench: A Permissively Licensed, Fine-Grained Benchmark for Music Information Retrieval

Benjamin Clavi\'e; Atoof Shakir; Jonah Turner; Sean Lee; Aamir Shakir; Makoto P. Kato

arXiv:2602.11941·cs.IR·February 13, 2026

IncompeBench: A Permissively Licensed, Fine-Grained Benchmark for Music Information Retrieval

Benjamin Clavi\'e, Atoof Shakir, Jonah Turner, Sean Lee, Aamir Shakir, Makoto P. Kato

PDF

Open Access

TL;DR

IncompeBench is a new, high-quality, permissively licensed benchmark dataset designed for evaluating music information retrieval systems, featuring extensive annotations and relevance judgments to facilitate progress in the field.

Contribution

The paper introduces IncompeBench, a comprehensive, annotated benchmark dataset for music retrieval, addressing the lack of high-quality evaluation resources in MIR.

Findings

01

High agreement between human annotators

02

Large dataset with over 125,000 relevance judgments

03

Publicly available datasets for research use

Abstract

Multimodal Information Retrieval has made significant progress in recent years, leveraging the increasingly strong multimodal abilities of deep pre-trained models to represent information across modalities. Music Information Retrieval (MIR), in particular, has considerably increased in quality, with neural representations of music even making its way into everyday life products. However, there is a lack of high-quality benchmarks for evaluating music retrieval performance. To address this issue, we introduce \textbf{IncompeBench}, a carefully annotated benchmark comprising $1, 574$ permissively licensed, high-quality music snippets, $500$ diverse queries, and over $125, 000$ individual relevance judgements. These annotations were created through the use of a multi-stage pipeline, resulting in high agreement between human annotators and the generated data. The resulting datasets are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Topic Modeling · Mobile Crowdsensing and Crowdsourcing