Quantifying the Corpus Bias Problem in Automatic Music Transcription   Systems

Luk\'a\v{s} Samuel Mart\'ak; Patricia Hu; Gerhard Widmer

arXiv:2408.04737·cs.SD·August 12, 2024

Quantifying the Corpus Bias Problem in Automatic Music Transcription Systems

Luk\'a\v{s} Samuel Mart\'ak, Patricia Hu, Gerhard Widmer

PDF

1 Repo

TL;DR

This paper investigates how current automatic music transcription systems perform poorly on music genres and styles outside of classical piano, highlighting the corpus bias problem and its impact on generalization.

Contribution

It introduces new test sets to evaluate the effect of musical distribution shifts and quantifies the performance gap caused by corpus bias in AMT systems.

Findings

01

Significant performance drop on non-piano music genres

02

Corpus bias limits generalization of state-of-the-art AMT systems

03

Performance gap increases with greater musical distribution shift

Abstract

Automatic Music Transcription (AMT) is the task of recognizing notes in audio recordings of music. The State-of-the-Art (SotA) benchmarks have been dominated by deep learning systems. Due to the scarcity of high quality data, they are usually trained and evaluated exclusively or predominantly on classical piano music. Unfortunately, that hinders our ability to understand how they generalize to other music. Previous works have revealed several aspects of memorization and overfitting in these systems. We identify two primary sources of distribution shift: the music, and the sound. Complementing recent results on the sound axis (i.e. acoustics, timbre), we investigate the musical one (i.e. note combinations, dynamics, genre). We evaluate the performance of several SotA AMT systems on two new experimental test sets which we carefully construct to emulate different levels of musical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CPJKU/musical_distribution_shift
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.