Data Cleansing with Contrastive Learning for Vocal Note Event Annotations
Gabriel Meseguer-Brocal, Rachel Bittner, Simon Durand, Brian Brost

TL;DR
This paper introduces a contrastive learning-based data cleansing method tailored for time-varying, structured labels in music data, significantly improving vocal note event annotation accuracy and estimating annotation error rates.
Contribution
It presents a novel contrastive learning model for cleansing structured, time-varying labels in music datasets, enhancing transcription accuracy and error estimation.
Findings
Transcription accuracy improves with the proposed cleansing strategy.
The model effectively estimates annotation error rates in datasets.
Potential for broader applications in music data annotation quality control.
Abstract
Data cleansing is a well studied strategy for cleaning erroneous labels in datasets, which has not yet been widely adopted in Music Information Retrieval. Previously proposed data cleansing models do not consider structured (e.g. time varying) labels, such as those common to music data. We propose a novel data cleansing model for time-varying, structured labels which exploits the local structure of the labels, and demonstrate its usefulness for vocal note event annotations in music. %Our model is trained in a contrastive learning manner by automatically creating local deformations of likely correct labels. Our model is trained in a contrastive learning manner by automatically contrasting likely correct labels pairs against local deformations of them. We demonstrate that the accuracy of a transcription model improves greatly when trained using our proposed strategy compared with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications
MethodsContrastive Learning
