TL;DR
This paper introduces a novel unsupervised labelling function that segments and classifies bird song audio units to reduce label noise in large datasets, enhancing data quality for bird sound analysis.
Contribution
It presents a new data-centric approach combining segmentation, feature extraction, and classification to automatically improve label accuracy in bird song datasets.
Findings
Segmentation alone increased label noise from 10% to 83% depending on species.
The labelling function reduced label noise by up to three times.
Validated on 44 West-Palearctic bird species.
Abstract
Open audio databases such as Xeno-Canto are widely used to build datasets to explore bird song repertoire or to train models for automatic bird sound classification by deep learning algorithms. However, such databases suffer from the fact that bird sounds are weakly labelled: a species name is attributed to each audio recording without timestamps that provide the temporal localization of the bird song of interest. Manual annotations can solve this issue, but they are time consuming, expert-dependent, and cannot run on large datasets. Another solution consists in using a labelling function that automatically segments audio recordings before assigning a label to each segmented audio sample. Although labelling functions were introduced to expedite strong label assignment, their classification performance remains mostly unknown. To address this issue and reduce label noise (wrong label…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
