Unsupervised classification to improve the quality of a bird song   recording dataset

F\'elix Michaud (ISYEB ); J\'er\^ome Sueur (ISYEB ); Maxime Le Cesne; (ISYEB ); Sylvain Haupert (ISYEB )

arXiv:2302.07560·cs.LG·February 16, 2023

Unsupervised classification to improve the quality of a bird song recording dataset

F\'elix Michaud (ISYEB ), J\'er\^ome Sueur (ISYEB ), Maxime Le Cesne, (ISYEB ), Sylvain Haupert (ISYEB )

PDF

1 Repo

TL;DR

This paper introduces a novel unsupervised labelling function that segments and classifies bird song audio units to reduce label noise in large datasets, enhancing data quality for bird sound analysis.

Contribution

It presents a new data-centric approach combining segmentation, feature extraction, and classification to automatically improve label accuracy in bird song datasets.

Findings

01

Segmentation alone increased label noise from 10% to 83% depending on species.

02

The labelling function reduced label noise by up to three times.

03

Validated on 44 West-Palearctic bird species.

Abstract

Open audio databases such as Xeno-Canto are widely used to build datasets to explore bird song repertoire or to train models for automatic bird sound classification by deep learning algorithms. However, such databases suffer from the fact that bird sounds are weakly labelled: a species name is attributed to each audio recording without timestamps that provide the temporal localization of the bird song of interest. Manual annotations can solve this issue, but they are time consuming, expert-dependent, and cannot run on large datasets. Another solution consists in using a labelling function that automatically segments audio recordings before assigning a label to each segmented audio sample. Although labelling functions were introduced to expedite strong label assignment, their classification performance remains mostly unknown. To address this issue and reduce label noise (wrong label…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ear-team/bambird
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.