Annotation aggregation of multi-label ecological datasets via Bayesian modeling

Haoxuan Wang; Patrik Lauha; David B. Dunson

arXiv:2406.15844 · stat.ME · October 14, 2024

TL;DR

This paper introduces a Bayesian hierarchical model to effectively aggregate sparse, variable expert annotations in ecological datasets, enhancing bird species classification and uncertainty quantification in large-scale audio monitoring.

Contribution

It presents a novel Bayesian modeling method for combining expert annotations with varying accuracy, improving classification and providing performance scores.
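To make the idea of weighting annotators by accuracy concrete, here is a minimal sketch. It is not the paper's hierarchical model: it assumes each annotator's sensitivity and specificity are already known (the paper instead deals with experts of varying, unobserved accuracy), treats a single species in a single audio clip, and assumes annotators err independently. The function name `posterior_presence` and all numbers are hypothetical.

```python
import math


def posterior_presence(votes, sensitivity, specificity, prior=0.5):
    """Posterior probability that a species is present in one clip.

    Illustrative assumptions (not taken from the paper):
      votes        maps annotator id -> 1 (heard the species) or 0 (did not)
      sensitivity  maps annotator id -> P(vote = 1 | species present)
      specificity  maps annotator id -> P(vote = 0 | species absent)
      annotators make errors independently given the true state
    """
    # Start from the prior log-odds of presence.
    log_odds = math.log(prior / (1.0 - prior))
    for annotator, vote in votes.items():
        se = sensitivity[annotator]
        sp = specificity[annotator]
        if vote == 1:
            # Likelihood ratio of a positive vote: P(1 | present) / P(1 | absent)
            log_odds += math.log(se / (1.0 - sp))
        else:
            # Likelihood ratio of a negative vote: P(0 | present) / P(0 | absent)
            log_odds += math.log((1.0 - se) / sp)
    # Convert log-odds back to a probability.
    return 1.0 / (1.0 + math.exp(-log_odds))


# A reliable annotator says "present", a weaker one says "absent".
p = posterior_presence(
    votes={"A": 1, "B": 0},
    sensitivity={"A": 0.9, "B": 0.6},
    specificity={"A": 0.9, "B": 0.6},
)
```

Under these assumptions the reliable annotator's vote dominates, so the posterior stays well above one half even though the two votes disagree. A full hierarchical treatment would also infer the accuracy parameters rather than fix them.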

Findings

01

Improved bird species classification accuracy.

02

Effective uncertainty quantification for expert labels.

03

Enhanced engagement through performance scoring.

Abstract

Ecological and conservation studies monitoring bird communities typically rely on species classification based on bird vocalizations. Historically, this has been based on expert volunteers going into the field and making lists of the bird species that they observe. Recently, machine learning algorithms have emerged that can accurately classify bird species based on audio recordings of their vocalizations. Such algorithms crucially rely on training data that are labeled by experts. Automated classification is challenging when multiple species are vocalizing simultaneously, there is background noise, and/or the bird is far from the microphone. In continuously monitoring different locations, the size of the audio data becomes immense and it is only possible for human experts to label a tiny proportion of the available data. In addition, experts can vary in their accuracy and breadth of…

Code & Models

Repositories

Master-Savitar/Bayes-Species-Identification
none · Official

Taxonomy

Topics: Species Distribution and Climate Change · Statistical and Computational Modeling