Automated bird sound recognition in realistic settings
Timos Papadopoulos, Stephen J. Roberts, Katherine J. Willis

TL;DR
This study evaluates an automated bird sound recognition system trained on crowd-sourced data, demonstrating its effectiveness in realistic settings and highlighting the importance of certainty measures for reliable identification.
Contribution
It introduces a fully automated, crowd-sourced training approach and assesses two classification methods, emphasizing the role of certainty measures in practical bird sound identification.
Findings
Both classifiers achieved similar accuracy.
kNN offers more flexibility than SVM.
Certainty measures improve reliability of results.
Abstract
We evaluated the effectiveness of an automated bird sound identification system in a situation that emulates a realistic, typical application. We trained classification algorithms on a crowd-sourced collection of bird audio recording data and restricted our training methods to be completely free of manual intervention. The approach is hence directly applicable to the analysis of multiple species collections, with labelling provided by crowd-sourced collection. We evaluated the performance of the bird sound recognition system on a realistic number of candidate classes, corresponding to real conditions. We investigated the use of two canonical classification methods, chosen due to their widespread use and ease of interpretation, namely a k Nearest Neighbour (kNN) classifier with histogram-based features and a Support Vector Machine (SVM) with time-summarisation features. We further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnimal Vocal Communication and Behavior · Music and Audio Processing · Species Distribution and Climate Change
MethodsInterpretability
